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Introduction 

The  33rd  annual  Sanibel  Symposium,  organized  by  the  faculty  and  staff  of  the 
Quantum  Theory  Project  of  the  University  of  Florida,  was  held  March  13  to  20, 

1 993,  at  the  Ponce  dc  Leon  Conference  Center  located  at  the  north  gate  of  the  city 
of  St.  Augustine,  Florida.  In  spite  of  extreme  weather  eonditions  with  high  winds, 
rain,  and  cold  temperatures  that  erippled  air  traffic  along  the  entire  eastern  seaboard, 
over  320  participants  gathered  for  8  days  of  leeturcs  and  informal  diseussions. 
Tornadoes  touched  down  at  the  conference  site  and  the  schedule  had  to  be  worked 
around  power  outages  and  telephone  interruptions,  but  the  adverse  conditions  did 
not  in  the  least  affect  the  high  quality  of  the  daily  schedule  packed  with  plenary 
sessions  in  the  morning  and  late  afternoon,  and  with  poster  presentations  in  the 
evening,  often  lasting  until  midnight. 

The  program  contained  a  special  tribute  to  the  work  of  Professor  Clemens  Root- 
haan  and  a  retrospective  on  the  Chicago  School  of  Theoretical  Chcmi.stry,  with 
some  of  the  key  members  of  that  school  representing  several  generations  of  scientists 
giving  plenary  lectures, 

'I  he  format  of  the  symposium  adopted  for  the  past  few  years  was  followed  again 
this  year  with  a  compact  8-day  schedule  with  an  integrated  program  of  quantum 
biology,  quantum  chemistry,  and  condensed  matter  physics.  The  topics  of  the  ses¬ 
sions  covered  by  these  proceedings  include  Advanced  Scientific  Computing,  Inter¬ 
action  of  Photons  and  Matter,  Quantum  Molecular  Dynamics,  Electronic  Structure 
Methods,  Polymeric  Sy.stems,  and  Quantum  Chemical  Methods  for  Extended 
Systems, 

Special  sessions  were  held  in  memory  of  Thomas  D.  Bouman  and  Michael  Schlii- 
tcr,  who  both  were  regular  participants  in  these  symposia. 

The  articles  have  been  subjected  to  the  ordinary  refereeing  procedures  of  The 
I  ntcrmitianal  Journal  of  Quantum  Chemistry.  The  articles  presented  in  the  sessions 
on  cjuantum  biology  and  associated  poster  sessions  are  published  in  a  separate 
volume  of  the  The  International  Journal  of  Quantum  Chemistry. 

The  organizers  acknowledge  the  following  sponsors  for  their  support  of  the  1993 
Sanibel  Symposium: 

•  U.S,  Army  Research  Office  (Physics)  crdec  through  Grant  DAAH04-93-G- 
0072  “The  views,  opinions,  and/or  findings  contained  in  this  reports  are  those 
of  the  authorf  s)  and  should  not  be  construed  as  an  official  Department  of  the 
Army  position,  policy,  or  decision,  unless  so  designated  by  other  documen¬ 
tation." 

•  U.S.  Environmental  Protection  Agency  through  Grant  N000I4-93-I-0343 

•  3  he  Office  of  Naval  Research  ( Physics  and  Chemistry),  through  Grant  N00014- 
93-1-0343 
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1993  John  Wiley  &  .Sons,  Inc.  CCC  0020-7608/93/01  OOix-02 
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“This  work  relates  to  Department  of  Navy  Grant  NOOO 14-93- 1-0343  issued 
by  the  Office  of  Naval  Research.  The  United  States  Government  has  a  royalty- 
free  license  throughout  the  world  in  all  copyrightable  material  contained  in 
the  publication.” 

•  U.S.  Department  of  Energy  through  Grant  DE-FG05-93ER6 1 5 

•  Digital  Equipment  Corporation 

•  IBM 

•  Kendall  Square 
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This  paper  constitutes  a  brief  personal  history  from  my  beginnings  as  a  physicist 
through  the  period  of  my  discc  very  and  consolidation  of  what  has  since  become 
the  self-consistent  field  theory  of  molecular  structure.  My  career  as  a  molecular 
theorist  began  only  a  few  short  years  after  the  most  traumatic  expenences  of  my 
life  took  place  during  World  War  11.  My  growth  as  a  scientist  did  not  stop  during 
those  difficult  years,  but  proceeded  as  if  driven  by  forces  independent  of  the  sur¬ 
rounding  events. 

It  is  only  dunng  the  last  decade  or  so  that  1  have  spoken  somewhat  more  freely 
about  the  painful  experiences  described  below.  However,  those  events  were  always 
vividly  present  in  my  mind.  I  hope  that  sharing  those  memories  with  my  friends 
and  colleagues  will  help  them  to  understand  me  better,  and  perhaps  also  the  human 
condition.  And  since  my  training  in  the  fundamentals  of  physics  occurred  during 
those  years,  my  account  may  also  offer  some  insight  into  how  learning  can,  and 
does,  occur  under  improbable  circumstances. 

But  first  1  like  to  turn  back  to  my  childhood,  when  I  experienced  the  earliest 
stirrings  of  scientific  curiosity.  I  am  certain  that  my  interest  in  science  began  when 
1  was  three  years  old.  I  did  not  understand  how  toasters,  lamps  and  irons  depended 
on  a  wire  coming  out  of  the  wall.  In  those  days,  wall  outlets  were  surface  mounted 
and  covered  by  porcelain  caps.  One  day  I  found  an  outlet  whose  cap  had  crumbled. 
I  began  to  put  my  finger  systematically  on  the  various  exposed  parts,  intent  on 
getting  some  response  from  this  mysterious  gadget.  I  finally  did,  in  the  form  of  an 
electric  shock  that  made  me  withdraw  my  finger  abruptly,  and  imbued  me  with 
great  respect  for  the  mysterious  force  1  had  just  experienced. 

This  was  one  experiment  that  1  reported  to  no  one;  I  was  brought  up  in  a  proper 
home  and  had  sense  enough  to  know  that,  whatever  I  had  done,  it  was  wrong.  The 
experience  increased  my  prudence  but  in  no  way  diminished  my  cunosity.  By  the 
age  of  seven,  1  was  conducting  less  dangerous  expenments  with  batteries,  lights, 
switches  and  electric  bells.  Radio  broadcasting  was  just  beginning  around  that  time. 
At  the  age  of  ten.  1  built  my  first  crystal  receiver  and,  throughout  my  teens,  1 
continued  experimenting  with  radio  reception  to  the  extent  that  my  meager  budget 
allowed. 

During  my  last  two  years  at  the  gymnasium  (high  school),  1  elected  the  math¬ 
ematical-scientific  curriculum  After  graduation,  I  enrolled  at  the  Technical  Uni¬ 
versity  of  Delft  in  the  Department  of  Electrical  Engineering. 
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Class  attendance  at  Delft  was  optional;  to  satisfy  requirements,  one  only  had  to 
pass  yearly  examinations.  During  my  first  two  years  there,  I  spent  my  afternoons 
in  the  shops  and  laboratories,  but  rarely  attended  lectures;  I  preferred  to  study 
textbooks  and  the  required  literature  on  my  own.  The  extensive  language  training 
mandated  in  Dutch  gymnasia  made  this  self-study  possible,  since  by  the  time  we 
graduated,  we  were  able  to  read  the  scientific  literature  in  French,  German  and 
English. 

During  my  third  year — it  was  now  1938-1939 — 1  became  dissatisfied  with  my 
academic  program.  The  electrical  engineering  curriculum  in  Delft  was  a  two-track 
program,  one  with  emphasis  on  electric  power  (transformers,  transmission  lines, 
electric  motors),  the  other  emphasizing  electronics  and  acoustics.  1  had  a  strong 
interest  in  the  latter,  but  could  not  avoid  a  number  of  mandatory  courses  in  the 
former,  which  interested  me  very  little.  I  had  heard  that  in  the  German  technical 
institutes,  it  was  possible  to  take  a  program  in  which  one  could  concentrate  fully 
on  electronics  and  acoustics.  To  explore  this  option,  I  spent  the  spring  of  1939 
auditing  classes  at  the  Technical  University  of  Karlsruhe. 

Dunng  the  summer  of  1939,  which  I  spent  at  my  parental  home  in  Nijmegen, 
all  hope  for  peace  in  Europe  evaporated.  Events  moved  swiftly.  In  July,  the  Hitler- 
Stalin  pact  was  signed.  By  mid-August,  it  was  clear  that  Poland  would  not  yield, 
that  Germany  would  attack  and  that  France  and  England  would  be  brought  into 
the  war.  On  August  29,  1 939,  the  Netherlands  government  declared  a  general  mo¬ 
bilization  in  order  to  defend  its  neutrality  as  it  had  successfully  done  during  World 
War  I. 

During  peacetime,  Dutch  university  students  were  always  permitted,  and  indeed 
encouraged,  to  complete  their  academic  studies  before  fulfilling  their  military  ob¬ 
ligation.  These  deferrals  were  now  revoked  and,  on  my  21st  birthday,  which  co¬ 
incided  with  the  date  of  the  mobilization,  I  was  inducted  into  the  Royal  Engineers 
Regiment  of  the  Netherlands  Army. 

The  officer  corps  of  that  regiment  consisted  almost  exclusively  of  graduates  of 
the  University  of  Delft.  The  army  now  faced  a  dilemma:  the  students  from  Delft 
who  had  been  inducted  should  be  given  officers  training,  but  a  prerequisite  for  the 
latter  was  prior  graduation  from  the  university. 

After  about  six  weeks,  during  which  we  were  billeted  in  Rotterdam  basically 
doing  nothin;;,  the  army  resolved  its  dilemma  by  sending  the  entire  student  con¬ 
tingent  of  the  Royal  Engineers  back  to  school  in  uniform.  We  were  billeted  in  one 
of  the  university  buildings  requisitioned  for  the  purpose.  After  an  hour  of  drill  in 
the  morning,  we  attended  classes  and  laboratories  as  normal  students  for  the  rest 
of  the  day. 

Having  concluded  earlier  that  year  that  I  did  not  want  to  continue  with  the 
electrical  engineering  program  at  Delft,  half  of  which  did  not  turn  me  on,  I  decided 
to  become  a  physicist.  So,  while  the  world  situation  around  us  became  more  ominous 
all  the  time,  I  eagerly  devoured  all  the  physics  I  could  absorb. 

The  Physics  Department  at  Delft  had  recently  acquired  R.  Kronig  as  its  resident 
theoretician.  Kronig’s  lectures,  together  with  lectures  by  H.  A.  Kramers,  who  came 
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in  once  a  week  from  neighboring  Leiden  for  that  purpose,  opened  up  the  world  of 
physics  for  me.  1  also  learned  differential  geometry  and  its  relation  to  relativity  and 
quantum  mechanics  from  J.  A.  Schouten,  who  had  pioneered  a  very  original  ao- 
proach  to  those  subjects. 

Our  lives  were  rudely  interrupted  in  the  early  morning  hours  of  May  10,  1940, 
when  the  German  attack  on  the  Netherlands  and  Belgium  was  launched — as  was 
Nazi  custom,  with  no  declaration  of  war.  The  only  declaration  of  war  by  Germany 
in  World  War  11  was  against  the  United  States  after  Pearl  Harbor,  direct  invasion 
not  being  an  option. 

Our  group  of  student  soldiers,  who  had  had  no  more  training  than  morning  drills 
for  about  six  months,  and  who  had  never  fired  a  gun,  was  now  deployed  around 
Delft  to  cope  with  German  paratroopers  and  glider  convoys  descending  on  an 
airfield  a  few  kilometers  out  of  town  close  to  The  Hague,  the  seat  of  the  government. 

I  was  assigned  to  guard  one  of  the  canal  bridges  on  the  outskirts  of  Delft.  Quaking 
in  my  boots  one  night,  I  fired  my  only  shot  of  the  war,  missing  a  moving  shadow 
that  later  turned  out  to  be  a  goat  grazing  in  a  nearby  meadow.  The  paratroopers 
were  contained  by  more  experienced  forces,  thereby  permitting  the  Dutch  govern¬ 
ment  to  escape  to  England. 

After  four  days,  The  Netherlands  was  crushed  by  the  overwhelming  military 
might  of  Germany.  !t  was  utterly  depressing.  Nevertheless,  despite  our  defeat,  we 
believed  in  an  ultimate  allied  victory,  and  continued  to  believe  so  even  after  the 
narrow  escape  of  the  British  Expeditionary  Forces  from  Dunkirk  and  the  subsequent 
fall  of  France. 

Initially,  the  German  occupation  of  the  Netherlands  was  relatively  benign.  Upon 
demobilization,  the  Dutch  army  was  allowed  to  return  to  civilian  life  rather  than 
being  interned  in  POW  camps.  Thus,  in  the  fall  of  1940, 1  returned  to  Delft  as  a 
civilian  graduate  student  in  physics. 

The  Physics  Department  had  recently  begun  to  use  teaching  assistants,  and  I  had 
the  good  fortune  to  be  selected  to  assist  in  an  advanced  laboratory  where  several 
classical  experiments  were  set  up  for  student  training.  Among  these  were  the  de¬ 
termination  of  e/m  by  subjecting  an  electron  beam  to  balancing  electrostatic  and 
magnetic  deflections,  and  the  famous  Milliken  oil  drop  experiment  for  the  deter¬ 
mination  of  e.  The  laboratory  was  directed  by  a  distinguished  lady.  Dr.  H.  van 
Leeuwen,  who  had  been  a  student  and  research  assistant  with  H.  A.  Lorentz.  In 
today’s  academic  environment.  Miss  van  Leeuwen  would  undoubtedly  occupy  a 
prestigious  chair. 

Gradually  the  German  occupiers  tightened  the  noose  and  began  to  interfere  in 
virtually  every  aspect  of  life.  The  food  stores  that  had  been  stockpiled  by  the  Dutch 
government  to  be  prepared  for  a  blockade  were  now  carted  off  to  Germany.  The 
first  measures  against  Jews  were  instituted,  beginning  with  registration,  followed 
by  restriction  of  movement  and  exclusion  from  gainful  employment.  In  the  spring 
of  1 94 1 ,  university  students  were  required  to  sign  a  loyalty  oath  stating  that  they 
would  not  undertake  any  acts  of  resistance  or  sabotage  against  the  occupier. 

We  students  considered  it  dishonorable  to  sign  such  a  declaration  and — perhaps 
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naively — decided  to  register  a  protest  by  boycotting  classes.  In  retaliation,  the  Ger¬ 
man  authorities  closed  down  the  universities  of  Leiden  and  Delft.  Leiden  remained 
closed  during  the  entire  war.  but  Delft  was  allowed  to  reopen  a  few  months  later. 

Once  again,  after  a  relatively  brief  interruption.  1  was  able  to  resume  my  studies 
and  to  continue  as  a  student  and  teaching  assistant  in  physics  for  about  two  more 
years.  Fortunately,  the  physics  faculty  remained  intact  and  library  resources  were 
more  than  adequate  despite  the  absence  of  new  acquisitions. 

By  late  1 942,  the  allies  were  gaining  strength  and  we  were  heartened  by  news  of 
major  German  defeats  in  North  Africa  and  Stalingrad.  However,  the  occupation 
Itself  was  becoming  increasingly  oppressive.  Dutch  resistance  gradually  stiffened, 
culminating  in  a  series  of  strikes  during  the  month  of  April.  1943. 

The  Germans  responded  by  declaring  martial  law.  Young  men  were  now  grabbed 
off  the  streets,  routinely  and  at  random,  to  be  carted  off  as  slave  labor  in  German 
factories.  Since  university  communities  would  be  pnmr  targets  for  this  practice.  I 
decided  to  clear  out  of  Delft  fora  while  and  he  low  at  my  family's  home  in  Nijmegen. 
For  the  same  reason,  my  youngest  brother  John  returned  from  Groningen  where 
he  was  a  medical  student;  Victor,  our  middle  brother,  was  still  living  in  Nijmegen 
where  he  was  studying  sociology. 

When  the  three  of  us  were  together,  Victor  informed  us  that  he  was  engaged  with 
a  group  of  friends  in  underground  activities.  Shortly  thereafter,  on  Apnl  30,  1943, 
he  received  a  phone  call  alerting  him  that  several  members  of  his  group  had  been 
arrested  by  the  security  police  (Sicherheitsdienst  or  SD). 

Victor  promptly  took  off  on  his  bicycle.  John  and  1  reasoned  that  the  SD  did 
not  have  anything  on  us;  foolishly,  we  decided  to  stay  put.  A  few  hours  later,  the 
SD,  reinforced  by  collaborationist  Dutch  Police,  showed  up.  Unimpressed  by  our 
declarations  of  non-mvolvement,  they  immediately  placed  us  under  arrest.  They 
never  found  Victor,  who  managed  to  remain  in  hiding  throughout  the  war. 

John  and  I  spent  the  night  in  the  police  lockup  in  Nijmegen.  At  dawn  we  were 
put  on  an  army  truck  to  take  us  to  a  military  court  in  Hengelo.  about  90  km  away. 
On  the  truck  we  met  four  of  Victor’s  co-conspirators — one  laborer.  Mannus,  and 
three  university  students.  Later  that  afternoon,  the  three  students  were  sentenced 
to  the  firing  squad  (they  were  shot  the  next  morning),  and  Marinus  was  given  a 
long  term  in  concentration  camp,  while  John  and  1  were  relegated  to  the  police 
lockup  in  Hengelo. 

We  stayed  in  that  police  lockup  for  three  and  a  half  months.  We  managed  to  get 
books  sent  to  us  from  our  respective  university  libraries,  and  tried  to  keep  our 
minds  focused  on  our  studies.  I  remember  struggling  through  Hermann  Weyl's 
“Raum,  Zeit  und  Materie”. 

We  became  relatively  permanent  fixtures  in  the  lockup;  most  “guests”,  comprising 
common  criminals,  black  marketeers,  political  dissenters,  and  Jews  who  had  been 
flushed  out  of  hiding,  stayed  only  a  few  days.  I  remember  seven  Jews  who  came  in 
dunng  that  period..  Three  managed  to  commit  suicide  in  their  cells;  John  and  1 
carried  out  their  bodies. 
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On  August  13,  1943.  John  and  i  were  suddenly  removed  from  the  lockup  and 
transferred  to  the  concentration  camp  at  Vught  (between  ’sHertogenbosch  and 
Eindhoven).  We  were  never  convicted  of  anything — this  was  simply  “Schutzhaft” 
(protective  custody).  We  were  to  stay  there  until  September  5,  1944. 

In  Vught,  we  found  new  opportunities  to  keep  our  academic  interests  alive.  John, 
as  a  medical  student,  managed  to  capture  the  much-coveted  position  of  hospital 
orderly.  Miraculously,  I  got  back  into  physics. 

Vught  is  not  far  from  Eindhoven,  which  was  then,  as  it  is  now,  the  company 
town  of  the  electronics  giant  N.  V.  Philips.  The  top  management,  including  the 
son-in-law  of  the  company  founder,  had  been  evacuated  by  the  Dutch  army  to 
England,  where  they  could  continue  to  direct  Philips’  worldwide  operations  for  the 
benefit  of  the  allied  cause.  However,  other  managers — including  the  founder’s  son — 
had  been  left  behind  in  Eindhoven.  Now  they  were  forced  by  the  Germans  to 
produce  electronic  components  and  devices  for  the  German  war  effort.  At  the  same 
time,  the  ranks  of  Philips  employees  were  repeatedly  subjected  to  arbitrary  drafts 
of  personnel  to  serve  as  slave  labor  in  German  factories. 

The  executives  of  Philips  then  suggested  to  their  German  overlords  that  they  be 
permitted  to  tap  the  pool  of  labor  in  the  nearby  camp  of  Vught.  This  proposal  was 
received  favorably,  and  thus  Philips  was  encouraged  to  set  up  manufacturing  facilities 
inside  the  camp.  Working  in  the  “Philips  Commando”  was  considered  a  great 
privilege. 

Although  the  Philips  Commando  employed  only  a  small  fraction  of  the  camp 
population,  its  mere  presence  improved  the  ambiance  of  the  camp  significantly. 
Each  morning  a  group  of  Philips  managers  came  into  the  camp  to  direct  the  man¬ 
ufacturing  operations,  namely  of  electric  razors  and  short-wave  radios.  The  radios 
also  had  to  be  tested  at  the  end  of  the  assembly  line,  providing  us  with  an  excellent 
source  of  “forbidden”  news  on  a  daily  basis. 

For  the  academically  inclined,  the  Philips  management  created  the  “Computation 
Chamber”  in  which  about  two  dozen  professors,  students,  and  scientists  from  in¬ 
dustry  got  various  research  assignments.  1  was  invited  to  join  that  group. 

My  first  assignment,  as  a  member  of  the  Chamber,  was  to  carry  out  a  Kronig- 
Penney  calculation  of  the  Schroedinger  equation  for  a  one-dimensional  finite  crystal. 
This  constituted  in  a  sense  my  first  experience  as  a  practising  quantum  theoretician. 
When  that  was  finished,  complete  with  beautifully  drawn  graphs  of  wavefunctions 
and  charge  densities.  I  was  given  a  much  more  ambitious  project,  consisting  of  a 
calculation  of  elastic  constants  in  a  classical  crystal,  where  the  interaction  between 
lattice  sites  consisted  of  central  as  well  as  non-central  forces. 

This  problem  had  been  formulated  by  Dr.  Druivesteyn  of  Philips,  who  had  con¬ 
sulted  with  my  professors  in  Delft  to  give  me  a  project  that  would  further  my 
academic  program.  To  learn  the  field,  I  struggled  through  Max  Born’s  monograph 
“Dynamik  der  Kristallgitter”.  My  assignment  was  completed,  and  a  final  report 
transmitted  to  the  Philips  Management  in  Eindhoven,  a  few  days  before  the  camp 
was  evacuated  on  September  5.  1944.  After  the  war,  this  work  was  accepted  in 
Delft  as  my  master’s  thesis. 
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The  evacuation  of  camp  Vught  conformed  to  standard  Nazi  practice  that  no 
concentration  camp  inmates  should  ever  be  liberated.  The  allied  armies  were  poised 
for  a  push  northward  just  a  few  dozen  kilometers  to  the  south  of  us.  Unfortunately 
for  us.  that  push  occurred  1 2  days  later. 

On  the  day  of  the  evacuation,  about  500  prisoners  who  had  been  in  solitary 
confinement  were  executed.  The  remaining  few  thousand  “less  dangerous”  inmates 
were  crammed  into  railroad  cattle  cars,  and  began  a  three-day  journey  which  ended 
on  September  8.  1 944  at  the  camp  “Sachsenhausen”.  also  called  “Oranienburg” 
after  the  nearby  Berlin  suburb  of  that  name. 

The  world  of  concentration  camps  reminds  me  of  Dante’s  description  of  the 
seven  levels  of  Hell.  At  the  deepest  level  were  the  extermination  camps  with  gas 
chambers,  and  also  some  camps  nicknamed  “Nacht  und  Nebel”  (Night  and  Fog) 
from  which  virtually  no-one  came  out  alive.  Sachsenhausen — like  Dachau  and 
Buchenwald — was  somewhere  in  the  middle;  I  estimate  the  survival  rate  at  15- 
20%.  Vught  had  been  comparatively  benign,  due  in  no  small  part  to  the  presence 
of  the  Philips  civilians. 

In  Sachsenhausen  1  had  to  suspend  my  pursuit  of  physics  and  focus  every  ounce 
of  energy  on  survival.  Bruno  Bettelheim  attributed  his  own  survival  to  his  will  to 
live  and  to  his  understanding  of  the  sociology  of  the  camp  structure.  1  concur  that 
these  were  necessary  prerequisites,  but  they  were  by  no  means  sufficient;  in  order 
to  survive  one  also  needed  a  good  deal  of  luck. 

I  judged  that  being  a  university  student  would  be  of  little  value  in  landing  a  good 
job.  Thus,  at  intake  registration  1  declared  that  1  was  an  electrician.  My  brother 
John  professed  to  be  a  medical  orderly.  After  a  short  while.  1  landed  a  good  job 
witha  “Baukommando”  (construction  commando),  where  1  started  as  a  draftsman 
and  later  served  as  an  electrician.  Unfortunately.  John  ended  up  in  menial  work. 

The  winter  of  1944-1945  was  harsh.  Every  morning  at  4:00  a.m.,  we  had  to 
brave  the  elements  standing  at  attention  for  about  one  and  a  half  hours  at  “Appell” 
(roll  call),  weanng  no  more  than  flimsy  indoor  clothing.  After  roll  call,  the  prisoners 
regrouped  to  march  off  to  arrive  at  6:00  a.m.  at  their  regular  work  assignments. 

By  the  end  of  January  1945  the  Russian  army  was  on  German  soil,  less  than  60 
km  from  Berlin  and  our  camp.  The  German  army  dug  in  for  a  tenacious  defense 
of  their  capital  and,  on  February  1 ,  1 945,  camp  officials  began  a  gradual  evacuation 
of  Sachsenhausen.  moving  inmates  to  the  north  and  west  in  order  to  prevent  lib¬ 
eration  of  the  prisoners  by  the  advancing  Russians.  My  brother  John  went  with 
one  of  these  transports  to  the  infamous  camp  Bergen-Belsen.  where  he  perished 
from  deprivation  and  typhoid  in  late  March. 

I  stayed  in  Sachsenhausen  until  the  final  evacuation  on  April  20  when,  guarded 
by  SS  troops,  with  the  Russian  army  at  our  heels,  we  began  a  forced  march  which 
lasted  12  days  and  covered  160  km.  The  SS  had  made  no  provision  for  feeding  or 
billeting  the  tens  of  thousands  of  undernourished  miserable  victims  whom  they 
were  now  driving  like  a  herd  of  cattle  to  nowhere,  either  to  die  along  the  road  or, 
probably,  be  drowned  in  the  Baltic  Sea.  .\nyone  whose  strength  gave  out  and  sat 
down  was  promptly  shot.  I  have  made  a  rough  guess  that  at  least  one-third  of  us 
perished.  My  own  life  was  saved  in  large  measure  by  a  fellow  prisoner  who.  as  a 
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long-time  inmate  of  Sachsenhausen,  had  been  better  connected  than  1  and  had  thus 
been  able  to  acquire  a  stockpile  of  food  which  he  generously  shared  with  me. 

After  about  one  week  of  this  life-and-death  struggle,  small  rays  of  hope  began  to 
break  through.  Swiss  staffers  from  an  International  Red  Cross  depot  in  the  Baltic 
port  of  Liibeck  courageously  took  matters  in  their  own  hands.  They  loaded  their 
trucks  with  food  parcels  which  had  been  destined  for  POW  camps,  and  supplied 
any  prisoners  they  could  find  on  the  roads  with  their  precious  life-saving  gifts. 

The  Red  Cross  representatives  also  convinced  our  SS  guards — who  had  by  now 
been  abandoned  by  their  officers,  and  were  quite  demoralized — to  let  collapsing 
prisoners  be  taken  on  the  Red  Cross  trucks.  Fearful  of  retribution  by  the  prisoners, 
who  at  this  point  outnumbered  the  guards  about  fifty  to  one,  our  remaining  guards 
gradually  disappeared  from  sight.  Many  were  Eastern  Europeans  who  had  been 
forced  into  service  and  who  were  eager  to  return  to  their  homelands.  As  we  now 
continued  on  our  own,  we  were  joined  by  a  new  group:  German  soldiers  desperately 
rushing  to  the  West  to  avoid  capture  by  the  Russians.  Our  march  ended  about 
halfivay  between  Berlin  and  Denmark  in  Schwerin,  the  regional  capital  of  Meck¬ 
lenburg,  where  we  met  American  troops.  It  was  May  2,  1945. 

I  was  repatriated  to  the  Netherlands  in  late  May — I  do  not  remember  the  exact 
date.  After  resting  up  for  a  few  weeks,  I  returned  to  Germany  to  see  if  I  could  find 
some  trace  of  my  brother  John.  After  amval  in  the  Bergen-Belsen  area,  it  became 
obvious  to  me  that  he  could  not  possibly  have  survived.  Later  investigation  turned 
up  testimony  from  a  few  survivors  that  he  had  indeed  succumbed  about  five  weeks 
before  war’s  end. 

I  returned  to  Delft  about  September  1,  1945.  and  was  officially  granted  my  di¬ 
ploma  of  “Ingenieur”,  roughly  equivalent  to  a  Master’s  degree,  on  October  15.  I 
had  by  then  applied  for,  and  obtained,  a  postgraduate  fellowship  at  the  University 
of  Chicago.  I  arrived  in  Chicago  on  January  6,  1 946. 

Chicago  was  then  the  most  exciting  place  to  be  for  a  young  physicist.  It  boasted 
the  famous  site  of  the  first  self-sustaining  nuclear  chain  reaction;  among  the  physics 
faculty  were  Enrico  Fermi,  Edward  Teller,  Maria  Goeppert-Mayer,  and  Robert 
Mulliken.  Nuclear  research  was  of  course  de  rigueur.  However,  since  nuclear  research 
was  then  still  cloaked  in  heavy  secrecy,  I  concluded  that  as  an  alien  I  would  be 
better  off  in  a  program  that  was  not  encumbered  by  government-imposed  restric¬ 
tions.  I  applied  to  and  was  accepted  by  Robert  Mulliken  as  a  research  student  to 
pursue  the  study  of  molecular  structure  and  spectra. 

My  studies  at  Delft  had  given  me  an  excellent  foundation  in  mechanics,  electricity 
and  magnetism,  thermodynamics  and  statistical  mechanics,  and  of  course  advanced 
calculus  and  other  subjects  in  mathematics.  However,  I  had  not  learned  much 
about  nuclear  physics,  quantum  mechanics  and,  in  modem  mathematics,  linear 
algebra  and  group  theory.  I  spent  the  period  between  January  1946  and  June  1947 
filling  in  some  of  these  gaps,  as  well  as  carrying  out  semiempirical  molecular  orbital 
calculations  on  several  simple  organic  molecules  with  “alphas  and  betas’’. 

I  passed  the  Ph.D.  candidacy  examination  in  physics  in  June  1 947.  Shortly  there¬ 
after  Professor  Mulliken  suggested  semiempirical  MO  calculations  on  substituted 
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benzenes  as  my  thesis  subject.  Through  the  good  offices  of  Maria  Mayer,  1  also 
received  at  that  time  an  offer  from  Professor  Karl  Herzfeld  of  the  Catholic  University 
of  America  in  Washington,  DC,  to  join  the  physics  faculty  as  an  instructor.  I  accepted 
that  offer  and  spent  the  next  two  years  teaching  in  Washington  while  working  on 
my  thesis,  making  frequent  trips  to  Chicago  to  confer  with  Mulliken. 

I  began  with  summarizing  the  theoretical  foundations  of  the  calculations  1  was 
to  perform.  Mulliken  expressed  his  approval  and  praise  for  the  first  account  he 
received.  However.  I  was  not  satisfied  with  the  current  state  of  the  art.  All  MO 
calculations  started  with  a  “one-electron  Hamiltonian”,  but  1  could  not  find  a 
definition  of  the  latter  anywhere  in  the  literature. 

I  realized  that  a  similar  problem  had  been  faced  for  atoms  by  Hartree  and  Fock 
[1].  At  least  for  closed-shell  atoms,  there  was  a  clear  definition  of  the  one-electron 
Hartree-Fock  Hamiltonian;  it  emerged  in  the  process  of  applying  the  A-electron 
variation  principle  to  a  Slater  determinant  of  orbitals  which  had  to  be  optimized. 
Furthermore,  F.  Seitz  and  others  had  extended  the  model  to  deal  with  solids,  albeit 
again  limited  to  a  closed-shell  system  [2].  Clearly,  the  mode!  was  also  applicable 
to  molecules,  most  of  which  are.  conveniently,  closed-shell  systems.  The  difficulty 
with  standard  MO  theory  was  that  a  mystical  one-electron  Hamiltonian  was  adopted 
before  anything  at  all  could  be  written  down. 

The  solution  to  the  puzzle  came  to  me  in  June  1948. 1  had  returned  from  Wash¬ 
ington  to  Chicago  for  the  surr.mer  to  work  intensively  on  my  thesis.  I  went  with  a 
group  of  friends,  including  my  future  wife,  to  a  concert  by  the  Chicago  Symphony 
Orchestra  at  its  open-air  summer  home  in  Ravinia  Park,  north  of  Chicago.  Lying 
in  the  grass,  listening  to  the  music  and  looking  at  the  stars,  it  suddenly  flashed 
through  my  head  that  standard  MO  theory  did  not  start  with  the  right  question. 
Just  like  Hartree,  Fock  and  Seitz  had  applied  the  variation  principle  to  an  N-  lectron 
Slater  determinant  of  yet  to  be  determined  orbitals,  one  should  start  the  same  way 
with  molecules,  with  the  additional  constraint  that  the  MOs  are  constructed  as 
linear  combinations  of  atomic  orbitals  (LCAO).  Nowadays  we  prefer  to  say  that 
the  MOs  are  constructed  from  a  predetermined  basis  set. 

The  LCAO  approximation  for  the  MOs  was  of  course  not  new;  it  had  been  used 
for  years  by  Mulliken,  Lennard-Jones  and  others  in  semi-empirical  calculations. 
However,  entering  this  approximation  formally  into  the  A-electron  variation  prin¬ 
ciple  provided  a  new  and  much  better  mathematical  point  of  departure  for  electronic 
structure  calculations  on  molecules. 

When  1  communicated  my  insight  to  Mulliken,  he  fully  understood  its  signifi¬ 
cance.  It  is  a  testimony  to  his  stature  as  a  scientist  and  human  being  that  he  whole¬ 
heartedly  supported  that  1  would  now  dedicate  myself  primarily  to  working  out  the 
details  of  what  1  then  called  the  LCAO-SCF  theory,  rather  than  the  thesis  topic 
originally  assigned. 

Having  returned  to  Washington  in  the  fall  of  1948, 1  continued  working  out  the 
details  of  the  LCAO-SCF  theory  during  the  following  Academic  year.  The  first 
three  chapters  were  completed  in  late  1948,  consisting,  respectively,  of  an  intro¬ 
duction,  a  derivation  of  the  closed-shell  molecular  Hartree-Fock-SCF  equations. 
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and  the  analogous  derivation  of  the  LCAO-SCF  equations  [3].  Publication  in  a 
journal  had  to  be  postponed  until  after  the  Ph.D.  degree  was  granted  and  the  Ph.D. 
committee  had  approved  the  final  version  of  my  dissertation. 

In  June  1949  1  returned  to  Chicago,  this  time  as  a  research  associate,  for  a  full 
year.  That  period  in  Chicago  proved  even  more  stimulating  and  exciting  than  my 
earlier  stays  in  Mulliken’s  laboratory.  I  had  the  privilege  of  interacting  with  many 
young  physicists  and  theoretical  chemists  who  came  to  quench  their  thirst  at  this 
watering  hole.  1  recall  many  stimulating  discussions  with  Jake  Bigeleisen,  Mike 
Kasha,  Christopher  Longuett-Higgins,  A1  Matsen,  Harden  McConnell,  Bob  Parr, 
and  Bill  Price.  I  thank  them  for  the  many  good  conversations  we  had. 

During  that  year,  the  remaining  four  chapters  of  my  thesis  were  completed  [4] . 
These  four  chapters  covered,  respectively,  ionization  and  excitation  processes,  the 
use  of  symmetry  in  the  Hartree-Fock-SCF  model,  the  use  of  symmetry  in  the 
LCAO-SCF  model,  and  an  LCAO-SCF  calculation  of  the  benzene  molecule.  The 
thesis  was  accepted  and  approved  by  the  committee  in  June  1950. 

The  last  chapter  of  my  thesis,  the  LCAO-SCF  calculation  of  benzene,  constituted 
only  a  crude  and  approximate  application  of  the  new  theoretical  framework.  This 
was,  however,  all  that  could  be  done  within  a  reasonable  time  frame,  due  to  the 
unavailability  of  adequate  computational  tools  in  those  days.  The  inclusion  of  this 
last  chapter  also  put  to  rest  the  reservations  as  expressed  by  one  of  the  committee 
members,  “never  mind  all  those  beautiful  formalities — what  can  you  really  cal¬ 
culate?”  In  the  published  version  of  my  thesis  [  5  ]  I  decided  to  omit  this  last  chapter, 
since  1  was  convinced  that  much  better  and  more  interesting  and  convincing  cal¬ 
culations  would  be  carried  out  within  a  few  years. 

While  our  work  in  Chicago  was  available  prior  to  official  publication  to  our 
colleagues  worldwide  through  our  Technical  Reports  [5a],  we  were  not  always 
informed  about  developments  in  other  laboratories  as  they  were  occurring.  Thus, 
while  I  was  working  out  the  LCAO-SCF  theory,  I  was  not  aware  of  the  parallel 
development  in  Cambridge,  England,  by  Lennard-Jones  and  Hall  [6-9].  A  com¬ 
parison  between  these  two  separate  developments  seems  in  order. 

The  two  developments  show  a  great  deal  of  similarity,  both  in  subject  matter 
and  in  methodology.  This  is  evidenced  in  particular  by  the  adoption  of  a  single 
Slater  determinant  to  describe  the  ground  state  of  a  closed-shell  molecule;  the  trans- 
formability  of  the  one-electron  functions;  the  application  of  the  A-electron  variation 
principle  to  derive  the  molecular  Hartree-Fock-SCF  equations  as  well  as  the  LCAO- 
SCF  equations;  the  exploitation  of  symmetry;  and  the  treatment  of  ionization  and 
excitation  processes. 

The  most  significant  difference  between  the  two  developments  is  in  the  inter¬ 
pretation  and  exploitation  of  the  LCAO-SCF  equations.  Hall  observed  that  these 
equations  simplify  considerably  if  M,  the  number  of  atomic  orbitals,  equals  W,  the 
number  of  occupied  molecular  orbitals  [9] .  Restricting  himself  to  that  special  case, 
A/  =  A.  he  then  derived  useful  relations  between  ionization  potentials  of  related 
chemical  compounds.  On  the  other  hand,  dealing  with  the  general  case  where 
M>  N,  I  noted  that  these  equations  exhibit  a  striking  resemblance  to  the  Hartree- 
Fock  equations.  I  proposed  that  these  equations  be  adopted  to  calculate  wavefunc- 
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lions  for  molecules  in  actual  practice,  using  an  iterative  procedure,  henceforth  to 
be  called  the  “LCAO-SCF  method”  [3]. 

After  obtaining  my  degree,  I  was  invited  to  join  the  faculty  of  the  Physics  De¬ 
partment  of  the  University  of  Chicago,  and  I  accepted.  With  strong  support  from 
Robert  Mulliken  and  his  junior  colleague  John  R.  Platt,  I  now  embarked  on  a 
broad-based  program,  using  my  thesis  as  a  blueprint,  to  develop  the  tools  necessary 
for  calculating  all-electron  wavefunctions  of  atoms  and  molecules  routinely  and 
accurately.  Henceforth,  the  work  of  Mulliken.  Platt  and  myself  would  be  con¬ 
ducted  under  the  auspices  of  the  “Laboratory  for  Molecular  Structure  and 
Spectra”  (LMSS). 

My  first  major  task  was  lO  systematize  the  calculation  of  one-  and  two-electron 
integrals  over  the  basis  functions  from  which  the  MOs  are  constructed.  This  work 
got  a  significant  boost  when  I  was  joined  by  Klaus  Riidenberg,  who  had  come  to 
Chicago  from  Zurich  with  his  mentor.  Professor  Gregor  Wentzel,  who  had  joined 
our  physics  faculty  in  1948. 

A  comprehensive  integrals  project  which  would  cover  polyatomic  molecules  made 
up  of  any  atoms  in  the  periodic  table  was  clearly  too  ambitious.  Thus  we  settled 
for  diatomic  molecules  consisting  of  atoms  in  the  first  two  rows  of  the  periodic 
table.  We  furthermore  adopted  the  Slater-type  basis  functions  as  the  most  suitable 
building  blocks  for  the  construction  of  MOs.  Klaus  tackled  the  mathematically 
most  demanding  class  of  these  integrals,  namely  the  two-center  two-electron  ex¬ 
change  integrals;  I  dedicated  myself  to  the  remaining  types. 

A  second  major  task  was  to  prepare  ourselves  to  use  digital  computers,  first  for 
calculating  the  integrals  we  were  working  on,  and  later  for  developing  programs 
implementing  other  mathematical  procedures  which  crop  up  in  calculations  on 
atoms  and  molecules.  This  was  no  small  undertaking,  since  the  first  computers 
available  to  us  had  no  floating  point  hardware,  while  the  systems  software  consisted 
of  binary  input  and  output  routines  only;  compilers,  and  even  assemblers,  were 
still  in  the  future. 

Many  of  the  tools  needed  by  LMSS  in  this  broad-based  program  were  contributed 
by  energetic  dedicated  research  associates  and  graduate  students.  Prominent  among 
the  former  were  Reinhart  Ahlrichs,  Paul  Cade,  Enrico  dementi,  Serafin  Fraga, 
Juergen  Hinze,  Sigeru  Huzinaga,  Wlodzimierz  Kolos,  Sigrid  Peyerimhoff,  Bernie 
Ransil,  Robert  Sack  and  Lutek  Wolniewicz;  and  among  the  latter,  Paul  Bagus, 
Howard  Cohen,  John  Detrich,  Winnifred  Huo,  Yong-ki  Kim,  Gulzari  Malli,  Doug 
McLean,  Charles  Scherr,  Chris  Wahl,  Andy  Weiss,  Bill  Worley  and  Megumu  Yo- 
shimine. 

While  the  first  LCAO-SCF  model  could  deal  satisfactorily  with  closed-shell  sys¬ 
tems,  an  analagous  satisfactory  treatment  of  open-shell  systems  remained  elusive 
for  quite  a  while.  In  fact,  this  problem  was  endemic  in  virtually  all  SCF  formulations, 
and  had  been  side-stepped  by  Hartree  for  atoms  by  introducing  somewhat  artificial 
approximations  in  implementing  the  A-electron  variation  principle  [1]. 

The  solution  to  this  puzzle,  which  1  discovered  in  1958,  was  another  instance 
where  one  had  to  ask  the  right  question:  the  variation  principle  had  to  be  modified 
so  that  all  degenerate  A^-electron  wavefunctions,  which  jointly  represent  the  open 
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shell,  participate  equally  in  the  variational  process  [10].  While  this  extension  of 
LCAO-SCF  theory  did  not  lead  to  a  simple  scheme  for  all  conceivable  cases,  nev¬ 
ertheless  a  large  class  of  open-shell  problems  now  became  easily  accessible. 

In  1962  1  became  director  of  the  University  of  Chicago  Computation  Center;  I 
held  that  position  until  1968.  Since  then  I  have  spent  considerable  time  and  effort 
on  the  design  and  development  of  computer  hardware  and  software  for  large-scale 
scientific  calculations.  However,  I  never  lost  interest  in  physics  and  quantum  chem¬ 
istry.  During  the  mid-1970s  1  worked  intensively,  in  collaboration  with  my  former 
student  John  Detrich,  on  relativistic  effects  in  atoms  and  molecules  [11,12],  and 
from  about  1980,  again  with  John  Detrich,  on  the  MCSCF  (multi-configuration 
self-consistent  field)  model  of  electronic  structure  [13,14].  I  expect  to  continue 
alternating  between  physics,  quantum  chemistry,  and  computer  development  for 
the  foreseeable  future. 

In  conclusion,  I  want  to  thank  Professor  Orville-Thomas  for  his  kindness  and 
patience,  and  for  inviting  me  to  write  this  account.  1  also  thank  all  my  friends  and 
colleagues  wh''  have  contributed  to  this  issue  of  “Theochem”. 
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Abstract 


Compact  expressions  are  presented  for  the  3n- j  symbols,  where  I  <  n  <  4,  which  feature  sums  over 
products  of  binomial  coefficients,  and  certain  integer  tnanguiar  coefficients.  Tbe  triangular  coefficients 
in  turn  can  be  expressed  as  products  of  binomial  coefficients  Thus  in  Ibe  formulas  presented  for  the 
3n-j  symbols,  the  dependence  on  numerous  factorials,  formally  as  well  as  computationally,  has  been 
completely  eliminated.  While  formulas  which  incorporate  summations  over  products  of  binomial  coef¬ 
ficients  have  been  known  for  the  3- j  and  6-  j  symbols,  the  introduction  of  the  tnanguiar  coefficients,  and 
the  application  of  the  binomial/tnangular  scheme  to  3n- j  symbols  with  n  >  2.  provide  important  new 
results  The  new  formulas  are  simpler,  and  they  permit  more  efficient  computations  of  the  3n  -j  symbols, 
both  in  exact  and  in  floating  point  format,  than  most  schemes  which  are  currently  in  use.  g.  1993  John 
Wiley  &  Sons.  Inc 


Introduction 

The  algebra  of  angular  momenta  occupies  a  central  position  in  the  quantum 
mechanical  description  of  numerous  physical  systems  and/or  phenomena.  The 
structure  of  atoms  and  nuclei,  the  rotational  spectra  of  molecules,  and  many  scat¬ 
tering  processes  provide  typical  examples.  In  practical  calculations  one  always  en¬ 
counters  the  so-called  3n-j  symbols,  where  n  =  1.  2,  ... ;  they  arise  from  the 
coupling  and  recoupling  of  angular  momenta,  and  from  the  evaluation  of  matrix 
elements  of  irreducible  tensors.  For  the  details  of  the  definitions  and  applications 
we  refer  to  the  standard  literature  [1-15]. 

Racah's  Formulas  for  3-j  and  6-y  Symbols 

In  explicit  calculations  of  the  3-  j  and  6-  j  symbols  the  most  commonly  used 
formulas  are  slight  modifications  of  the  famous  formulas  of  Racah  [3].  We  adhere 
to  prevailing  practice,  while  introducing  slight  changes  in  notation  to  suit  our  pur¬ 
poses;  specifically  we  define  the  3-  j  and  6-  j  symbols  by 
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The  sums  in  Eqs,  ( I,  2)  arc  limited  by  the  requirement  that  the  arguments  of  the 
factorials  must  be  nonnegative.  It  should  be  noted  that  the  essence  of  Eq.  ( I )  was 
first  given  by  van  dcr  Waerden  [I],  a  fact  which  has  often  been  overlooked. 

Note  that  a,  h,  c,  d,  e,  f  are  y-quantum  numbers:  they  must  be  nonnegative 
integer  or  half  integer.  The  /-triples  (a,b,c),  (a,e,f),  id,b,/),  id,e,c)  must  satisfy 
the  triangular  condition,  e.g.,  {a,b,c)  must  be  capable  of  forming  a  vector  coupling 
triangle  of  integer  circumference.  Due  to  these  conditions,  the  four  quantities  a  + 
h  +  c,h-^c-a,c-¥a-b,a  +  b  —  ezre  all  nonnegative  integers. 

In  a  similar  vein  a,  ft,  y  are  w-quantum  numbers;  the  m-triple  (a,  ft,  y)  is 
constrained  by  a  +  /3  +  7  =  0,  as  evident  from  Eq.  ( 1 ).  Also  the ym-pairs  {a, a), 
{h,ft),  (c,y)  must  satisfy  the  line-segment  condition,  e.g.,  -a  a^a,  integer  steps 
being  implied;  hcncc  a  ±  a,  b  ±  ft,  c  ±  y  are  all  nonnegative  integers. 

From  Eqs,  ( I  -3 )  it  is  clear  that  a  3-  j  or  6- j  symbol  is  in  general  the  positive  or 
negative  square  root  of  a  positive  rational  fraction,  or  zero.  Such  numbers  have 
been  called  root-rational  fractions,  or  RRFs,  by  Stone  and  Wood  [16];  we  suggest 
that  they  be  called  semi-rational  numbers,  a  term  we  shall  adhere  to  in  this  article. 


Binomial  and  Triangular  Coefficients 


The  binomial  coefficients  are  the  expansion  coefficients  of  (x  +  for  non¬ 
negative  integer  values  of  a,  which  covers  our  present  needs,  they  are  given  by 


_  a\ 

h)  bHa-b)r 


0  £  b  ^  a 


(4) 
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Obviously  b  is  also  a  nonnegative  integer,  and  the  binomial  coefficients  are  positive 
integers. 

In  his  investigations  of  internal  symmetries  of  the  3- j  symbol.  Regge  [  10]  dis¬ 
covered  the  importance  of  the  square  form 

(5) 

which  we  shall  call  the  Regge  square.  The  sum  of  the  elements  of  each  row  or 
column  of  that  square  is  equal  to  the  circumference  of  the  triangle,  a  +  6  +  c. 

Following  in  Regge’s  footsteps,  we  define  six  positive  integer  quantities,  to  be 
called  triangular  coejficienls,  or  shortly  T-coeJfiaenls.  with  respect  to  the  rows  and 
the  columns  of  the  Regge  square  (5).  namely 


T  . 

(a  -h  b  +  c  +  1 )! 

‘  abi 

~(b  + 

c  —  a)l(c  +  a  -  by.(a  +  b  -  c)l 

.. 

(a  +  b  y  c  +  1 )! 

aht,  ad") 

(a  - 

'  + 

(a  +  b  +  c  +  1 )! 

abi,  oii") 

(a  + 

a)l(b  +  0)\(c  +  y)\ 

J'l 

(a  +  b  +  c  +  l)l 

*  ubi.\  a 

■  (b-:- 

c  -  a)\(a  -  «)!(«  +  a)! 

T“. 

(a  +  b  +  c  +  1 )! 

*  ahi,  d 

(c  + 

a  -  by.ib  -  fiy.ib  +  fi)\ 

(a  +  b  +  c  +  1 )! 

‘  <//»<,  ■) 

(a  + 

b  -  c)!(c  -  7)!(c’  +  7)! 

(6) 


To  highlight  the  relation  between  the  T-coefficients  and  the  Regge  square  we 
introduce  the  T-coefficient  array 


T  ’ 

•  aht,  0 

T ah. 

7^3 

*  .7/h.  T 

(b  +  c  -  a)] 

(f  +  0  -  by. 

{a  +  b  -  c)! 

T'  ahi..  niit 

(a  -  ay. 

(b-ay. 

(c  -  7)! 

7-4 

‘  afn,  orf-) 

(a  +  «)! 

(/’  + 18)! 

(c’  +  7)! 

The  triangular  coefficient  Tah,  is  the  only  one  which  depends  exclusively  on  the 
y-quantum  numbers  a,  b.  c.  and  will  be  called  the  principal  triangular  coefficient. 
the  other  five  will  be  called  a.ssociated  triangular  coefficients  It  will  become  clear 
in  the  following  that  the  associated  T-coefficients  play  a  role  only  in  the  3-7  coef¬ 
ficients.  while  principal  r-coefficients  play  a  role  in  all  in-j  coefficients. 

Comparison  of  Eq.  (3)  with  the  first  Eq.  (61  yields 


I 


(8) 
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Note  also  that  the  triple  product  of  the  r-coefficients  from  the  roH’5  of  the  Regge 
square  ( 5 )  is  equal  to  the  corresponding  product  from  the  columns,  as  expressed 
by 


•*  ahc^  ahc,  a^y  *  ahc,  a^y  •*  ahc^  a  ■*  ahc,  /3  ■*  . 


aiK',  y 


(9) 


The  triangular  coefficients  can  easily  be  expressed  in  terms  of  binomial  coefficients. 
Using  Eqs.  (4,  6)  we  find 


■y  +  1) 

(c  +  'y+  1) 


(a  +  b  +  c+  1\/  2a+  1  \  ,  I 

'■*■(  b+c-a  I 

(a  +  b  +  c+  l\/b  +  c  +  a+  \\ 

)(  )(^- 

,  /a  +  b  +  c+  l\/b  +  c-  a+  \\ 

-(  j(  J(c  + 

,  (a  +  b  + c+ \\/2a+ l\ 
rV<.=  (fl  +  a+1) 

\  b  +  c  -  a  ]\a  -  a  ) 

,  (a-^b  +  c+  l\/26+  1\  , 

,  /a  +  ft  +  c+  l\  /2c+l\ 


(10) 


It  will  be  shown  in  the  following  sections  that  in  general  a  3n- j  symbol  can  be 
factorized  into  a  product  QR,  where  R  is  an  integer,  which  can  be  positive,  negative, 
or  zero,  while  the  normalization  factor  Q  is  the  reciprocal  square  root  of  a  product 
of  triangular  coefficients. 

For  the  3-  j  and  6-  j  symbols,  the  integer  R  can  be  factorized  into  a  product  5r, 
where  T  is  a  triangular  coefficient,  and  S  is  a  sum  over  products  of  binomial  coef¬ 
ficients. 

For  the  9-j  and  12-7  symbols,  the  integer  R  is  a  sum  over  products  ofS-sums 
from  6-  j  symbols,  multiplied  by  2z  +  1 ,  where  r  is  the  summation  variable. 

We  shall  speak  accordingly  of  QST  factorization  for  the  3-  j  and  6-  j  symbols, 
and  QR  factorization  for  the  9-j  and  12-7  symbols.  Pending  proofs  of  these 
claims  of  factorization,  it  is  clear  that  a  precomputed  table  of  binomial  coefficients 
is  all  we  need  to  calculate  3/;- j  symbols  without  any  explicit  reference  to 
factorials. 


QST  Factorization  of  the  3-7*  Symbol 
In  view  of  Eqs.  ( 1,  6,  8,  9)  we  can  write  the  3-7  symbol  in  the  form 
la  b  c\  _ 

I  I  Qaht.  ntty  Rahf,  atly 

\a  /3  y) 

where 


(11) 


Qaty.  ndy 


r4 

aht.  afiy 


1/7M  7^2  T'J 

•  *  afn,  a  *  u/n,  fi  *  ohc.  y 


(12) 
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and 

Rah.  ant  -  (a  +  6  +  c  +  1 )! 

xy _ _  /i-»\ 

,  {b-\r  z)\(a- a- z)\(a  + h- c- z)\  '  ’ 

X  (c  -  fl  -  /3  +  z)\{c  ~  b  +  a  +  z)!z! 

We  now  show  that  Rah.  ,„h  *s  an  integer  which  can  be  factorized  in  six  different 
ways,  namely  as  the  product  of  any  one  of  the  triangular  coefficients  (10)  and  a 
corresponding  sum  over  triple  products  of  binomial  coefficients,  to  be  called  a  3-  j 
S-sum. 

In  Eq.  ( 13)  we  convert  the  factorial  outside  the  sum,  (a  +  b  I-  c  +  1 )!,  into  any 
one  of  the  triangular  coefficients  by  inserting  the  appropriate  denominators,  while 
inserting  compensating  numerators  under  the  summation  sign.  Each  of  these  three 
numerators  can  then  be  combined  with  a  pair  of  factors  in  the  denominator  to 
yield  a  binomial  coefficient.  We  thus  obtain  six  different  factorizations  of  Rah. 
into  S,  T  companions,  namely 


where 


Rah.  (I.f-,  Sah.  ady  Tah  ~  C,  ah',  ady  R ah\  ady  ~  R ah.  ady  Rah.  tidy 

=  Sah  .  0{iy  T'  ahi\  a  -  slh  ,  atiy  T' abi,  0  ^ahi.,  a(iy  ^ oh,  y  (14) 


^ah.  ndy  ~  ^a+d+y.  o(  1  )'  ^  "*' 

R ah.  ady  ~  ^o+d+  t.  o(“l  )”  ^  ^ 

R ah.  ady  ~  ^n+rf+i.  o(  ~  I  )°  ^  ^ 

^ah.  ady  ~  o(~l  )°  *  ^ 

slh.  ady  =  5a.fl+..o(-l  )“■'’'" 

x2(-i)-f ](  1 

t  \a-a-z)\a  +  h-c-=)\h  +  0-z) 
Si-  =  5„,«,,.o(-l)“-^-^ 


(15) 
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Analogous  to  the  r-coefficient  array  (7)  we  introduce  the  3-  j  S-sum  array 


Note  that  any  S,  T  companions  of  Eqs.  ( 14)  appear  in  the  exact  same  position  in 
the  corresponding  arrays  (16)  and  (7).  respectively. 

Combining  Eos.  (11)  and  ( 14)  we  obtain  as  our  final  result  for  the  3-y  symbol 
the  six  equivalent  QST  factorizations 


la  h 
\a  0 
la  b 

U  0 

la  b 

U  0 

la  b 

U  0 


;) 


;) 


la  b 

U  0 


la  b 

U  0 


c 

y 


) 


Qahi, 

»(it 

T'ah 

Qahi, 

^  ahi. 

T' ah',  ady 

i2aht. 

Sahi, 

nii’i 

T*4 

*  uhi .  nd't 

Qahi, 

cl 

TU.  „ 

Qaht, 

SIih. 

T ah,,  fl 

c3 

'T'3 

•  ahi .  7 

(17) 


where  the  Q.  S,  7’ factors  are  given  by  Eqs.  ( 12,  15.  10).  respectively. 

In  Eqs.  (17)  all  six  r-coefficients.  the  associated  ones  as  well  as  the  principal 
one.  occur  on  an  equal  footing.  It  is  worth  noting  that  any  one  of  the  six  Eqs.  (17) 
can  be  derived  from  any  other  one  by  exploiting  symmetry  properties  of  the  3-  j 
symbol  given  by  Wigner  [2],  Racah  [3].  and  especially  Regge  [10]. 

The  sum  5„/„,  „.iy  given  by  the  first  Eq.  (17)  was  proposed  some  time  ago  by 
Nachamkin  [18]  and  by  Varshalovich  et  al.  [15].  and  has  recently  been  incorporated 
into  a  3-7  computer  program  by  Lai  and  Chiu  [20].  However  in  all  these  instances 
the  complete  formulas  for  the  3- j  symbol,  which  still  contain  numerous  factorials, 
are  in  practice  much  more  onerous  than  any  one  of  our  Eqs.  (17).  Also.  Na- 
chamkin's  [18]  formula  for  the  3-,/  symbol  contains  an  error,  having  two  factors 
in  the  numerator  which  belong  in  the  denominator. 
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QST  Factorization  of  the  6-j  Symbol 

The  6- j  symbol  as  defined  by  Eq.  (2)  refers  to  and  is  determined  by  four  vector 
coupling  tnangles.  namely  (a.  b,  c),  (a,  e,f),  (d,  b,f),  (d,  e,  c).  We  shall  call 
the  triangle  (a,  b,  c)  the  primary  triangle  of  the  6- j  symbol,  due  to  the  appearance 
of  a,  b,  c  as  the  upper  indices  of  the  6-  j  symbol;  each  of  the  other  three  secondary 
triangles  arises  from  two  lower  indices,  and  the  upper  index  above  the  third 
lower  index. 

In  actual  fact  the  four  triangles  are  equivalent,  and  the  selection  of  one  triangle 
as  the  primary  one  is  arbitrary.  Mathematically  this  is  expressed  by  the  statement 
that  the  6- j  symbol  is  invariant  under  exchange  of  any  pair  of  upper  indices  with 
the  corresponding  pair  of  lower  indices: 


1 

\a  e  f\ 

[d  b  /] 

^d  e  c 

d  e  f\  1 

\d  b  c  J 

[a  e  c  J 

[a  b  f. 

Another  important  symmetry  relation  states  that  the  6- j  symbol  is  invariant  under 
any  permutation  of  the  three  columns: 


a  b  c  ] 

c 

a 

b' 

d  e  f\ 

f 

d\ 

[f 

d 

e 

c 

b] 

\b 

a 

c 

f 

[e 

d 

c  b  a 
fed 


(19) 


The  symmetry  relations  (18.  19)  were  given  by  Wigner  [2];  they  can  easily  be 
verified  directly  from  Eq.  (z). 

The  QST  factorization  of  the  6-  j  symbol  is  considerably  simpler  than  the  cor¬ 
responding  process  for  the  3- j  symbol.  In  view  of  Eqs.  { 2,  8 )  we  can  write  the  6-  j 
symbol  in  the  form 


a  b  c 
d  e  f 


—  nahi 

“  ^dvf^dcf 


where 


VTuft,  Tae!  TjhlTdn 


(20) 


(21) 


and 

n  ulu  _  "V 


(-1)= 


X 


7  (/’  +  c  +  e  +  /-  z)!(c  +  a  +  f  +  d  —  z)\{a  +  b  +  d  +  e  -  z)\ 

{=+\y. 


(z  -  a  -  b  -  c’)!(z  -  e  -  /-  a)!(z  -  f  -  d  -  b)\(,z  -  d  -  e  -  c)\ 

(22) 


20 


ROOTHAAN 


We  now  show  that  RdPi  is  an  integer,  which  can  be  written  as  the  product  of  any 
one  of  the  four  triangular  coefficients  Tau ,  Ta,./.  Tjhh  Tdn  >  and  a  sum  over  quadruple 
products  of  binomial  coefficients,  to  be  called  a  6-  j  S-sum. 

In  view  of  the  equivalence  of  the  four  vector  coupling  triangles,  it  is  sufficient  to 
show  this  factorization  for  just  one  choice  of  the  four  r-coefficients.  Accordingly, 
in  Eq.  ( 22 )  we  introduce  Tahi ,  the  T-coefficient  associated  with  the  primary  triangle, 
as  the  factor  outside  the  sum,  and  the  reciprocal  of  Tatx  under  the  summation  sign, 
yielding 

n  aht  _  c*ahc  t*  ^  O  1  \ 

Kdc/  “  ^defiahi 

where 


=  I(-ir(  )(  /  ) 

.  \z  -  a  -  b  -  c l\z  -  e  -  f- a/ 

I  c  +  a  -  b  \/  a  +  b  ~  c  \ 

^[z-  f-d-  b)\z  -  d-  e-  c] 


(24) 


Combining  Eqs.  (20)  and  (23)  we  obtain  as  our  final  result  for  the  6-  j  symbol 


{^dcl  ^dii  *  afh  V  / 

where  the  Q,S,T  factors  are  given  by  Eqs.  (21, 24,  10),  respectively. 

Lai  and  Chiu  [20]  also  used  a  sum  over  quadruple  products  of  binomial  coeffi¬ 
cients  which  resembles  our  sum  Sj^f,  Eq.  (24).  However  in  order  to  convert  the 
summation  term  from  quotients  of  factorials  to  products  of  binomial  coefficients, 
they  inserted  what  turns  out  to  be  fragments  from  two  different  triangular  coefficients 
as  numerators  under  the  summation  sign.  Consequently,  the  compensating  frag¬ 
ments  outside  the  sum  cannot  be  combined  into  a  triangular  coefficient,  and  their 
complete  result  for  the  6-  j  symbol  cannot  easily  be  recast  into  a  formula  free  of 
explicit  factorials. 


a  b  c 
d  e  f 


QR  Factorization  of  the  9-j  Symbol 


The  9-  j  symbol  was  defined  by  Wigner  [2]  and  by  Arima  et  al.  [6]  in  connection 
with  the  coupling  of  four  angular  momenta.  Nowadays  it  is  customary  to  adopt  as 
the  starting  point  for  further  analysis  or  computation  a  formula  in  terms  of  6- j 
symbols,  namely  [16,19,21  ] 


a  b  c 
d  e  f 
g  h  i 


2  (-l)---(2r  + 


a  d  g 

b 

e  /t  1 

(c  f  i 

h  i  c 

d 

-  f\ 

1  -  ^ 

(26) 


The  derivation  of  Eq.  (26)  is  generally  attributed  to  Wigner  [2].  It  is  also  claimed 
to  have  been  discovered  by  Jahn  [5];  another  independent  derivation,  in  slightly 
different  form,  was  given  by  Arima  et  al.  [6]. 

Each  row  and  each  column  of  the  9-,/  symbol  mu.st  form  a  vector  coupling  triangle; 
one  consequence  of  this  is  that  in  Eq.  (26)  the  values  of  the  summation  index  r 
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are  either  all  integer  or  all  half  integer.  Hence  the  factor  ( - 1  under  the  summation 

sign  is  a  constant,  namely  +1  or  -1,  for  z  integer  or  half  integer,  respectively. 

The  symmetry  properties  of  the  9-  j  symbol  have  been  analyzed  in  detail  by  Jahn 
and  Hope  [5].  Simply  put,  the  9-y  symbol  is  invariant  when  subjected  to  a  reflection 
at  either  diagonal,  or  to  an  even  (cyclic)  permutation  of  the  rows  or  columns;  and 
the  9- j  symbol  is  multiplied  by  (-1 )"  when  subjected  to  an  odd  (anticyclic)  per¬ 
mutation  of  the  rows  or  columns,  where 


For  example 


c]  [if 

f  =  h  e 
i\  [g  d 


c  a  c 

f  d  =(-1)“/ 

/  g}  i 


We  now  proceed  to  derive  from  Eq.  (26)  a  formula  which  is  more  suitable  for 
our  purposes.  We  apply  Eq.  (26)  to  the  last  member  of  Eqs.  (28),  and  use  some 
of  the  symmetry  relations  (18,  1 9 )  so  that  the  summation  variable  z  appears  as  the 
upper  left  index  in  each  6-  j  symbol.  As  indicated  above,  the  phase  factor  ( —  1 
is  independent  of  the  summation  variable  z,  and  can  be  converted,  using  the  fact 
that  all  relevant  triangles  have  integer  circumferences,  into  (-1 )’,  where 

T  =  c- 2a -2e~2i  = -a +  b  +  c  +  d-  e  +  f+g+h-i  (29) 
Our  new  formula  for  the  9- j  symbol  in  terms  of  6- j  symbols  then  becomes 

\d  e  /  =(-!)' 2  (2z+ Dp  ^  ^  ^Ip  ^  (30) 

[g  h  i\  r  la  8  ^^Jl^  b  d\{i  f  h] 

Using  Eqs.  ( 30,  25,  2 1 )  we  obtain  as  our  final  result  for  the  9-  j  symbol 
a  b  c 

d  e  f  =  Qahc.  dcf.  ghi  ^abc,  dcf.  ghi  (31) 

h  i 


iTahcTdefTghiT adgThehTcfi 


^ahc,  dcf,  ghi  (  D'  2  (2z+  i)S=“5;(^s;^' 


The  difference  between  Eq.  (30)  and  Eqs.  (31-33)  is  not  trivial.  In  the  first  case 
the  sum  is  over  products  of  semi-rational  numbers,  while  in  the  second  case  the 
sum  is  over  products  of  integers.  The  occurrence  of  the  summation  index  z  as  the 
upper  left  element  in  each  6-  j  symbol,  together  with  the  fact  that  each  of  the  three 
z-dependent  primary  triangles  occurs  once  more  as  a  secondary  triangle,  causes  the 
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Q-factors  from  the  three  6-  J  symbols  to  combine  into  a  r-independent  (2-factor  for 
the  9-  j  symbol. 

We  note  that  Eq.  (33)  contains  four  formal  summations,  namely  those  required 
for  each  of  the  three  5-factors,  and  the  explicit  summation  over  z.  Extensive  efforts 
to  reformulate  the  9-  j  symbol  so  that  one  or  more  of  the  summations  can  be  carried 
out  analytically  were  undertaken  by  Jucys  and  Bandzaitis  [13],  and  by  Alisauskas 
and  Jucys  [17];  they  succeeded  in  deriving  a  formula  containing  one  less  summation. 
Starting  from  that  formula,  Lai  and  Chiu  [21  ]  attempted  to  recast  the  general  term 
under  the  triple  sum  as  a  product  of  binomial  coefficients.  Introducing  some  needed 
constant  factorials,  they  managed  to  convert  the  summation  term  into  a  product 
of  eight  binomial  coefficients  and  three  factorials.  Our  own  attempt,  making  some 
different  choices,  resulted  in  eight  binomoial  coefficients  and  two  factorials.  Needless 
to  say,  for  both  conversions  the  constant  factors  outside  the  sum  contain  many 
factorials  which  cannot  be  combined  into  triangular  coefficients.  Both  these  con¬ 
versions  at  least  achieve  that  the  required  summation  is  over  products  of  integers, 
which  is  a  necessary  prerequisite  for  obtaining  exact  results. 


QR  Factorization  of  Two  12-y  Symbols 

Two  12- j  symbols  have  been  defined:  one  by  Jahn  and  Hope  [  5  ]  in  connection 
with  the  coupling  of  five  angular  momenta,  and  another  one  by  Elliott  and 
Flowers  [8]  as  the  matrix  element  of  a  particular  irreducible  tensor.  Following 
Judd  [12]  we  shall  refer  to  them  as  12-y  symbols  of  the  first  and  the  second 
kind,  respectively.  The  original  notation  adopted  by  Jahn  and  Hope  [5]  for 
the  first  kind  has  been  replaced  by  a  more  sensible  one  proposed  by 
Ord-Smith  [7]. 

The  two  12-  j  symbols  are  defined  by  very  similar  formulas,  namely 
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where 


a=a+b+c+d+e+ f+ g+b+i+j+k+l 
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The  difference  between  the  two  \2-j  symbols  is  in  the  topology  of  the  index 
structure.  In  Eqs.  (34)  the  upper  right  index  pair  of  each  b-j  symbol  appears 
again  as  the  lower  right  index  pair  of  its  right  neighbor;  the  last  upper  index 
pair,  Ik,  appears  as  the  first  lower  index  pair  in  the  12-y  symbol  of  the  second 
kind,  but  reversed  as  kl  in  the  12-  j  symbol  of  the  first  kind.  The  latter  “twisted” 
index  structure  was  illustrated  by  Ord-Smith  [7]  in  a  diagram  using  a  Mobius 
strip. 

Comparing  Eqs.  ( 34 )  for  the  1 2-  j  symbols  with  Eq.  ( 30 )  for  the  9-  j  symbol,  the 
analogy  is  striking.  Using  Eqs.  (34,  25,  21)  we  obtain  as  our  final  results  for  the 
12-7  symbols 
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The  Diatomic  Molecule  Project  at  LMSS 
1956-1966:  Broken  Bottlenecks 

BERNARD  J.  RANSIL 

Deparlmenl  of  Medicine,  Beth  Israel  Hospital  and  Harvard  Medical  School, 

Boston,  Massachusetts  02215 

Introduction 

The  title  for  this  brief  account  of  my  association  with  the  Laboiatory  of  Molecular 
Structure  and  Spectra  (LMSS)  at  Chicago  is  taken  from  Chapter  25  of  Robert  S. 
Mulliken’s  autobiography,  Life  of  a  Scientist  [  1  ].  It  is  apt  for  two  reasons;  The  dates 
provide  a  convenient,  although  approximate,  window  of  reference  for  the  period, 
and  the  term  “broken  bottlenecks”  alludes  to  the  “bottlenecks”  that  were  identified 
at  the  historic  Shelter  Island  Conference  in  1951  as  the  most  important  barriers, 
at  the  time,  to  the  computation  of  accurate  analytical  molecular  wave  functions. 

As  it  happens,  the  dates  selected  by  Mulliken  to  delimit  the  period  coincide  with 
the  year  of  my  arrival  at  the  University  of  Chicago  as  a  post-doctoral  research 
associate  and  the  year  that  he  was  awarded  the  Nobel  Prize  in  chemistry.  But  there 
were  many  events  prior  to  1956  that  led  up  to  the  project,  upon  which  it  had  been 
built,  and  without  which  it  could  not  have  happened.  And  neither  my  departure 
from  the  project  in  1964  nor  Mulliken’s  acceptance  of  the  Nobel  Prize  in  1966 
signaled  the  project’s  decline  or  close,  for  under  Cade  and  Roothaan  it  proceeded 
at  a  momentum  that  carried  it  through  to  the  publication  of  the  extended  basis  set 
functions  in  1974. 

Among  the  many  events  that  led  up  to  this  decade  at  LMSS  and  made  it  possible, 
let  me  cite  a  few  of  what  I  consider  to  be  the  most  significant. 

•  Publication  of  Pauling’s  valence-bond  and  Hund  and  Mulliken’s  MO  theories  in 
the  1930s  and  the  subsequent  dissemination  and  discussion  of  these  theories 
during  the  next  several  decades,  with  many  publications  on  small  molecule  cal¬ 
culations  by  workers  in  Europe,  the  United  States,  and  Japan. 

•  The  Shelter  Island  meeting  in  September  195 1,  which  brought  most  of  the  workers 
in  the  field  together  in  five  sessions,  including  two  that  focused  on  the  integrals 
problem,  especially  the  difficult  three-  and  four-center  integrals,  and  the  means 
of  calculating  these  with  existing  computers.  Individuals  presenting  papers  at  the 
Integrals  Session  were: 

Boys,  Cambridge 

Herzfeld  and  Griffing,  Catholic  University 

Roothaan  and  Ruedenberg,  Chicago 
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Kotani,  University  cf  Tokyo 
Barnett  and  Coulson,  Oxford 
Kopineck,  Max  Planck  Institute,  Gottingen 
Lundquist  and  Ldwdin,  Uppsala 
Buehler  and  Hirschfelder,  Wisconsin 
Parr,  Carnegie  Institute  of  Technology 

•  Development  of  centers  for  integral  calculations:  Kotani  in  Japan;  Roothaan, 
Ruedenberg,  and  Jaunzemis  at  Chicago;  Barker,  Thome,  and  Eyring  at  Salt  Lake 
City;  Sklar  at  Catholic  University;  Coulson  and  Barnett  at  Oxford;  Preuss  in 
Gottingen;  Sahni  and  Cooley  at  New  York  University, 

•  Development  of  early  digital  computers:  Whirlwind  at  MIT,  SEAC  at  NBS, 
Washington,  DC,  ILIAC  at  University  of  Illinois,  and  the  Remington  Rand 
Univacs. 

•  Development  of  digital  computer  programs  to  compute  integrals  by  Barker  and 
Eyring  at  Utah,  Roothaan  and  Ruedenberg  at  Chicago  with  an  offshoot  at  MIT, 
Nesbei  at  Boston  University,  Slater  at  MIT,  Barnett  at  Oxford,  and  others;  pub¬ 
lication  of  accurate  integral  tables  by  several  centers. 

For  myself,  a  young,  inexperienced  Ph.D.,  fresh  from  a  year  of  post-doctoral  work 
in  the  Heat  Division  at  the  National  Bureau  of  Standards,  where,  at  the  suggestion 
of  my  chief,  Charles  Beckett,  I  had  become  acquainted  with  the  Bureau’s  digital 
computer,  SEAC  (Standards  Eastern  Automatic  Computer),  there  was  a  series  of 
events  that  led,  in  random  walk  fashion,  to  my  decision  to  go  to  Chicago.  Among 
these,  I  recall  the  following  as  the  most  decisive: 

•  The  availability  of  a  home-made  analogue  computer  in  the  Chemistry  Department 
at  Catholic  University  in  the  early  1950s,  on  which  my  Ph.D.  dissertation — an 
LCAO-MO-SCF  treatment  of  the  H3  complex  in  open-shell  minimal  Slater  orbital 
approximation — was  done,  using  self-calculated  integral  values  and  values  in¬ 
terpolated  from  Kotani’s  tables. 

•  The  publication  of  Roothaan’s  seminal  and  breakthrougt>  paper  on  the  lcao- 
MOSCF  nje,hod  for  computing  wave  functions  in  1951,  just  as  I  entered  the 
Ph.D.  program  at  Catholic  University  [2], 

•  A  post-doctoral  year  at  NBS  1 955-56  during  which  I  learned  the  theory  and 
elements  of  digital  computing  by  programming  the  SEAC,  with  the  assistance  of 
Emilie  Haynsworth  and  Phil  Walsh,  to  solve  the  Schrodinger  wave  equation  for 
the  H3  complex;  verified  the  analogtie  computer  results  to  three-figure  agreement, 
which  was  one  to  two  figures  less  than  the  integral  accuracy;  and  recognized  the 
possibility  of  generalizing  the  program  to  compute  wave  functions  for  simple 
diatomic  molecules. 

•  A  letter  to  Professor  Mulliken  in  the  spring  of  1956,  inquiring  about  the  possibility 
of  doing  a  post-doctoral  at  LMSS. 

•  The  gradual  coming  together  of  many  things  involving  Mulliken  and  a  grant  that 
he  was  negotiating  with  NSF  to  support  such  a  progiam;  the  integrals  work  by 
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Roothaan  that  had  been  in  progress  at  Chicago  for  several  years;  Roothaan's 
plans  for  a  one-year  sabbatical  on  a  Guggenheim  in  Europe;  the  decision  that 
Roothaan's  graduate  students  at  the  time,  Doug  McLean,  Andy  Weiss,  and  Meg- 
umu  Yoshimine,  would  contribute  their  integrals  work  and  programming  skills 
to  the  construction  of  a  computer  program  on  the  Remington  Rand  Univac  1 103 
at  Wright-Patterson  Air  Force  Base,  to  which  LMSS  had  contractual  access.  The 
program  would  compute  ab  diatomic  molecule  wave  functions  in  minimal 
Slater  orbital  LCAO-MO-SCF  approximation,  in  which  the  Slater  exponents  would 
be  iteratively  optimized  for  minimum  molecular  energy.  Three  levels  of  approx¬ 
imation  were  visualized,  distinguished  by  the  orbital  exponents;  Slater  exponents, 
best  optimized  atomic  exponents,  and  best  optimized  molecular  exponents.  The 
concept  of  an  ah  initio  computer  program  had  occurred  to  others  as  well,  e.g., 
Nesbet  at  Boston  University,  working  alone,  was  programming  integrals  and  an 
SCF  routine  at  the  time  for  selected  diatomics. 

Early  Days  in  the  Project 

When  1  arrived  at  LMSS  in  September  1956  as  an  NSF  Research  Associate,  the 
faculty  consisted  of  Professor  Robert  S.  Mulliken,  Associate  Professor  John  R. 
Platt,  and  Assistant  Professor  Clemens  C.J.  Roothaan.  Rucdenberg  had  just  departed 
for  Iowa,  and  Scherr,  completing  his  thesis  on  Ni,  was  about  to  leave  for  Texas. 
Jaunzemis  had  already  been  gone  a  year.  Doug  McLean  and  Andy  Weiss  were 
research  assistants  working  toward  their  Ph.D.’s.  The  computational  staff  consisted 
of  Tracy  Kinyon  on  a  desktop  mechanical  computer  at  Chicago,  Phil  Merryman 
on  the  Whirlwind  at  MIT,  and  Megumu  Yoshimine  programming  integrals.  Bill 
Lester  was  an  undergraduate  student  working  as  a  research  assistant.  Augmented 
by  Doug  McLean’s  dog.  Rover,  McLean,  Weiss,  Yoshimine,  and  I  constituted  the 
group  working  on  the  diatomic  molecule  project  for  the  subsequent  1 8-24  months, 
with  Robert  as  Big  Boss,  Clemens  as  Little  Boss,  and  me  as  Straw  Boss,  titles  con¬ 
ferred  on  us  one  memorable  day  by  Andy,  Doug,  and  Megumu. 

The  Project  Under  Way 

[The  history  of  the  next  18-24  months  is  succinctly  recounted  in  Chapter  25  of 
Mulliken’s  autobiography  from  which  the  following  highlights  have  been  adapted.] 
Beginning  in  the  fall  of  1956,  the  project  moved  ahead  steadily  for  the  next  18 
months  with  the  team  of  Weiss,  Yoshimine,  McLean,  and  Ransil  flying  back  and 
forth  to  Dayton  in  groups  of  twos,  threes,  or  fours,  for  two  to  three  evenings  and 
nights  of  computing,  wi  b  Ransil  assuming  responsibility  for  the  overall  design  of 
the  computer  program,  tne  production  runs  and  administrative  problems,  and 
Yoshimine,  McLean,  and  Weiss  assuming  responsibility  for  programming  the  in¬ 
tegrals  and  the  SCF  routine.  Everybody  on  the  team  did  what  is  now  called  alpha 
and  beta  testing  against  the  very  rigorous  hand  calculations  of  Tracy  Kinyon  on 
his  mighty  Marchant  mechanical  desktop  calculator,  and  the  Kotani  tables.  Most 
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calculational  work  in  those  days  was  done  on  Monroe,  Marchant,  or  Olivetti  cal¬ 
culators. 

When  Roothaan  returned  from  Europe  in  December  1957,  the  minimal-orbital 
diatomic  molecule  program  was  close  to  produc‘ion-run  status.  It  was  agreed  that 
McLean.  Yoshimine,  and  Weiss  would  remain  on  the  project  until  the  program 
was  debugged  and  operational,  after  which  they  would  spend  full  time  on  their 
doctoral  theses. 

In  the  mecntime,  as  Wright-Patterson  updated  to  an  1103  and  later  to  an 
1 103 A,  we  began  to  talk  of  an  extended  orbital  version  of  the  program,  and 
Roothaan  recruited  a  new  generation  of  graduate  student  assistants  and  began 
his  accurate  atom  and  molecule  calculations.  The  second  generation  of  graduate 
student  programmers  began  to  trickle  in  to  LMSS  beginning  the  summer  and 
fall  of  1958  and  continuing  into  1960,  with  Paul  Bagus  in  the  vanguard  followed 
by  Main,  Miroslav  Synek,  Chris  Wahl,  Winifred  Huo,  and  many  more  during 
the  early  sixties. 


Minimal  Basis  Set  Production  Runs 

But  to  return  to  the  diatomic  molecule  program,  early  in  1958,  Serafin  Fraga 
Sanchez  arriv  ;d  from  Spain  on  a  Juan  March  Fund  scholarship  to  join  the  project, 
preparatory  to  a  collaboration  with  Mulliken.  When  the  diatomic  molecule  program 
became  operational  in  the  spring-summer  of  1958  and  McLean,  Weiss,  and  Yosh¬ 
imine  undertook  their  thesis  work  on  Hartree-Fock  quality  atomic  and  molecular 
calculations  under  Roothaan,  Fraga  and  Ransil  began  production  runs  on  the  first 
row  diatomic  homopolar  and  hydride  and  selected  heteropolars  (BF,  CO,  and  LiF). 
The  output  consisted  of  the  lcao-mo  wave  functions,  total  molecular  and  orbital 
energies,  binding  energies,  ionization  potentials,  dipole  and  quadrupole  moments, 
population  analysis,  spectroscopic  constants,  limited  mixed  configuration  functions, 
and  potential  energy  curves  for  H2,  HeJ^,  and  He2.  The  original  output,  bound  in 
hard-board  covers,  still  exists. 

Early  Fruits  of  the  Project 

The  early  results  of  the  diatomic  project  and  Roothaan’s  exact  atom  and  molecule 
work  were  presented  at  the  Conference  on  Molecular  Quantum  Mechanics  at 
Boulder  in  June  1959  and  were  subsequently  published  in  the  issue  of  the  Reviews 
of  Modem  Physics  dedicated  to  that  meeting  [3].  By  this  time,  the  prospect  of 
accurate  numerical,  rather  than  analytical,  solutions  to  the  Schrodinger  eqi'ation 
had  been  seen  as  realizable  and  was  being  actively  pursued  by  Roothaan  and  Wlod- 
zimierz  Kolos,  on  leave  from  the  University  of  Warsaw. 

Also,  by  this  time.  LMSS  was  a  beehive  of  activity  with  many  student  technicians. 
rese"rch  assistants  and  research  associates,  and  visitors  from  all  over  the  world. 
Amc  ig  the  individuals  who  came  to  participate  in,  and  contribute  to.  the  various 
LMSS  activities  in  this  10-year  period  as  research  associates  and  faculty,  the  LMSS 
Technical  Reports  record  the  following: 
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Karl  H.  Hausser 

Max  Pla.'ck  Institute.  Heidelberg 

9/56-6/57 

Takashi  Itoh 

University  of  Tokyo 

12/56-12/57 

James  R.  Richardson 

Purdue  University 

4/56-9/56 

Robert  G.  Parr 

Carnegie  Institute  of  Technology 

6/57-7/57 

Robert  A.  Sack 

British  Rayon  Industries 

9/58-10/59 

Serafin  Fraga  Sanchez 

Juan  March  Fund  Fellow.  Spain 

6/59-4/61 

Sigeru  Huzinaga 

Kyushu  University 

6/59-7/61 

David  Peters 

Royal  Holloway  College,  London 

10/60-9/61 

Charles  W.  Scherr 

University  of  Texas 

6/60-2/61 

Paul  E.  Cade 

University  of  Wisconsin 

9/61-8/69 

Enrico  Clementi 

IBM 

10/60-7/61, 

1/66-12/66 

Wlodzimierz  Kolos 

University  of  Warsaw 

7/57-6/58, 
6/61-9/61 
also  1966,  1967 

Stanton  Ehrenson 

Brookhaven  National  Laboratory 

9/60-12/62 

John  B.  Greenshields 

Duquesne  University 

9/62-8/64 

Roberto  Moccia 

Instituto  Chimico  della 

Universita,  Naples 

12/60-8/62 

Keith  Sales 

University  of  London 

9/61-1962 

John  J.  Sinai 

Purdue  University 

2/61-7/64 

Willis  B.  Person 

University  of  Iowa 

9/60-9/6 1 ,  also 
1966 

William  Lichten 

Sloan  Foundation  Fellow 

1/62-1/63 

Sigrid  Peyerimhoff 

Justus  Liebig  University 

Giessen,  Germany 

-4/64 

Lutoslaw  Wolniewicz 

Nicholos  Copernicus  University 
Torun,  Poland 

-7/64 

Juergen  M.  Hinze 

Bielefeld 

2/66-1970 

Minimal  Basis  Set  Results 

The  LMSS  contributions  to  the  Boulder  meeting  earlier  that  year  had  included 
Roothaan’s  SCF  formalism  for  open-shell  atomic  systems  and  accurate  calculations 
on  atoms  and  2-  and  3-body  systems,  with  Kolos.  Sack,  Sachs,  Weiss.  McLean, 
and  Yoshimine;  Ransil  and  Fraga's  minimal  orbital  diatomic  molecule  results; 
Mulliken  and  Ransil's  systematization  of  lcao  nomenclature;  and  Mulliken’s  study 
with  Fraga  of  coulomb  energy  in  valence  bond  theory.  Following  the  meeting,  the 
production  runs  on  the  minimal  orbital  project  were  completed  and  were  eventually 
published  in  eight  journal  articles  and  two  reports  in  the  LMSS  Technical  Reports. 
The  first  two  papers,  containing  a  description  of  the  project  and  tabulated  wave 
functions,  were  published  in  the  Reviem  of  Modern  Physics  in  1960  [4,5];  the 
remaining  six,  reporting  computed  spectroscopic  constants,  population  analyses, 
potential  curves  for  H2.  He:  and  selected  first  row  diatomics  and  limited  config- 
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uration  interaction  (as  it  was  called  then),  for  selected  first  row  diatomics,  appeared 
at  intervals  in  the  Journal  of  Chemical  Physics  until  1962  [6-13]. 

The  operational  success  of  the  atomic  calculations  and  the  ab  initio  diatomic 
molecule  program  and  the  results  it  generated,  clearly  heralded  the  dawn  of  a  new 
era.  It  seemed  at  the  time  that  as  both  computer  speed  and  memory  capacity  im¬ 
proved,  the  main  obstacle  to  obtaining  accurate  analytic  wave  functions  and  energies 
for  small  molecules  would  be  how  to  give  accurate  analytical  representation  to 
electron  correlation.  All  other  considerations — the  nature  of  the  basis  sets,  basis 
set  expansion  limits,  and  parametric  optimization — were  perceived  as  technical 
problems  that  would  yield  to  the  inexorable  advance  in  computer  technology. 

In  1959,  Mulliken  and  Roothaan  surveyed  the  progress  made  since  the  Shelter 
Island  conference  eight  years  earlier,  in  breaking  the  two  major  bottlenecks  that 
had  been  identified  at  that  time.  The  first,  the  methods  of  computing  difficult  in¬ 
tegrals,  had  been  solved  by  Kotani,  Amamiya,  and  Sumose  in  Japan,  Roothaan 
and  Ruedenberg  at  Chicago,  and  Coulson  and  associates  at  Oxford.  The  second, 
the  excessively  time-consuming  process  of  computing  these  integrals  and  solving 
the  Schrddinger  wave  equation,  had  been  solved  by  the  timely  development  of  high¬ 
speed  computing  machines. 

Moving  on  to  Extended  Basis  Set  Functions 

Coincident  w)th  the  harvesting  of  the  minimal  basis  set  program  was  the  planning, 
design,  and  beginning  implementation  of  a  second  generation,  extended  basis  set 
diatomic  molecular  program,  with  the  assistance  of  the  new  generation  of  graduate 
student  programmers  recruited  by  Roothaan.  Ransil  made  plans  to  enter  medical 
school  in  September  1960,  during  which  he  worked  parttime  at  LMSS  to  complete 
the  publication  of  the  minimal  orbital  work,  and  to  relinquish  responsibility  for 
the  extended  basis  set  work  to  Paul  Cade  from  Professor  Hirschfelder’s  group  at 
the  University  of  Wisconsin.  Cade  took  over  in  September  of  1961.  At  that  time, 
it  was  projected  that  he  would  follow  a  design  similar  to  the  minimal  orbital  study, 
but  would  use  an  extended  Slater  basis  set  lcao-mo-SCF  approximation.  This  effort 
eventually  produced  a  rich  harvest  of  diatomic  molecule  studies;  tables  of  the  ex¬ 
tended  basis  set  functions  were  published  by  Cade  et  al.  in  1974  [14]. 

Accurate  Numerical  Functions 

As  these  activities  were  carried  out  on  the  molecular  side,  Roothaan  concentrated 
largely  on  accurate  numerical  calculations  on  the  hydrogen  molecule  with  Kolos. 
Others  involved  in  Roothaan’s  accurate  atomic  and  molecular  calculations  during 
the  decade  being  remembered  were  Sack,  Weiss,  Sachs,  and  Wolniewicz,  a  student 
of  Kolos,  from  Copernicus  University  in  Torun,  Poland. 

In  Retrospect 

Looking  back  from  the  perspective  of  some  30  years,  it  is  my  perception  that 
what  happened  during  the  first  half  of  that  eventful  decade  at  Chicago,  as  a  result 
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of  ( 1 )  the  mathematical  and  integrals  groundwork  laid  by  Kotani,  Roothaan,  Rue- 
denberg,  Coulson,  and  a  host  of  others,  (2)  the  development  of  the  first  generation 
of  commercial  digital  computers,  and  (3)  the  emergence  of  graduate  students  and 
post-docs  trained  in  the  physics  of  molecules,  mathematics,  and  early  computer 
science,  was  the  gradual  coalescence  of  all  these  factors  into  a  new  science,  in  a 
serendipitous  environment  that  made  working  together  not  only  possible  but  fun. 
It  was  a  privilege  to  be  there  at  the  time  and  to  work  with  Andy,  Doug,  Megumu, 
and  Clemens.  I  have  never  had  the  opportunity,  and  probably  never  will  again,  to 
publicly  thank  them  and  to  acknowledge  my  deep  and  lasting  indebtedness  to  them 
for  being  what  they  were  to  me  in  those  days:  teachers,  colleagues,  and  stalwart 
friends.  It  was  a  once-in-a-lifetime  experience,  and  I  thank  the  organizers  of  this 
conference  for  setting  aside  time  over  the  coming  few  days  during  which  we  can 
revisit  that  era,  reflect  upon  it,  catch  perhaps  some  of  the  optimism  and  spirit  of 
the  period,  and  appreciate  it  for  what  it  was:  the  birth  of  a  new  scientific  discipline 
that  was  neither  chemistry  nor  physics,  neither  mathematics  nor  computer  science, 
but  a  fusion  of  all  of  these  which  produced  a  prototype  tool  for  exploring  and  testing 
theories  of  molecular  binding  and  interaction. 
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Abstract 

Statistical  electron  correlation  coefficients  preside  oserall  measures  of  the  difference  between  the  electron 
pair  densits  and  the  product  of  one-electron  densities.  Explicitls  correlated  svasc  functions  arc  used  to 
compute  radial  and  angular  correlation  coefficients  for  19  low-lsing  states  (n'S.  n'S.  n'P.  n'P.  n'D.  and 
n^D  with  /I  <■  7)  of  the  two-electron  ions  from  helium  through  Nc*'.  This  data  base  of  1566  correlation 
coefficients  permits  a  ssstematic  studs  of  trends  with  respect  to  sanation  of  nuclear  charge,  degree  of 
excitation,  angular  momentum,  and  spin  multiphcit>.  c  i99.t  John  Wilcs  &  Sons.  Inc 

Introduction 

In  contemporary  quantum  chemistry,  those  effects  neglected  by  the  Hartree- 
Fock  model  are  normally  called  electron  correlation  effects.  However,  from  a  sta¬ 
tistical  standpoint,  electron  correlation  means  that  the  motions  of  electrons  are 
statistically  correlated;  that  is.  the  electron  pair  density  is  not  simply  the  product 
of  one-electron  densities  [1].  These  two  interpretations  of  electron  correlation  are 
different  because  the  antisymmetric  Hartree-Fock  wave  function  does  account  for 
the  Fermi  correlation  [2]  between  electrons  of  like  spin.  Statistical  electron  cor¬ 
relation  has  received  only  a  very  tiny  fraction  of  the  attention  that  conventional 
electron  correlation  has. 

The  purpose  of  this  paper  is  to  explore  systematically  how  statistical  correlation 
varies  with  nuclear  charge  Z.  degree  of  excitation,  angular  momentum  L,  and  spin 
multiplicity  in  the  helium  isoelcctronic  scries.  To  this  end,  we  consider  statistical 
electron  correlation  in  29  low-lying  states  (n'S,  n’S.  «'P,  n  V.  «'D,  and  with 
n  <  7)  for  each  of  the  nine  ions  from  He  through  Ne"^ .  It  would  be  difficult,  if  not 
impossible,  to  assimilate  mentally  the  trends  displayed  by  a  series  of  plots  of  statistical 
pair  correlation  densities  for  each  of  261  states.  Therefore,  it  is  natural,  and  even 
imperative,  to  turn  to  simple  measures  that  give  an  overall  sense  of  statistical  cor¬ 
relation.  Convenient  indices  of  this  sort,  termed  statistical  correlation  coefficients, 
were  introduced  by  Kutzelnigg  et  al.  [3].  In  this  work,  we  use  three  radial  and  three 
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angular  correlation  coefficients  to  characterize  the  statistical  electron  correlation  in 
each  of  these  states.  Even  with  this  simplification,  we  are  left  with  the  task  of  dis¬ 
cerning  trends  in  1566  correlation  coefficients.  The  method  is  outlined  in  the  next 
section,  and  the  results  are  presented  and  discussed  in  the  third  section.  Hartree 
atomic  units  are  used  throughout. 


Method 

Standard  concepts  from  mathematical  statistics  and  probability  theory  led 
Kutzelnigg  et  al.  [3]  to  use  probe  functions  g(  7)  of  the  electronic  position  vectors 
to  define  generalized  correlation  coefficients  in  terms  of  one-  and  two-electron  ex¬ 
pectation  values: 

Thakkar  and  Smith  [4]  pointed  out  that  Eq.  ( 1 )  can  be  rewritten  as 

2  J  c(7,,72)g(7,)^(r2)  dr^  drz 

- ,  (2) 

J  Di(ri)D,(r2)[g^(r,)  +  g^(r2)  -  2g(ri)g(r2)]  dri  dr2 

where 

t'(7|,72)  =  D2(rt,T2)  -  /)|(7|)Z),(72)  (3) 

is  the  statistical  pair  correlation  density  in  which 

£>2(7i,72)  =  J  ^*{xi,X2, . . .  ,x„)^p{xi,X2, . . .  ,x„)  i/(T,  d(j2  dxi  dx^-  •  •  dx„ 

(4) 

is  the  unit  normalized  one-electron  density.  Equations  (2)  and  (3)  clearly  show 
that  a  statistically  m« correlated  pair  density,  c(7|  ,72)  =  0,  is  a  sufficient,  but  not 

=  J  4'*(xuX2 . x„)4^(xi,X2,  . . .  ,x„)da,  dx2  dx^  dx4-  •  •  dx„  (5) 

is  the  unit  normalized  one-electron  density.  Equations  (2)  and  (3)  clearly  show 
that  a  statistically  Hwcorrelated  pair  density,  c(7i,  72)  =  0,  is  a  sufficient,  but  not 
necessary,  condition  for  vanishing  correlation  coefficients.  There  are  two  ambiguities 
in  the  definition  of  the  coefficients.  They  depend  on  the  origin  of  the  coordinate 
system  used,  and  the  product  of  probe  functions  is  ambiguous  if  the  probe  is  a 
vector  function.  These  ambiguities  are  resolved  as  follows.  Since  we  are  concerned 
only  with  atoms,  the  nucleus  is  the  natural  origin.  The  product  of  probe  functions 
will  be  interpreted  as  a  scalar  product  [3]. 

Radial  correlation  coefficients  are  obtained  if  the  probe  function  is  spherically 
symmetric.  The  simplest  radial  probe  function  isg(7)  =  r’',  which  leads  to 
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<rtr§>-<rt>- 

<rf >  -  {r\y  ■ 


(6) 


The  radial  coefficients  with  /c  =  + 1  have  been  studied  for  the  five  lowest  states  of 
the  two-electron  ions  [3-5]  and  for  the  ground  and  first  excited  states  of  the  Li- 
like  and  Be-like  ions  [6-8].  The  /c  ==  -1  coefficient  was  considered  in  the  latter 
studies  to  investigate  shell  effects.  Since  the  Rydberg  states  we  consider  are  relatively 
diffuse,  we  also  examine  k  =  2,  which  should  place  emphasis  on  the  outer  regions. 
We  use  the  /:  =  - 1 ,  1,2  coefficients  to  characterize  radial  correlation  in  the  K- 
shell,  intermediate,  and  Rydberg  regions,  respectively,  of  the  charge  cloud. 

Angular  correlation  coefficients  result  from  the  probe  function  g  ( r)  =  rr^.Thus, 

(r’ir^Trh) 


These  angular  correlation  coefficients  cannot  exceed  1  in  absolute  value.  Perfect 
positive  correlation  ( r  =  1 )  means  that  the  position  vectors  are  expected  to  coincide 
whereas  perfect  negative  correlation  (r  =  - 1 )  means  that  the  position  vectors  are 
diametrically  opposed  with  respect  to  the  nucleus.  Statistically  uncorrelated  pair 
densities  lead  to  r  =  0  but  so  do  orthogonal  position  vectors.  The  case  /<  =  0  is  of 
special  interest  because  r?  can,  in  principle,  be  obtained  experimentally  [4,9].  When 
A:  =  - 1,  the  probe  function  is  a  unit  vector,  and  the  correlation  coefficient  is  simply 
the  expectation  value  of  the  cosine  of  the  angle  $12  subtended  at  the  nucleus  by  a 
pair  of  electron  position  vectors:  rj/r  =  (cos  ^12).  We  choose  the  A:  =  -1,  1.  2 
coefficients  to  characterize  angular  correlation  in  the  K-shell,  intermediate,  and 
Rydberg  regions,  respectively,  of  the  charge  cloud. 

Three  radial  and  three  angular  coefficients  were  computed  for  each  of  the  29  low 
lying  states  («'S,  n^S,  «'P,  «^P,  n'D,  and  n^D  with  n  <1)  for  each  of  the  nine 
10ns  from  He  through  Ne*^.  The  computations  are  based  on  Cann  and  Thakkar’s 
systematic  collection  of  very  accurate  wave  functions  for  these  states  [10].  Their 
spin-free  wave  functions  incorporate  explicit  dependence  on  the  interelectronic 
distance  r\i=  |  -  r2i  as  follows: 


^  =  2  1  ±  PM)r\ expf-a^r,  -  &,,r2  -  Tin2)T/,.o(fi|)>o.o(iA2) 

A-1 

A'+A/ 

+  5a,2  2  dd  \  ±  P\2)rxr2  expf-atr,  -  -  7*n2)T(fii,  ^2) ,  (8) 

k=N+\ 


where 


Y(Q,,  fi2)  =  2r,,o(n,)y,.o(fi2)  +  T,.,(n,)y,._,(fi2)  +  Y,,-dW^A^2)  ■  (9) 

In  the  above,  upper  and  lower  signs  correspond  to  the  singlet  and  triplet  states, 
respectively,  r,  =  (r,,Q,),  is  a  permutation  operator  that  interchanges  r,  and 
72,  Lis  the  total  orbital  angular  momentum  quantum  number,  6/,.2  is  a  Kronecker 
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delta,  y/..M  are  spherical  harmonics,  and  the  {c/t,  d^,  a^,  ^k,  7a,}  are  variational 
parameters.  They  each  contain  100  terms,  with  N  =  70  and  M  =  30  for  the  D 
states.  These  wave  functions  lead  to  the  lowest  variational  energies  published  so  far 
except  for  neutral  He  and  the  3'P  and  3^P  states  of  the  ions.  Even  in  the  worst 
cases,  the  energies  are  no  more  than  30nEn  above  the  best  ones  [10].  A  fuller 
discussion  of  the  wave  functions  may  be  found  elsewhere  [10].  Correlation  coef¬ 
ficients  in  the  infinite  Z  limit  were  computed  from  hydrogenic  wave  functions. 

Results  and  Discussion 

The  correlation  coefficients  are  shown  graphically;  complete  tables  may  be  found 
elsewhere  [11].  Figures  1-3  show  the  K-shell  correlation  coefficients  ri/^  and  Tf/, 
for  the  «  =  1,  2,  n  =  3,  and  n  =  4  states,  respectively.  Figures  4-6  show  a  similar 
sequence  for  the  intermediate  region  correlation  coefficients  T,and  tj,  and  Figures 
7-9  show  the  Rydberg  region  correlation  coefficients  and  77,.  The  results  for 
the  «  =  5  and  n  =  6  states  are  not  shown  because  they  resemble  those  for  «  =  4 
qualitatively,  because  the  trends  observed  as  n  increases  from  3  to  4  persist  to  larger 
n,  and  because  t?  and  show  some  large  Z  erratic  behavior  that  is  probably  a 
computational  artifact. 

The  ground  state  is  qualitatively  different  from  the  excited  states;  in  the  orbital 
model,  the  two  electrons  occupy  the  same  orbital  in  the  ground  state  whereas  they 
occupy  different  orbitals  in  the  excited  states.  Moreover,  statistical  electron  corre¬ 
lation  in  the  ground  state  has  been  studied  previously  [3,4],  as  have  some  of  the 
coefficients  for  the  «  =  2  states  [4,5].  Therefore,  in  this  paper,  we  focus  on  trends 


i/z  i/z 

Figure  I.  r,/,  and  as  a  function  of  1/Z  for  the  1  'S  (0),  2'S(A),  2’S  (A),  2'P  (O), 

and  2’P  (•)  states. 
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Figure  2.  r,,,  and  as  a  function  of  I /Z  for  the  3'S  (A).  3^S  (A).  3'P{0).  3’P  (•). 
3'D(D).and  3 (■)  states. 


Figure  3.  T|/,and  t?/,  as  a  function  of  I  /Z  for  the  4'S  ( A),  4^S  ( A),  4'P  (O).  4^P  (•), 
4'D  (□),  and  4^D  (■)  states. 


.  3^P(*). 
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Figure  6.  r,and  t?  as  a  function  of  l/Z  for  the  4'S  (A).  4^S  (A).  4'P  (O),  4’P  (•), 
4  'D  (□),  and  4’D  (■)  slates. 


1/Z  1/Z 

Figure  7.  r,:  and  r,?  as  a  function  of  1/Z  for  the  1  'S  (0),  2'S  (A),  2’S  (A),  2'P  (O), 

and  2’P(*)  stales. 
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Figure  8.  t,i  and  t,j  as  a  function  of  l/2for  the  3'S  (A).  3^S  (A).  3'P  (O).  3’P  (•). 
3  'D  (□),  and  3’D  (■)  stales. 


Figure  9  t,i  and  T,r  as  a  function  of  l/Z  for  the  4 'S  (A),  4'S  (A),  4’P  (O),  4’P(#), 
4'D  (□),  and  4’D  (■)  states 
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of  the  excited  state  correlation  coefficients  with  variation  in  nuclear  charge  Z,  angular 
momentum  L,  degree  of  excitation  «,  and  spin  S. 

Consider  the  relative  magnitudes  of  correlation  in  the  three  regions.  The  ratio 
Ti/r'.Tr'.Tr  is  approximately  3:7:5,  indicating  that  radial  correlation  is  most  important 
in  the  intermediate  region,  and  least  important  in  the  K-shell.  Angular  correlation, 
on  the  other  hand,  decreases  in  importance  with  distance  from  the  nucleus.  For  « 
=  2,  the  ratio  Tj/Z.Ty-.Trr  is  4:2:1  but  for  «  =  6  this  ratio  becomes  100,000:500:1. 
These  ratios  can  be  understood  quite  simply.  In  the  K-shell,  angular  correlation  is 
more  efficient  than  radial  correlation  because  of  the  small  size  of  the  K-shell;  thus, 
in  the  K-shell,  correlation  energy  is  obtained  primarily  through  decreased  shielding, 
leading  to  an  increase  in  electron-nucleus  attraction.  In  the  intermediate  region, 
both  forms  of  correlation  can  be  important.  In  the  huge  Rydberg  region,  radial 
correlation  is  favored  because  greater  stabilization  can  be  obtained  by  reduction  of 
interelectronic  repulsion  than  by  increase  of  attraction  to  the  distant  nucleus.  The 
probability  of  finding  both  electrons  in  one  of  these  regions  is  greatest  for  the  in¬ 
termediate  region,  and  so  radial  correlation  is  most  significant  there. 

On  the  basis  of  the  arguments  presented  above,  one  would  expect  radial  correlation 
to  increasingly  ovefshadow  angular  correlation  as  the  size  of  the  Rydberg  region 
increases  with  n.  Indeed,  the  figures  show  that  as  n  increases,  the  radial  correlation 
coefficients  increase  slightly  in  magnitude  but  the  angular  correlation  coefficients 
decrease  dramatically,  ry/r,  ry,  and  rr^decrease  by  factors  of 40, 4000,  and  500,000, 
respectively,  as  n  increases  from  2  to  6. 

The  behavior  of  the  correlation  coefficients,  for  fixed  n.  L,  and  S,  as  a  function 
of  Z  is  described  by  1  /Z  expansions  [4]; 

Tg  —  fl^.o  +  fljj.i/Z  +  OgjtZ'  +  •  •  •  .  ( 10) 

For  each  of  the  correlation  coefficients  in  each  of  the  29  states,  the  leading  coefficient 
in  the  1  /Z  expansion  was  computed  from  hydrogenic  wave  functions,  and  one  or 
two  higher  order  coefficients  were  obtained  by  fitting  and/or  differencing  our  cal¬ 
culated  values;  tables  of  these  1  /Z  expansion  coefficients  may  be  found  elsewhere 
[11].  The  figures  show  that  most  of  the  radial  t  decrease  linearly  witn  1  /Z,  showing 
that  the  higher  order  terms  in  Eq.  ( 10)  are  not  significant  for  Z  S;  2.  By  contrast, 
the  angular  t  show  distinctly  nonlinear  variation  with  1  /Z. 

The  variations  with  L  and  S  are  not  simple.  We  set  aside  Try  and  T|/,  because 
radial  correlation  in  the  K-shell  and  angular  correlation  in  the  Rydberg  region  are 
less  important.  The  t,  and  Tr^  show  that  radial  correlation  is  more  important  in 
the  S  states  than  in  the  P  and  D  states.  This  is  reasonable  since  the  spherically 
symmetric  S  states  are  less  conducive  to  angulai  correlation  than  the  L  >  0  states. 
The  Tyyr  and  Ty  suggest  that  angular  correlation  is  most  significant  in  the  P  states, 
and  confirm  [4]  that  positixe  angular  correlation  coefficients  are  found  only  for 
large  Z  and  only  in  the  'P  states.  The  only  plausible  explanation  that  we  can  offer 
for  these  observations  is  that  they  may  be  artifacts  of  the  probe  function  used;  it 
may  be  interesting  to  test  probe  functions  with  more  general  angular  dependence. 
The  triplet  is  always  the  more  compact  state  of  the  singlet-triplet  pair  associated 
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with  each  Z,  n,  and  L  [12];  however,  the  correlation  coefficients  do  not  show 
behavior  that  can  be  attributed  to  this  size  effect.  Radial  correlation  seems  more 
important  in  'P  states  than  in  states.  Detailed  studies  of  pair  correlation  densities 
may  help  explain  some  of  these  features  of  the  P  states. 
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Abstract 

Supermoleculai  model  of  the  intermolecular  interaction  energy  is  studied  The  correlation  energy 
components  are  calculated  by  using  the  many-body  perturbation  theory  (mbpt)  in  the  canonical  rep¬ 
resentation  as  well  as  in  the  localized  one.  The  components  of  the  corrections  of  the  different  order  are 
analyzed  individually.  The  effects  of  the  localization  procedure  and  the  basis  set  are  also  investigated 
An  H2O  tnmer  is  used  to  study  the  two-  and  three-body  components  of  the  interaction  energy.  C  1993 
John  Wiley  &  Sons.  Inc 


Introduction 

The  difficulties  of  the  determination  of  the  intermolecular  interactions  in  a  theo¬ 
retical  way  are  numerous.  The  various  problems  of  the  quantum  chemical  calcu¬ 
lations  are  the  subject  of  several  excellent  review  articles  (some  of  them  are  not 
restricted  to  the  discussion  of  two  interacting  systems).  It  is  well  known  from  ex¬ 
perience  that  a  large  number  of  intermolecular  interactions  behave  itself  in  quite 
a  different  way  when  they  are  acting  in  a  reaction  catalyzed  or  uncatalyzed.  The 
presence  of  water  molecules,  e.g.,  may  effect  seriously  the  interaction  energy  of 
reacting  systems.  The  interaction  between  the  molecules  CH2O  and  NH3  is  an 
example  for  this  case  [  1 ,2  ] .  It  was  shown  [  3  ]  that  a  single  ancillary  water  molecule 
interacts  favorably  with  the  reacting  system.  It  was  also  pointed  out,  in  the  same 
study,  that  the  incorporation  of  a  second  water  molecule  leads  to  a  further  decrease 
the  distance  between  the  interacting  formaldehyde  and  ammonia  molecules. 

One  can  thus  conclude  that  the  ancillary  water  molecules  aid  the  catalysis.  The 
accurate  study  of  this  and  similar  interactions  are  of  incredible  importance  because 
of  their  biological  activity.  Although  the  above  calculations  have  been  performed 
on  ah  inttio  SCF  level,  some  attempts  have  already  been  done  including  the  effects 
of  electron  correlation,  too  [4] .  It  is  generally  believed  that,  for  serious  investigations 
of  intermolecular  interaction  energies,  the  correlation  component  must  be  taken 
into  account  [5]. 

Tremendous  efforts  have  been  made  in  the  past  years  for  shedding  light  on  the 
understanding  the  van  der  Waals  bonding,  the  hydrogen  bonding,  the  solvatation 
effects,  etc.,  using  a  large  variety  of  quantum  chemical  methods  [Nonadditive  po- 
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tential,  configurational  interaction  (Cl),  many-body  perturbation  theory  (MBPT), 
coupled  cluster  theory  (CC)]  [6-20]. 

The  theoretical  investigations,  using  ab  mitio  methods  including  the  study  of 
correlation  energy  components  are  varying  from  the  exhausting  analysis  of  the 
dipole  moments  [1 1],  of  the  dipole-dipole  exchange,  etc.  partitioning  [12]  up  to 
the  discussion  of  these  components  in  the  localized  version  of  the  mbpt  [13-16]. 
Some  of  the  present  authors  also  contributed  to  these  investigations  [17,18]. 

The  important  role  of  the  so  called  basis  set  superposition  error  (BSSE)  has  also 
been  studied  in  several  papers  (see,  e.g..  Refs.  [19-21]  and  references  therein). 

As  the  interaction  of  water  molecules  is  of  basic  importance  in  chemical  and 
biochemical  systems  many  papers  have  chosen  them  as  model  [7,20,21].  Recently 
various  papers  have  been  published  on  the  van  der  Waals  interaction  in  the  (H2O) 
trimer  systems.  In  Ref.  [8]  the  nonadditive  potential  of  Campbell  and  Mezei  is 
used  to  determine  the  local  minima  of  the  energy  surface.  In  Refs.  [22,23]  the 
interaction  energy  is  calculated  by  MBPT  with  fairly  large  basis  sets.  For  the  water 
dimer  the  localized  version  of  the  MBPT  was  successfully  applied  [24,25]. 

Theory 

The  MBPT  in  the  supermolecular  model  starts  as  taking  zero-order  approximation 
the  HF  approximation  of  the  interacting  total  system.  The  higher-order  terms  contain 
the  many-electron  effects. 

The  symmetry  adapted  perturbation  theory  (sapt  )  (see  Ref.  [26]  and  references 
therein.)  starts  with  the  noninteracting  monomers  as  zero-order  approximation.  It 
is  a  double  perturbation  theory,  the  perturbations  are  ( I )  the  intermolecular  inter¬ 
action  and  (2)  many-electron  perturbation. 

Our  method  is  a  version  of  the  supermolecular  model  [17,18.24].  The  occupied 

(4^,,  i  =  1.2 . N)  and  the  virtual  (ipa,  a  =  N  +  1,  A  +  2 . M)  solutions  of 

the  canonical  HF  equations  are  localized  separately  by  unitary  transformations 

<P.  =  2  U,jij ,  ( 1 ) 

J 

<P<.=  'Zv,,4^h.  (2) 

h 

The  off-diagonal  Lagrangian  multipliers  are  incorporated  into  the  Fock  operator 
of  the  localized  representation  (the  zero-order  Hamiltonian), 

F'-  =  F  -  2  |/>c,Xy|  -  Z  \a}ca,(h\  (3) 

i.j  o,h 

i^j  a^h 

and  also  appear  in  the  perturbation  operator 

H'/'oe  =  Z  {  2  +  2 

A--!  l/.v*--!  a,b=\+\ 

i^j  a^b 


(4) 
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The  perturbation  operator  consists  of  two  terms:  H'is  a  two-particle  interaction, 
and  the  second  and  third  terms  are  single-particle  perturbation  due  to  the  LMOs.  It 
means  that  the  mbpt  in  the  localized  representation  is  a  double  perturbation  theory. 
The  perturbational  energy  corrections  have  two  indices:  n  denotes  the  order 

with  respect  to  the  two-particle  perturbation,  and  m  denotes  the  order  with  respect 
to  the  single-particle  perturbation. 

Summing  over  the  second  indices  to  infinite  order 

i  (5) 

m=  1 

we  obtain  the  canonical  perturbational  series.  The  localization  (1),  (2)  can  be 
carried  out  by  an  ordinary  Boys’  procedure  [27]. 

It  is  usually  suitable  for  the  occupied  orbitals,  but,  for  the  virtual  subspace  par¬ 
ticularly  in  case  of  larger  basis  sets,  the  ordinary  Boys’  procedure  disperses  the 
centroids  of  the  LMOs,  and  the  virtual  LMOs  are  not  localized  in  the  same  regions 
as  the  corresponding  occupied  ones.  As  can  be  seen  from  Eq.  ( 4 ) ,  the  single-particle 
perturbation  will  be  small  if  the  LMOs  are  well  separated  and  the  off-diagonal  La- 
grangian  multipliers  are  small  at  the  same  time.  These  two  requirements  cannot 
be  satisfied  simultaneously  by  an  ordinary  Boys’  procedure.  We  are  using  the  fol¬ 
lowing  modified  procedures.  For  medium-sized  basis  sets  for  the  localization  of 
virtual  LMOS,  taking  into  account  the  spatial  symmetry  of  CMOS,  we  partition  them 
into  several  groups  and  localize  these  groups  separately. 

In  this  way  the  LMOs  will  be  less  localized,  but  the  off-diagonal  Lagrangian  mul¬ 
tipliers  will  be  smaller  ( see  Ref  [  28  ] ) .  With  these  modification  we  obtained  adequate 
LMOs  for  many  cases.  For  large  basis  sets  we  suggest  another  modification:  local¬ 
ization  of  the  LMOS  according  to  a  given  localized  set  of  functions  (see  Ref  [28]). 

The  essential  character  of  our  method  that  LMOs  are  used  also  in  the  virtual 
space.  As  a  consequence,  every  effect  can  be  attributed  to  a  certain  “localized”  part 
of  the  wavefunction. 


Calculations 

Our  purpose  is  the  investigation  of  the  correlation  energy  components  of  the 
interaction  energy  as  function  of  the  distance  of  the  monomers.  Therefore,  the 
geometries  have  not  been  optimized.  We  carried  out  the  calculation  for  various 
geometries.  Water  molecule  I  and  the  O  atoms  of  molecule  II  and  III  lie  in  the  xv 
plane,  and  the  OH  bonds  of  molecule  I  point  towards  the  lone  pairs  of  molecules 
II  and  III.  The  intramolecular  distances  are  taken  as  fixed  and  the  intermolecular 
distances  are  increased  from  d  =  4.0  a.u.  to  t/  =  12.0  a.u.  (see  the  Appendix). 

We  applied  standaiti  6-3 IG*  basis  set,  which  is  insufficient  for  small  systems  but 
for  larger  (especially  biological )  systems  is  widely  used.  We  think  that  the  standard 
basis  set  is  suitable  to  carry  out  preliminary  inquiry  in  the  application  of  LMOs. 
For  these  geometries  Hartree-Fock  calculations  have  been  carried  out  for  the  trimer, 
dimer,  and  monomer  systems  [28-32].  The  calculations  have  been  carried  out  for 
the  corresponding  dimer  and  monomer  systems  but  with  trimer,  dimer,  and  mono- 
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Table  I  The  comnbutions  of  the  correlation  energy  to  the  interaction  energy  of  the  (H20)3  tnmer 
at  different  geometries  (in  /ihartree  units)." 


4.0 

5.0 

6.0 

8.0 

10.0 

12.0 

ASCF 

-550.09 

-4241.91 

-2329.45 

-514.66 

-190.34 

-85.47 

-2891.92 

-1771.33 

-645.426 

-50.10 

-13.23 

-4.85 

700  714 

455.35 

1008.08 

813.15 

4404.02 

8536.12 

9455  90 

336  61 

1702.89 

5  64 

29656.49 

-15114.48 

"  CP  corrected  results. 


mer  basis  sets  in  order  to  use  the  full  counterpoise  (CP)  method.  For  example,  the 
binding  energy  of  the  trimer  AEi  n  m  are  calculated  in  the  following  way: 

A£i  11  111  =  E\  II  III  —  £(i  II  111  basis)  “  £(1 II  III  basis)  ~  £("11  III  basis)  • 

This  correction  is  calculated  for  every  order  of  the  perturbation  theory  similarly. 
In  the  localized  representation  the  BSSE  can  be  taken  into  account  in  a  dif¬ 
ferent  way. 

The  two-electron  integrals  are  transformed  to  those  in  terms  of  canonical  mo¬ 
lecular  orbitals  (CMOs)  and  the  second-,  third-,  and  fourth-order  perturbational 
corrections  are  calculated  by  the  canonical  many-body  perturbation  theory 
(CMBPT)  [33-38]. 

The  results  for  the  water  trimer  are  shown  in  Table  1.  for  certain  geometries.  The 
contributions  to  the  interaction  energy  are  given  in  detail  (ASCF,  A£'^’,  A£'^’, 
etc.)  applying  the  Boys-Bernardi  full  counterpoise  correction  and  without  it  (see 
Table  II.)  To  show  the  effect  of  increasing  the  basis,  we  repeated  the  calculation 
for  water  dimer  with  two  c(-sets  {d  exp:  0.73  and  d'  exp:  0.30).  The  results  are 
shown  in  Table  III.  and  compared  to  those  obtained  with  single  d  set.  It  can  be 
seen  that  the  CP  corrected  second  order  interaction  energy  component  increased 
by  27%.  (The  increase  of  the  total  energy  is  much  less.)  For  certain  geometries  the 
CMOS  were  localized  by  using  the  modified  version  of  the  Boys  procedure  [28].  As 
we  have  already  pointed  out,  the  LMOs  have  very  interesting  properties. 


Table  II  The  contributions  of  the  correlation  energy  to  the  interaction  energy  of  the  (H20)3  tnmer 
at  different  geometncs  (in  /ihartree  units)." 


4.0 

5.0 

6.0 

80 

10.0 

12.0 

ASCF 

-3885.72 

-4923.23 

-2603.06 

-513.18 

-1999 42 

94.47 

A£'2> 

-5223.23 

-2493.15 

-83006 

-56.38 

-13.26 

-4.85 

AE'” 

1036.28 

559.39 

-260.11 

-394.94 

-2195.04 

-4260.51 

A/-<"' 

-8997.01 

-452  71 

-1027.68 

-879.18 

-14800.04 

-5957.04 

Results  without  CP  correction 
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Table  III.  The  change  m  the  components  of  the  interaction  energy  of  the 
water  dimer  when  the  basis  set  was  increased  * 


6-3 1G» 

-345.21 

-6129.98 

-4644.28 

6-3 1G»  +  d‘ 

-468.83 

26.32 

-97.44 

'  At  the  geometry  d  =  6.0  a.u.  (in  /ihartree). 


Several  contributions  of  the  lmos  for  the  monomer,  dimer,  and  trimer  systems 
can  be  investigated.  The  transferability  of  the  Fock-matrix  elements,  for  example, 
and  of  the  two-electron  integrals  over  lmos  have  advantages.  Good  results  have 
been  obtained  for  any  interacting  dimer  systems  studied  so  far  by  our  method. 

The  characterization  of  the  lmos  can  be  done  in  different  ways.  We  give  the 
coordinates  of  centroids  for  the  occupied  bond-  and  lone-pair  LMOs  as  well  as  the 
largest  centroid  lengths  of  centroids  for  the  virtual  LMOs.  The  centroid  vector  r,  of 
an  LMO  is  defined  as  r,  =  (y?,  Ir|^, ).  The  results  obtained  for  the  systems  studied 
at  several  intermolecular  distances  are  in  agreement  with  our  earlier  conclusions: 
All  of  the  LMOS  are  well  separated.  The  contributions  of  lmos  are  expected,  therefore, 
to  show  systematic  changes  when  enlarging  the  intermolecular  distances.  Up  to 
correlated  level,  we  investigated  the  correlation  energy  components  of  all  of  the 
monomer,  the  dimer,  and  trimer  systems,  respectively,  at  the  second  and  at  third 
order  of  the  mbpt.  The  results  obtained  at  the  mp2  and  the  mp3  levels  both  in  the 
canonical  (CMBPT  )  and  in  the  localized  (lmbpt  )  representation  make  possible  a 
comparison  between  the  corresponding  values  and  thus  to  get  information  on  the 
“quality”  of  the  lmbpt.  The  results  are  given  in  Table  IV  and  Table  V,  respectively. 
One  can  see  that  for  all  the  systems  studied  the  values  of  cmbpt  and  lmbpt  are 
close  to  each  other.  It  is  encouraging  however,  that  the  deviations  between  the 
corresponding  canonical  and  localized  correlation  energy  are  less  than  10%  in 
each  case. 

The  transferable  properties  of  lmos  as  well  as  the  satisfactory  agreement  between 
the  canonical  and  localized  energy  values  both  at  the  mp2  and  mp3  allowed  us  to 
calculate  the  one-,  two-,  and  three-body  interaction  energies  at  the  different  levels 
of  correlation. 


Table  IV.  Interaction  energy  calculated  by  the  I.MBPI  in  the  water  tnmer  at  t/  =  5  0 

geometry. 


Uncorrcctcd 

Bssi;  free 

2nd  order 

One-body 

-558.30  mhartrcc 

-560.72  mhartrcc 

Two-body 

-97 1 .48  /ihartrec 

-993.42  fihartree 

3rd  order 

One-body 

-15691.36  /ihartrcc 

-17101  65  /ihartree 

Two-body 

4.66  /ihartrcc 

27.57  /ihartrcc 

Three-body 

-0  89  nhartrec 

-0  34  /ihartrcc 
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Table  V.  Intcractton  energ>  calculated  by  the  lmbpt  in  the  water  trimer  aid  =  6.0 

geometry 


Uncorrected  BSSE  free 


2nd  order 

One-body 

-557  3 1  mhartree 

-534.42  mhartree 

Two-bodv 

-322.77  /ihartree 

-254.48  /ihartree 

3rd  order 

One-body 

17542.56  /ihartree 

-17599.84  /ihartree 

Two- body 

7.02  /ihartree 

7.57  /ihartree 

Three-body 

-0.21  /ihartree 

-0.21  /ihartree 

It  has  already  been  pointed  out  [18,23]:  when  the  (localized)  basis  sets  are  non¬ 
overlapping,  the  intramolecular  parts  can  be  assigned  to  each  monomer  separately. 
For  a  trimer  (abc)  system,  for  example,  the  intrasystem  (one-body)  correlation 
energies  can  be  obtained  if  we  sum  over  only  those  occupied  and  virtual  LMOs’ 
indices  in  the  so  called  pair-correlation  energy  terms  (see  details  in  [24])  which 
belong  only  to  the  monomer  A,  only  to  the  monomer  B.  or  only  to  the  monomer 
C,  respectively.  The  two-body  terms  can  be  expressed  in  a  similar  way:  the  indices 
run  over  only  the  corresponding  lmos  of  the  monomers  A  and  B.  or  A  and  C,  etc. 
The  two-body  terms  can  also  be  named  the  dispersion  energy  part  of  the  correlation 
both  at  the  mp2  and  mp3  levels.  The  results  obtained  for  the  intrasystem  correlation 
energies  for  the  water  monomer  and  dimer  have  already  been  discussed  in  [24]. 
Here  we  present  the  values  obtained  for  the  many-body  terms  at  the  mp2  level  (see 
Table  IV,  first  part).  The  advantage  of  using  the  LMBPT  theories  lies,  furthermore, 
in  the  calculation  of  the  three-body  interaction  terms  ( in  similar  manner  as  described 
above).  The  results  obtained  for  the  studied  tnmer  system  are  presented  in  Table 
IV  and  Table  V.  These  values  are  significantly  smaller  than  the  tv  o-body  terms,  as 
expected. 

The  localized  representation  of  the  mbpt  (lmbpi  )  are  carried  out  up  to  third 
order  As  shown  in  Eq.  (5)  summing  over  the  single-particle  perturbation  (order 
m)  we  obtain  the  canonical  series.  In  our  case  if  we  sum  over  m  to  2.  for  n  =  2. 


Table  VI. 
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0.0 
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00 
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1.4.306 

0.0 

0.0 
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0.0 
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MjOdll) 
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Figure  1.  Water  tnmcr  (see  Appendix). 


we  obtain  the  canonical  second  order  correction  with  an  error  of  0.28%.  If  we  omit 
completely  the  single-particle  perturbation,  i.e.,  if  we  evaluate  the  canonical  diagram 
with  localized  orbitals,  we  obtain  94.96%  of  the  canonical  second-order  correction. 
If  we  sum  over  m  to  4  for  «  =  3,  we  obtain  a  corresponding  canonical  third-order 
correction  with  an  error  of  3.7%.  In  our  calculations  we  stopped  the  summations 
at  these  orders.  Omitting  the  single-particle  perturbation  from  the  third-order  terms, 
we  overestimate  the  canonical  value  by  10.1 1%.  It  means  that,  in  the  LMBPT,  mp2 
level  calculations  contain  a  finite  summation;  consequently,  it  is,  literally,  a  near 
mp2  calculation.  Also  mp3  level  calculations  are,  literally,  a  near  mp3  calculation. 

The  localized  representation  has  some  advantages.  The  interaction  energies  can 
be  partitioned  according  to  their  source.  The  occupied  orbitals  belong  uniquely  to 
one  of  the  interacting  systems.  The  interactions  accordingly  can  be  partitioned  as 
one-,  two-,  three-body,  etc.  interactions.  In  a  trimer  system  in  second  order  we 
obtain  one-  and  two-body  interactions.  In  third  order  we  obtain  one-,  two-,  and 
three-body  interactions. 

Another  advantage  of  the  localized  representation  is  that  we  can  easily  find  out 
the  source  of  the  BSSE.  The  LMOs  are  approximately  transferable  from  the  nonin¬ 
teracting  monomers  to  the  interacting  systems;  by  utilizing  this  we  can  omit  the 
contributions  of  the  LMOs  which  comes  from  the  other  monomer.  The  BSSE  cal¬ 
culated  by  this  method  are  usually  smaller  than  those  calculated  by  Boys-Bernardi 
procedure. 


Appendix:  Model  Geometry  Taken  for  the  System  Studied 

The  intramolecular  values  (O — H  =  1.809  bohr,  HOH  =  104.52° )  are  the  same 
for  the  three  HiO  molecules.  The  distance  between  monomers  was  systematically 
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enlarged,  starting  from  an  intermolecular  distance  (J)  of  4.0  bohr,  and  then  increased 
through  5.0,  6.0,  8.0,  10.0  up  to  12.0  bohr.  The  coordinates  for  the  first  geometry 
arrangement  are  given  in  Table  VI. 
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Abstract 

A  review  is  presented  of  our  recent  work  which  deals  with  some  aspects  of  the  inclusion  of  connected 
tnple  excitations  in  single  reference  coupled-cluster  theory  We  first  consider  basic  aspects  of  the  theory 
and  computational  cost,  noting  the  need  for  accurate  and  economical  approximations  to  the  complete 
coupled-cluster  single,  double  and  triple  excitation  method  (CCSDT ).  Several  such  approximations  are 
now  aw-ilable,  including  the  ccsD  ccSD)  and  cxrsD(T)  methods.  We  have  now  efficiently  implemented 
analytical  denvative  methods  for  these  and  related  methods  for  open-shell  reference  determinants,  including 
restricted  open-shell  Hartree-Fock  for  ccsd(t  ).  Some  results  with  large  basis  sets  are  presented  for  the 
HCO  radical  which  further  demonstrate  the  reliability  of  these  methods  for  the  spectroscopic  characterization 
of  small  open-shell  species  The  remainder  of  this  article  deals  with  several  applications  of  the  complete 
CCSDT  method,  a  new  implementation  of  which  we  have  recently  completed  Like  our  previous  work, 
this  is  for  general  single  reference  determinants,  although  we  have  now  incorporated  Abelian  symmetry 
We  present  a  comparison  with  full  configuration  interaction  data  for  CHj;  a  study  of  the  Nj  potential 
curve  using  restricted  and  unrestneted  Hartree-Fock  references,  a  comparison  of  the  dipole  moments 
and  polarizabilities  of  BeO,  and  a  study  of  the  harmonic  force  field  of  ^2  J  CJ .  These  studies  also  provide 
assessments  of  the  accuracy  of  the  approximate  schemes  for  including  T3.  Finally,  we  present  some 
timing  data  for  some  CCSDT  calculations  with  our  new  implementation  1993  John  Wiley  &  Sons,  Inc 

Introduction 

Coupled-cluster  (cc)  methods  are  now  widely  used  to  compute  the  electronic 
correlation  energy  of  atoms  and  molecules,  as  well  as  the  correlation  energy  of  a 
wide  variety  of  other  physical  systems.  The  theory  was  introduced  by  Coester  and 
Kummel  to  study  nuclear  matter  [1].  It  was  first  formulated  for  the  electronic 
structure  of  atoms  and  molecules  by  Cizek  [2]  in  the  1960s.  These  methods  were 
later  applied  within  a  semi-empirical  framework  and  to  a  minimal  basis  set  ab 
initio  study  of  BH3  [3].  Several  years  after  these  pioneering  studies,  the  coupled- 
cluster  doubles  (CCD)  method  was  independently  implemented  in  general  purpose 
programs  by  Bartlett  and  Purvis  [4]  and  Pople  et  al.  [5].  In  the  subsequent  15 
years  there  has  been  intense  activity  in  the  field,  which  has  led  to  clear  definitions 
and  efficient  computational  implementations  of  a  wide  variety  of  CC  methods,  and 
there  have  been  many  applications.  Several  reviews  [6]  document  this  work. 

A  wide  variety  of  methods  are  included  by  the  term  cc  methods.  There  are 
several  categories,  all  of  whose  wave  functions,  at  least  in  the  exact  versions  of  the 
theories,  are  obtained  through  the  action  of  some  kind  of  exponential  wave  operator 
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on  one  or  more  determinants  of  orthonormal  spin  orbitals.  When  there  is  just  one 
reference  determinant,  we  have  the  single  reference  version  of  the  theory,  which  is 
the  most  highly  developed  and  widely  used.  There  are  several  kinds  of  multireference 
versions  of  the  theory,  including  the  Fock  space  [  7  ]  and  Hilbert  space  [  8  ]  theories. 
These  fields  are  now  being  actively  investigated  and  applied.  As  well  as  the  single 
and  multireference  methods,  there  is  the  eqmtion-of-motion  method  (also  termed 
linear  response  CC  method),  which  may  be  viewed  as  a  generalization  of  the  single 
reference  theory  to  excited  states  [9). 

In  this  article  we  focus  on  two  aspects  of  the  inclusion  of  triple  excitations  in  the 
single  reference  theory.  The  first  is  concerned  with  the  evaluation  of  analytical 
derivatives  of  the  energy  for  several  cc  methods  which  include  triple  excitations 
noniteratively.  The  second  desciibes  some  recent  research  with  the  complete  cc 
single,  double,  and  triple  excitation  method  (CCSDT ).  Along  with  results,  we  shall 
include  some  timings  of  ccsdt  calculations  with  a  new  implementation  of  this 
method.  This  will  help  to  assess  the  degree  to  which  this  most  rigorous  way  of 
including  effects  of  triple  excitations  can  now  be  applied  to  chemical  problems. 
The  emphasis  throughout  this  article  is  on  the  results  rather  than  on  the  details  of 
the  theory.  We  shall  present  some  basic  equations  but  will  not  go  into  details,  which 
are  covered  in  cited  literature. 


Basics  of  CC  Theory 


Formal  Aspects 

Single  reference  CC  theory  begins  with  a  reference  determinant  of  orthonormal 
spin  orbitals,  |  ^>o)-  This  may  be  made  up  of  SCF  orbitals,  but  this  is  not  necessary. 
The  CC  wave  function  is  obtained  by  operating  on  |  <i>o)  with  the  exponential  wave 
operator: 

=  (1) 

where  T  is  the  sum  of  one-body,  two-body,  three-body,  etc.  cluster  operators 

T=T,  +  T2+T,+  ’--  (2) 

These  are  given  in  second-quantized  form  by 

T,  =  (3) 

la 

(4) 

ijah 

and  so  on.  The  ,  tij’,  are  termel  the  cc  amplitudes,  and  obtained  by  solution  of 
the  CC  equations.  The  cc  equations  and  energy  expression  are  obtained  by  inserting 
I  <l>„  )  in  the  Schrodinger  equation,  operating  on  the  left  by  e~^,  and  projecting  on 
the  left  with  either  the  reference  determinant  or  an  excited  determinant.  Writing 
the  energy  as 
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A'  =  (<I>„|//|<i>„>  +  A/;.,  (5) 

and  defining 

//.^  = //- <<l>«!//|<I>o)  (6) 

we  obtain  equations  for  the  amplitudes  [2] 

(Kt'  I  f  '//^(''  |<I>(.>  =  ).|<l>o)  -  0  (7) 

and  the  correlation  energy 

SI-,,  =  'Use' \%}  =  <'b,.l(//Ac'  (8) 


The  subscript  (  indicates  the  restriction  to  “connected"  diagrams  [2,  10],  Although 
presented  m  this  abstract  and  general  form,  these  equations  may  readily  be  written 
in  terms  of  Fock  matrix  elements,  two-electron  integrals,  and  cluster  amplitudes 
[10],  They  are  nonlinear  in  the  amplitudes,  but  by  suitable  definitions  of  inter¬ 
mediates  may  be  rewritten  so  that  they  arc  simply  a  scries  of  contractions  of  an 
amplitude  and  an  intermediate,  operationally  taking  on  the  appearance  of  equations 
which  arc  linear  in  the  amplitudes  [11,-12],  and  arc  thus  well-suited  to  exploiting 
vector  computer  architectures. 

Truncal  ion  and  C  'omputa!  tonal  Cost 

The  formal  theory  is  all  well  and  good,  but  without  approximation  it  is  just  the 
cc  representation  of  the  full  configuration  interaction  (Id)  wave  function  and 
energy.  Therefore  in  order  to  obtain  computationally  feasible  cc  methods,  the 
cluster  operator  /'  is  truncated,  just  as  Cl  expansions  arc  truncated  For  a  system 
represented  by  n  occupied  orbitals  and  A’  virtual  orbitals,  the  costs  of  the  different 
truncations  scale  as  indicated  in  Table  I.  The  rapid  increase  in  cost  means  that  in 
order  for  these  methods  to  be  of  practical  value,  a  large  fraction  of  the  correlation 
energy  must  be  recovered  by  low-order  cluster  operators.  Fortunately.-  this  is  the 
case.-  with  '/T  accounting  for  a  considerable  fraction  of  the  dynamic  correlation 
energy  (i.c.  the  cct)  method  [4,5]).  Important  orbital  relaxation  eftccts  are  ac¬ 
counted  for  by  7  |.  so  that  for  many  purpo.ses  the  “singlcs-and-doublcs"  model 
(cesD)  [  13]  IS  a  very  good  approximation,  much  better,  for  example,  than  the  CLSD 
approximation,  even  with  the  usual  l^nghotf-Dav idson  unlinked  diagram  corrcc- 


Tabii  1  the  siMlinp  of  ihc  cost  of  various  ((  approvmialions 


Trunealion  of  /' 

Meihod 

Cosl/iteration* 

/,  t  /.' 

<(  SI) 

n'S\  n'S\  n'S- 

7,  f  /  ':  1  /  ', 

(<SI>I 

ii'X'.  /I'.V,  /I'.V' 

/,  1  /■;  1  /<  1  /) 

(<  S!)IQ 

iiW*.  /i'.\',  /I^\■' 

“  /land  A'arc,  icspeclivcly,  numbers ol iKCupicd  and  virtual orbilals 
I  he  evael  operalion  eounl  is  propoilional  lo  ihese  numbers 
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tion  [14],  However,  in  order  to  achieve  high  accuracy,  one  must  include  T^.  Judging 
by  comparison  with  r  ci  results  for  small  systems,  the  CCSDT  method  [15,16,17] 
does  indeed  seem  to  be  a  rather  accurate  method.  Adding  Ta  either  through  the 
CCSDTQ  model  f  1 2  ]  or  various  approximations  thereto  [  1 8, 1 9  ]  shows  usually  small, 
but  sometimes  significant,  improvements. 

Unfortunately,  even  the  CCSDT  approximation  is  somewhat  too  expensive  for 
most  applications.  Therefore,  a  lot  of  effort  has  been  devoted  to  finding  ways  of 
approximating  CCSDT  yet  reducing  the  cost.  Two  approaches  have  been  followed. 
The  first  is  iterative  approximations,  in  which  the  Tj  equation  is  truncated  but  is 
still  iterated.  This  leads  to  methods  termed  CCSDT- 1  [20,21],  ccsdt-2  [22],  and 
CCSDT-3  [22].  The  second  approach  makes  what  sounds  like  an  even  more  drastic 
approximation,  which  is  not  to  iterate  the  equation  at  all.  but  to  estimate  the 
contnbution  of  Tj  to  the  energy  from  CCSD  T,  and  T2  amplitudes  using  equations 
based  on  perturbation  theory  [10].  This  has  led  to  several  methods,  including 
CCSD  +  t(ccsd)  [21],  CCSD(t)  [23],  and  ccsd  +  tq*(ccsd)  [19].  The  first  and 
second  of  these  are  rather  economical,  requiring  a  ccsd  calculation  followed  by  a 
single  evaluation  of  the  Ti  energy,  the  latter  requiring  a  cost  proportional  to 
+  II* N^).  The  third  method  is  more  sophisticated  and  includes  connected  quadruple 
excitation  (Ta)  corrections,  but  is  correct  through  fifth  order  of  many-body  per¬ 
turbation  theory  and  has  a  cost  proportional  to  (n^N^  +  n*N*  +  n^N^).  The 
simple  noniterattve  approximation  CCSD(t)  works  remarkably  well  [24-29],  al¬ 
though  we  caution  the  reader  that  this  is  often  somewhat  fonuitous  [30].  It  is 
sufficiently  economical  to  be  used  with  over  200  basis  functions  when  a  molecule 
has  C2i  or  higher  symmetry  [31-33].  Accordingly,  this  and  the  closely  related 
QCISd(t)  approach  [34]  (which  is  equivalent  through  fifth  order  of  mbpt  )  are 
being  widely  used. 


Analytical  Derivatives  for  CC  Methods  with  Noniterative  Triple  Excitations 

As  we  discussed  in  the  last  section,  simple  noniterative  methods  of  including  Tj 
are  of  wide  applicability.  Therefore  efficient  analytical  evaluation  of  energy  deriv¬ 
atives  for  these  methods  is  of  considerable  value.  This  we  now  consider. 

Methods  for  the  analytical  evaluation  of  derivatives  of  the  CC  energy  are  now  at 
an  advanced  level.  These  techniques  are  essential  for  exploration  of  potential  energy 
surfaces,  including  location  and  characterization  of  stationary  points,  as  well  as 
computation  of  vibrational  frequencies  and  intensities.  They  may  also  be  used  for 
the  computation  of  fii'St-order  properties.  Although  only  first  derivatives  are  routinely 
available,  these  may  be  numerically  differentiated  to  obtain  higher  derivatives,  and 
this  is  cheaper  and  more  numerically  stable  than  numerical  differentiation  of  the 
energy. 

At  first  sight,  evaluation  of  derivatives  of  the  cc  energj  appears  to  pose  a  serious 
computational  problehi.  Thus,  the  derivative  of  the  energy  with  respect  to  some 
property  «  (for  example,  a  nuclear  displacement  or  electric  field)  may  be  written 
in  the  general  form 
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dE  dE  dX  dE  dc  dE  dl  ,  „ , 

da  dX  da  dc  da  dl  da 

when/(/i"^  is  a  function  of  the  perturbation.  If.  Here  X  denotes  basis  functions,  c 
denotes  mo  coefficients,  and  t  notes  CC  amplitudes.  For  nonvariational  (or 
more  precisely  nonstatiomry )  me  '  .aIs  such  as  cc  methods,  all  of  these  tei  ms  must 
be  evaluated.  For  variational  methods,  however,  one  or  more  of  these  terms  vanishes. 
For  example,  in  SCF  and  '  *^SCF  methods,  the  second  and  third  vanish,  and  for  Cl 
methods  the  third  term  vanishes.  For  cc  methods  the  third  t^rm  must  be  considered, 
and  this  appears  to  present  a  significant  difficulty.  Thus,  it  appears  that  in  order  to 
evaluate  the  cc  energy  derivative,  one  must  explicitly  evaluate  the  derivative  of  the 
cc  amplitudes,  which  would  require  solution  of  3N  cc-like  equations  for  com¬ 
putation  of  the  force.  Fortunately,  however,  further  investigation  showed  that  there 
was  an  alternative,  much  more  economical  evaluation  of  the  effect  of  these  amplitude 
derivatives  [35-38].  This  requires  the  solution  of  a  single  perturbation  independent 
set  of  equations  (the  A  equations  [38]). 

Recent  work  with  expectation  value  and  unitary  cc  schemes  [39]  has  brought 
to  light  a  straightforward  derivation  of  these  equations,  which  is  more  transparent 
than  the  original  derivations  and  provides  more  insight  into  cc  energy  derivatives. 
Consider  the  functional 

A£(A,  r)  =  <-I>o|(l  +A)c-V/ac'|$o)  (10) 

where  A  is  a  de-excitation  operator  defined  by 

A  =  2  Kii^a}  +  2  +  •  •  •  (11) 

ai  abt/ 

If  we  attempt  to  make  A£'(  A,  7)  stationary  with  respect  to  A  amplitudes,  then 
we  obtain  the  cc  equations.  For  example,  differentiating  with  respect  a  particular 
amplitude  .  and  setting  the  result  equal  to  zero  yields 

(^olU^arh-  ■■}\(P+Q)e-^Hf,e^\%) 

=  (%\{raj^b-  •  • }  ih><h|  e-'7/A,c'  |<I>o>  =  0  (12) 

where  P  is  the  reference  state  projector,  |(f’o)(4>o|  and  Q  is  the  projector  for  the 
manifold  of  excited  determinants  lh){h|,  and  we  have  used  P  +  Q  =  1.  It  follows 
that 


<h|e  '■//;ve^|<l>o>  =  0  (13) 

and  we  have  recovered  the  cc  equations  ( Eq.  ( 7 )  ] .  Similarly,  if  we  attempt  to  make 
the  functional  stationary  with  respect  to  T  amplitudes  we  find 

<4>„|(1  +  A)e-^//A.cMh>- A£,<<f>o|A|h>  =  0  (14) 

which  are  the  A  equations  [  38  ] .  If  the  functional  is  stationary  with  respect  to  A 
and  T,  it  follows  that 
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(9A£(A,  r) 

da 


=  <<I>ol(!  +  X)e  '  H\y) 


(15) 


From  this  it  is  readily  apparent  that  the  derivative  of  the  CC  (and  mbpt)  energy 
has  the  same  general  form  as  that  for  ci  and  other  methods  [40],  namely,  after 
(ransforming  to  thciatomic  orbital  basis  set 


dE 

da 


dh. 


—  +  7 
da 


da 


d(^fiv\\a'y 


da 


(16) 


That  is,  the  derivative  is  expressed  in  terms  of  derivatives  of  one-elcetron,  two- 
electron,  and  overlap  integrals.  The  matrices  D,  I,  and  G  are  independent  of  the 
perturbation  a,  but  of  course  contain  amplitudes  of  A.  The  derivatives  of  overlap 
integrals  arise  from  the  orbital  response. 

As  a  result  of  the  introduction  of  A,  analytical  derivatives  have  now  been  im¬ 
plemented  and  arc  routinely  evaluated  for  several  cc  methods,  including  ccsd 
[41-45],  CCSD(  T )  [ 46-49  ] ,  cesDT- 1  [  50  ] ,  as  well  as  for  qcisd  [51]  and  0CISD( T ) 
methods  [48,52], 

For  the  ccsd(t)  method  analytical  derivatives  have  been  implemented  for  closed- 
shell  RHF  reference  determinants  [46,47],  likewise  for  QCtSD(T)  [52],  In  recent 
research  we  have  extended  this  to  the  case  of  UitF  reference  determinants  for  the 
CCSD  +  t(ccsd),  ccsd(t  )  and  qcisd(t  )  methods,  as  well  as  mbpt(4)  [48],  This 
has  been  implemented  in  the  aces  ii  program  system  [53,54],  with  a  vectorized 
algorithm  and  Abelian  symmetry  incorporated  along  similar  lines  to.previously 
presented  work  [55],  The  resulting  program  has  been  used  in  several  applications, 
including  C4  [33],  Hcco  [56],  and  HOO  [48  J.  The  computational  effort  to  evaluate 
the  energy  derivative  is  about  twice  that  required  to  evaluate  the  energy. 

In  related  work  we  have  considered  derivatives  for  CCSD(T )  when  the  reference 
determinant  is  of  the  restricted  open-shell  Hartrce-Fock  (rohf)  type.  While  cc 
methods  largely  remove  the  spin  contamination  of  the  UHF  determinant  [6,57], 
this  is  not  always  the  case  and  consequently  it  is  sometimes  beneficial  to  use  spin 
eigenfunction  rohf  reference  determinants  (or  other  “quasi-RHF”  (qrhf)  deter¬ 
minants)  for  open-shell  systems  [57],  Just  defining  the  energy  expression  for  such 
a  method  is  not  entirely  straightforward  [45,49],  however,  and  has  led  to  confusion. 
We  earlier  presented  the  formalism  and  initial  results  for  rohf-ccsd(t)  [45], 
and  expanded  on  this  recently  [49],  Of  great  importance  is  the  need  for  a  trans¬ 
formation  to  scmicanonical  orbitals  [58]  (i.e.,  the  occupied-occupied  and  virtual- 
virtual  blocks  of  the  jpin  orbital  Fock  matrix  arc  diagonal).  This  removes  the 
equivalence  of  the  a  and  d  spatial  orbitals,  but  docs  not  alter  the  spin  eigenfunction 
characteristics  of  the  method.  If  one  docs  not  transform  to  scmicanonical  orbitals, 
it  is  necessary  to  iterate  the  Tj  equation  in  order  to  achieve  proper  invariance 
properties,  which  defeats  the  whole  purpose  of  noniterative  methods  such  as 
CCSDIT),  Building  on  a  previoius  implementation  of  rohf-ccsd  gradients  [45], 
we  have  now  implemented  analytical  derivatives  for  our  rohf-CCSD(t)  method 
[49],  The  cost  is  c.sscntiaily  the  same  as  in  the  uhf  case. 
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As  an  illustration  of  results  that  can  be  obtained  with  rohf-ccsd{t  )  gradients, 
we  show  data  obtained  for  HCC  with  several  basis  sets  [49],  compared  with  exper¬ 
iment  [59],  in  Table  H.  This  is  a  good  example  for  comparison  since  this  is  one 
of  the  few  open-shell  molecules  for  which  liamionu  frequencies  have  been  deduced 
from  experiment.  With  the  largest  basis  set.  very  good  agreement  with  expenment 
is  achieved.  This  example  and  work  on  HOO  [48]  illustrate  that  these  methods 
should  be  suitable  for  characterizing  small  transient  molecules.  It  should  be  em¬ 
phasized  that  the  basis  set  used,  while  large,  certainly  did  not  exhaust  our  capabilities. 
In  this  example,  for  which  UHt  spin  contamination  is  not  large,  we  also  see  the 
equivalence  of  UHF-  and  ROHF-based  results. 


Some  Recent  Research  with  the  cesDF  Method 

The  ceSDT  method  was  first  implemented  by  Noga  and  Bartlett  in  i987  [15], 
followed  by  Scuseria  and  Schaefer  in  1988  [16].  These  implementations  were  ex¬ 
plicitly  restricted  to  RHF  closed-shell  reference  determinants,  and  we  implemented 
the  method  for  general  closed-  and  open-shell  reference  determinants  in  1 990  [17]. 
CCSDi  is  rather  an  expensive  method,  and  chemical  applications  are  somewhat 
limited,  although  there  have  now  been  a  large  number  of  studies  of  small  systems 
with  DZP  or  slightly  larger  Lasts  sets  [15-17,27-.30,60,61].  Included  among  the 
larger  cesDT  calculations  are  our  studies  of  the  harmonic  force  fields  of  C.i  [61] 
and  Oj  [30]  and  a  study  of  the  hyperpolanzabilities  of  Ne  by  Rice  et  al.  [62],  Our 
initial  implementation  of  ceSDT  did  not  exploit  point  group  symmetry,  and  nor 
did  that  of  Noga  and  Bartlett.  We  have  recently  completed  another  implementation 
which  exploits  Abelian  symmetry  by  the  DPD  approach  [55],  and  this  has  been 
ir’plementcd  in  aces  ii  [53,54],  This  should  enable  us  to  tackle  somewhat  larger 
P''jbiems  with  Cf'SDT  than  has  hitherto  been  the  case.  This  will  be  discussed  later. 
First  we  present  results  of  four  scries  of  ccsi;)i  calculations. 


Table  II  uhe-  and  ROiir-ccsiXl )  caiculaicd  properties  of  .\  ’.F  nco 


Ref/Bas* 

Ak  /.'i 

rcoA'l 

Of 

oji ,  a.'2.  cs;t/( ni  ' 

R/421 

1.1212 

1  1974 

124.25 

2779 

1882 

1 1.16 

U/421 

1.1214 

1.1972 

124.24 

2778 

1890 

11.46 

R/5.12 

1  1198 

1.1812 

124  00 

2677 

1871 

11.42 

U/532 

1.1 199 

I  1811 

124  01 

2676 

1878 

1144 

R,/5.t2l 

1.1174 

1  1797 

124.41 

2727 

i896 

11.44 

R/.S4.'>I 

LI  16.4 

1.1804 

124.55 

2765 

189.4 

1121 

F.xpt 

l.l  19 

1.175 

124.4 

2790 

1920 

1126 

*  Reference  dclerminani  and  basis  set  R  and  U  denote  Roilt  and  i  til  reference  determinants  re¬ 
spectively.  42 1  denotes  a  4s2/)lr/  basis  set.  5.42  a  5sip2d  set.  and  so  on. 
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FCl  Comparison  of  Bond  Stretching  in  CHs 

As  bonds  in  a  molecule  are  stretched,  the  reference  determinant  becomes  a  pro- 
gressiveh  poorer  approximation,  and  great  demands  are  therefore  placed  on  single 
reference  methods  at  the  stretched  geometries,  cc  methods  have  been  compared 
with  FCi  results  for  several  molecules,  mostly  closed-shell  species  [15].  although 
there  have  been  some  open-shell  comparisons  [  1 7.34.49  ] ,  We  now  compare  the 
performance  of  the  ceSD,  ccsd(t).  and  cesDT  methods  for  the  simultaneous 
stretching  of  the  three  C — H  bonds  of  CH3.  The  FCi  data  are  from  the  work  of 
Bauschlicher  and  Taylor  [63].  uhf  and  rohf  reference  determinants  are  used  in 
the  cc  calculations.  The  data  are  shown  in  Table  111.  It  is  seen  that  both  cesD  and 
CCSD(  1  )  methods  are  unable  to  reproduce  the  energy  at  2  R,.  very  well  at  all.  More¬ 
over.  the  results  obtained  with  the  two  reference  determinants  are  quite  different. 
Moving  to  ceSDT,  however,  much  better  agreement  with  FC'I  is  achieved,  with  the 
energies  from  the  two  determinants  being  relatively  close  (the  FCi  results  will  be 
the  same  for  the  two  determinants,  of  course,  apart  from  the  small  difference  due 
to  different  dropped  core  orbitals).  The  UHF-based  results  su.fer  considerably  from 
spin  contamination,  some  of  which  is  removed  by  the  cc  methods,  especially  cesDT. 
It  is  noticeable  that  the  performance  of  CCSD(T )  at  stretched  geometries  appears 
to  be  poorer  for  open-shell  molecules  (e.g.,CH3.  NHi.  and  S1H2)  than  for  closed- 
shell  molecules  (e.g..  HiO),  regardless  of  whether  a  UHF  or  ROHF  reference  deter¬ 
minants  is  used  [49], 

Bond  Stretching  in  N: 

This  example  provides  another  stern  test  for  single  reference  ct  methods.  Some 
work  on  this  problem  was  done  some  time  ago  by  Laidig.  Saxe,  and  Bartlett  [64], 
who  investigated  the  behavior  of  several  single  reference  CC/MBPT  methods,  along 
with  MR-Cl  and  a  linearized  MR-cc  method.  With  an  RHF  reference,  all  single 
reference  methods  fail  in  vanous  ways,  yielding  curves  which  have  a  local  maximum 
before  falling  well  below  the  true  curve.  At  an  internuclear  distance  of  about  2.2a„. 


1  MU  I  III  B'jno  Stretching  in  A;  CH^.  Conipanson  with  full  1 1  fora  ozrbasisset  Data  arc  deviations 
from  1C  I  energies  in  10'  ’  The  rc'l  data  are  from  reference  6.V 


Ref." 

Mcihod 

1.5R, 

2R, 

ROlIl 

<  cso 

2  849 

8  84.1 

.17..122 

KOMI 

CCMXT) 

0  518 

2  in 

10.984 

KOIII 

<(.SDI 

-0.722 

0  1.18 

4.402 

x'Wl 

C  (  SI) 

2  790 

8.768 

19  771 

rill 

<CM>  1  1((<SI>) 

0.474 

2.2.10 

.12.157 

1  HI 

(  CMXl ) 

0  499 

2  669 

-28  908 

I  HI 

(  (  SDI 

0.744 

-  0.108 

1  7.12 

*  I  >  pc  of  reference  del  rminanl. 
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there  is  an  rhf-uhf  instability  of  (r„  symmetry,  and  so  a  UHF  determinant  of 
orbitals  may  be  used  for  larger  distances.  This  greater  flexibility  in  the  reference 
determinant  helps  the  cc/mbpt  methods,  as  shown  by  Laidig  et  al.  [64],  In  par¬ 
ticular,  it  was  found  that  the  UHF-CCSD  curve  was  quite  reasonable.  We  have  now 
investigated  this  problem  with  the  full  CCSDT  method  with  both  uhf  and  RHK 
reference  determinants  and  the  same  4i3  p  1  d  basis  set  as  used  by  Laidig  et  al.  The 
RHF  results  are  shown  in  Figure  1,  and  are  largely  in  line  with  those  of  Laidig  et 
al.,  even  the  full  rhf-ccsdt  method  failing  at  large  enough  distances.  The  UHF 
results  are  shown  in  Figure  2.  Here  the  behavior  is  much  better.  Addition  of  triple 
excitations  improves  the  cesD  results,  and  the  CCSDT  curve  is  in  very  good  agreement 
with  MR-ci.  Note,  the  ccsd(t  )  result  turns  over  from  spin  contamination,  although 
the  CCSDT  energy  does  not  appear  to  reflect  that,  but  other  properties  may.  It  is 
seen  from  this  work  that  UHF-based  cc  methods  can  describe  the  breaking  of  a 
triple  bond.  In  fairness  it  should  be  mentioned  that  the  UHF  reference  dissociates 
approximately  into  ground  state  atoms  in  this  case,  which  is  not  always  the  case 
(for  example,  O2).  Further  work  on  a  wider  class  of  systems  is  needed  to 
establish  fully  the  ability  of  UHF-based  single  reference  CC  methods  to  desenbe 
molecular  dissociation.  Incidentally,  when  Tj  is  included.  RHF-  and  UHF-based  CC 
methods  are  well  known  to  be  able  to  describe  dissociation  of  single  bonds  [64]. 
One  practical  problem  encountered  in  uhf  cc  calculations  at  highly  stretched  ge¬ 
ometries  is  slow  convergence,  although  RHF-based  cc  calculations  e\entually  will 


•  ceso 

•  CeSD+T 

•  CCSD(T) 
O  CCSDT 
+  MR-Cl 

X  MR-LCCM 


Internuclear  distance  (a.u.) 


Figure  I.  Calcul.itcd  potential  cncrg>  eur\cs  for  various  cx  methods  based  on  an  Rlir 
reference  determinant  (4i.t/)l  </basisscl,  l);i,  computational  symmetrv ).  The  mr-C'i  and 
MR-<X'  data  are  from  the  work  of  Ixiidig  ct  al 
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•  CCSO 
A  COSD(T) 
O  CCSDT 
+  MR-CI 
X  MR-LCCM 


Internuclear  distance  (a.u.) 


Figure  2.  Calculated  potential  energy  curves  for  various  CC  methods  based  on  a  uhf 
reterence  determinant  (4s3pl<f  basis  set,  Cjc computational  symmetry).  The  MR-ci  and 
MR-CC  data  are  from  the  work  of  Laidig  et  al. 


not  converge  at  all  at  such  geometries,  as  several  amplitudes  get  very  large.  Also, 
in  the  present  work  on  N2  the  symmetry  was  relaxed  to  Cn ,  making  the  calculations 
more  costly  than  if  £>2/,  symmetry  was  used. 

Electrical  Properties  of  X'l,  BeO 

The  ground  state  of  BeO  is  highly  polar  and  provides  a  challenge  to  theoretical 
calculations  [65].  Electrical  properties  have  been  of  interest  for  some  time  [66], 
particularly  in  view  of  the  stability  of  various  complexes  [  67  ]  and  the  dipole  bound 
anion  [68].  In  recent  work  we  have  been  comparing  different  cc  methods  through 
calculations  of  the  dipole  moment  and  parallel  and  perpendicular  dipole  polariz¬ 
abilities.  A  series  of  calculations  were  performed  with  the  POL  basis  of  Sadlej  [69], 
which  is  a  5 s2p2d  set  designed  to  provide  a  reasonable  description  of  electrical 
properties,  even  though  it  is  only  of  modest  size.  For  methods  for  which  analytical 
derivatives  were  available,  the  dipole  moments  were  evaluated  analytically,  and 
was  obtained  from  the  difference  of  dipole  moments  evaluated  in  the  presence  of 
electric  field  along  the  molecular  axis,  while  was  obtained  from  finite  differences 

of  energies.  For  other  methods,  all  resuits  were  obtained  by  taking  finite  diffeiences 
of  the  energy  with  respect  to  an  electric  field.  The  results  a.e  shown  in  Table  IV. 
Particularly  interesting  is  the  fact  that  the  CCSD(T )  result  for  is  so  different  from 
the  CCSD  +  T(CCSD)  and  CCSDT- 1  and  CCSDT-3  results.  Since  the  three  latter  meth- 
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Table  IV.  Electrical  properties  of  BeO.  R  =  1.3308  A, 
Ssipld  basis  set.  Atomic  units. 


Method 

a.. 

SCF 

2.95 

22.4 

19.7 

mbpt(2) 

2  48 

32.3 

29.3 

mbpt(4) 

2.30 

36.1 

37.3 

CCSD 

2.56 

31.3 

29.8 

CCSD  +  T(CCED) 

2  26 

38.7 

45.9 

CCSD(T) 

2.43 

35.1 

34.4 

CCSDT-lb 

2.27 

35.3 

45.6 

CCSDT-3 

2  30 

34.9 

44.6 

CCSDT 

2.40 

34.3 

36.0 

ods  all  give  somewhat  similar  results,  one  would  not  expect  the  method  of  inclusion 
of  triple  excitations  to  affect  the  results  very  much.  Moreover,  through  fifth  order 
of  MBPT,  CCSD(T )  is  equivalent  to  CCSDT-lb  and  ccSDT-3.  Somewhat  to  our  sur¬ 
prise,  then,  we  have  found  that  the  ccsdt  result  turns  out  to  be  rather  close  to 
CCSD(T).  The  agreement  of  CCSD(T)  with  ccsdt  is  evidently  somewhat  fortuitous. 
Moreover,  it  is  not  certain  in  this  case  how  close  the  CCSDT  result  is  to  FCI  since  a 
noniterative  estimate  of  74  effects  indicates  a  significant  value  for  these  [  70  ] .  More 
discussion  of  results  for  electrical  as  well  as  spectroscopic  properties  is  provided 
elsewhere  [70],  along  with  some  interesting  results  obtained  with  the  QCi  methods. 

Hessian  Rank  Cj 

The  C3  cation  has  been  the  subject  of  considerable  recent  theoretical  attention 
[61,71-76].  This  has  stemmed  from  the  uncertainty  of  its  geometrical  structure, 
experimental  data  being  unable  to  clearly  resolve  whether  it  is  bent  or  is  linear  but 
with  a  low  bending  frequency  [  77  ] .  All  reliable  theoretical  work  has  indicated  that 
a  bent  structure  (electronic  state  *82)  with  a  bond  angle  of  about  70°  is  of  lower 
energy  than  the  ^2^  species  with  which  it  correlates  in  Da,  h-  The  exact  value  of  the 
energy  difference  differs  considerably  according  to  method  and  basis,  although  it 
is  likely  to  be  only  a  few  kcal/mol.  What  has  received  less  attention,  but  is  somewhat 
intriguing,  is  the  question  of  whether  there  is  a  linear  local  minimum.  Investigation 
of  this  point  has  been  hampered  by  symmetry  breaking,  and  so  the  imaginary 
frequencies  for  have  been  regarded  with  suspicion.  Also,  there  are  multiple  UHF 
solutions,  and  the  less  contaminated  one  has  several  Hartree-Fock  instabilities, 
including  one  of  <t„  symmetry.  In  our  previoi  >  work  [6 1  ]  we  showed  that  use  of  a 
quasi-RHF  reference  determinant,  consisting  of  neutral  Q  orbitals,  circumvented 
these  problems  for  the  bent  species;  the  reference  orbitals  are  unambiguously  de¬ 
termined  and  do  not  suffer  from  symmetry  breaking.  Another  point  of  interest 
regarding  linear  geometries  is  whether  the  linear  species  is  stable  to  bending  or  not. 
We  have  therefore  investigated  the  linear  geometries  with  the  QRHF  approach  and 
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the  full  CCSDT  method  using  the  same  DZP  basis  set  as  in  our  previous  work  and  a 
5s3p2dhas\s  set  defined  by  Taylor  et  al.  [75]. 

We  present  the  data  in  Table  V.  The  UHF-based  results  indicate  clearly  instability 
along  the  asymmetric  stretching  coordinate,  and  show  that  the  bending  frequency 
is  small.  It  is  interesting  that  the  CCSD(T )  method  predicts  a  real  bending  frequency, 
in  line  with  the  findings  of  Raghavachari  and  Taylor  et  al.  with  the  QCISD(T) 
method,  which  indicate  that  there  is  a  local  maximum  along  the  path  connecting 
the  ’Bt  and  states.  With  ccsdt,  however,  this  seems  not  to  be  true.  Moving 
now  to  the  QRHF  results,  we  see  that  with  both  basis  sets  the  imaginary  <t„  frequency 
remains,  which  seems  to  be  strong  evidence  that  this  is  not  an  artifact  after  all. 
These  results  also  show  instability  to  bending.  It  is  interesting  to  compare  our 
results  with  second-order  electron  propagator  [  ept(  2 )  ]  results  of  Ortiz  [  76  ],  which 
represent  the  most  detailed  investigation  of  the  force  field  of  the  linear  species  to 
date.  These,  which  also  are  based  on  neutral  Cj  orbitals,  predict  an  imaginary  <t„ 
but  a  real  n„  frequency.  Ortiz  allowed  the  two  bond  lengths  to  become  inequivalent 
and  located  the  *2^  state,  which  he  finds  to  be  a  local  minimum.  Taylor  et  al.  also 
located  this  state  at  the  casscf  level  but  appear  to  regard  it  as  an  artifact.  Finally, 
one  might  ask  why  the  bonds  of  linear  Cj  would  be  inequivalent.  A  plausible 
answer  may  come  from  the  second-order  Jahn-Teller  theorem.  Thus,  according  to 
the  calculations  of  Ortiz,  the  ‘2^  state  lies  less  than  1  eV  above  the  state,  and 
these  states  may  be  coupled  b>  *he  <r„  vibration,  leading  to  a  state. 


Prospects  for  Larger  CCSDT  Calculations 

In  Table  VI  we  present  preliminary  times  for  CCSDT  calculations  with  our  new 
implementation.  Where  these  are  available,  we  present  also  times  for  our  previous 
implementation.  1  he  times  for  the  new  implementation  are  roughly  a  factor  of  10 
smaller  than  for  the  old  version  for  two  Cn  examples,  and  this  is  a  conservative 
ratio  since  the  Ti.  vector  was  not  held  in  core  in  the  new  implementation,  while  it 
was  in  the  previ,.  js  implementation.  The  use  of  the  DPD  algorithm  to  exploit  sym- 


1 ABLF,  V.  Harmonic  vibrational  frequencies  of 'S,;  Cj  (cm"'). 


Ref/Bas" 

Method 

U/421 

scr 

1277 

2456 

1901 

U/421 

cesD 

1208 

2500i 

176i 

U/421 

CCSD(T) 

1149 

14,tli 

59 

U/421 

CCSDT 

1158 

45li 

76i 

0/421 

CCSDT 

1162 

20641 

69i 

Qnn 

CCSDT 

1177 

25641 

2l5i 

•  Reference  determinant  and  basis  set  (see  footnote  to  Table  II).  ^denotes 
a  QRHK  reference  determinant  comprising  neutral  Cj  orbitals. 
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Table  VI.  Some  preliminary  timings  of  CCSDT  codes  on  Cray-t  MP  (CPU  sec). 


System 

Ref/Bas* 

Symmetry'’ 

Nto' 

01(1“ 

New' 

■a,  0, 

R/421 

C2K 

48 

15215 

1478 

'BjCJ 

U/421 

G. 

48 

9850 

795 

^11  C) 

U/532 

Di, 

72 

2200 

’IT  C: 

U/54321 

D,, 

110 

3289 

’  Reference  determinant  and  basis  set  (see  footnote  to  Table  II). 

Compuiatwnal  symmetry. 

'  Number  of  basis  functions. 

■’  Old  code  had  in  core. 

'  New  code  had  T3  out  of  core.  All  calculations  were  run  with  6.5  Mwords  of  memory 


metry  allows  for  a  theoretical  speedup  of  /)*,  where  h  is  the  order  of  the  compu¬ 
tational  point  group,  so  a  factor  of  10  is  reasonable  given  the  different  modes  of 
Ti  storage.  With  the  new  implementation,  it  is  seen  that  cesDT  calculations  with 
over  1 00  basis  functions  { for  relatively  small  numbers  of  electrons)  are  quite  feasible. 
Of  course,  such  calculations  require  of  the  order  of  hours  of  supercomputer  time, 
and  one  can  often  achieve  similar  quality  results  with  approximate  methods  such 
as  ccSDfT )  in  a  small  fraction  of  the  time.  However,  as  seen  in  some  of  the  examples 
in  this  and  other  articles,  there  can  be  significant  differences  between  the  methods, 
and  so  it  is  always  helpful  to  be  able  to  compare  approximate  methods  with  CCSDT. 
If  there  are  large  differences,  it  may  be  true  that  the  approximate  method  is  not 
appropriate  for  the  problem,  although  it  may  (fortuitously)  yield  good  agreement 
with  experiment.  If  the  ccsdt  results  are  in  agreement  with  those  of  a  more  ap¬ 
proximate  method,  it  is  like.y  that  application  of  the  approximate  method  is  ap¬ 
propriate. 
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Abstract 

Ground  state  electron  correlation  is  mtroducei.  into  the  one-partiele  propagator  via  coupled  cluster 
theory.  This  defines  a  si.nilanty  transformation  of  the  Hamiltonian,  which  leads  to  the  complete  separation 
of  the  ionization  and  electron  attachment  aspects  of  the  propagator  The  latter  makes  it  possible  to  solve 
for  each  property  independently.  Furthermore,  the  frequency  (or  energy )  dependence  which  characterizes 
propagator  theory  is  eliminated  by  introducinga  wave  operator  formalism  It  is  shown  that  this  procedure 
IS  equivalent  to  the  summation  of  certain  types  of  terms  in  the  electron  propagator  perturbation  expansion 
to  infinite  order.  Finally,  the  resulting  equations  are  found  to  be  equivalent  to  those  of  the  Fock-space 
coupled-cluster  (or  equivalently  the  equation  of  motion  coupled-cluster)  method,  which  provides  an 
explicit  wave  function  for  each  state,  demonstrating  the  connection  between  these  diflerent  approaches 
for  the  calculation  of  ionization  potentials  and  electron  affinities  U  nderstanding  this  relationship  permits 
new  and  powerful  approximations  to  be  proposed,  (c  1993  John  Wiley  &  Sons,  Inc 

Introduction 

One-particle  propagator  methods  have  been  employed  in  molecular  theory  pri¬ 
marily  for  the  calculation  of  electron  binding  energies  ( ionization  potentials  and 
electron  affinities).  Applications  have  been  presented  by  Pickup  and  Goscinski  [1] 
and  Purvis  and  Ohrn  [2]  using  the  electron  propagator,  Simons  and  Smith  using 
the  equation  of  motion  method  [  3  ] ,  and  Cederbaum  employing  the  Green’s  function 
method  [4],  All  these  approaches  have  been  shown  to  be  equivalent  [5-8],  Prop¬ 
agator  methods  depend  upon  two  approximations,  one  for  the  reference  state  which 
defines  the  average  value  of  operators,  and  a  largely  independent  choice  of  operator 
manifold  that  permits  the  description  of  ionized,  electron  attached  or  excited  states 
Consequently,  hybrid  methods  have  been  proposed,  where  coupled-cluster  (cc) 
theory  [9- 1 1  ]  is  used  to  define  the  reference  state  solutions  together  with  customary 
operator  choices  as  in  propagator  theory.  This  has  been  done  for  the  electron  prop¬ 
agator  [12]  and  for  the  polarization  propagator  [13],  however,  the  exponential 
nature  of  the  CC  wave  function  invariably  requires  making  rather  several  approx¬ 
imations  to  just  linear  or  maybe  quadratic  terms  [12,13]  in  implementation  that 
jeopardizes  the  full  advantages  of  the  cc  method.  In  this  paper,  we  consider  the 
natural  evolution  of  this  idea  for  the  electron  propagator  and  demonstrate  its  cul- 
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mination  in  Fock  space  cc  [14-16]  and  the  equation-of-motion  (EOM-CC) 
method  [18,19]. 

A  characteristic  feature  of  the  electron  propagator  method  is  that  it  couples  prob¬ 
lems  of  ionization  potentials  and  electron  affinities  [5-8]  (for  an  overview,  see 
[5]).  In  this  paper,  we  consider  the  electron  propagator  formalism  in  which  electron 
correlation  effects  for  a  reference  state  are  assumed  to  be  known  and  calculated 
within  the  cc  framework  [9],  without  truncation  of  the  exponential  cc  wavefunc- 
tion,  except  to  excitation  levels  like  CCSD  [20].  The  cluster  operator  is  used  to 
perform  a  similarity  transformation  of  the  Hamiltonian.  Then,  in  this  way,  the 
transformed  Hamiltonian  replaces  the  original  one  in  the  electron  propagator  su¬ 
peroperator  formalism  as  similarity  transformations  do  not  change  the  eigenvalue 
problem.  Th.  equivalence  holds,  however,  in  the  general  case  and  cannot  be  ex¬ 
tended  to  approximate  schemes.  The  transformed  Hamiltonian  has  a  very  simple 
right-hand-side  eigenfunction  corresponding  to  the  total  ground  state  energy  which 
is  the  Hartree-Fock  determinant.  In  this  way  both  the  transformed  and  the  zero- 
order  Hartree-Fock  Hamiltonian  have  the  same  right  ground  state  eigenfunction. 
The  use  of  the  transformed  Hamiltonian  leads  to  a  great  simplincation  in  the  electron 
propagator  equations.  First,  it  decouples  the  ionization  and  electron  attachment 
problems  which  permits  them  to  be  treated  separately.  Second,  we  can  reduce  the 
superoperator  formalism,  which  is  a  compact  form  of  commutator  and  anticom¬ 
mutator  algebras,  to  simpler  Fock-space  operator  considerations.  Third,  the  trans¬ 
parency  of  the  modified  theory  facilitates  introducing  new  and  improved  approx¬ 
imations. 

Another  characteristic  of  electron  propagator  theories  is  that  in  most  formulations 
the  frequency  dependence  is  retained.  That  suggests  a  Brillioun-Wigner  (B-w) 
perturbation  expansion  as  a  natural  way  of  handling  the  problem  [5-8].  If  the 
transformed  Hamiltonian  is  used,  then  the  perturbation  expansion  is,  of  course, 
expressed  in  terms  of  the  new  transformed  Hamiltonian.  However,  the  B-w  type 
expansion  has  several  disadvantages.  First,  it  permits  obtaining  only  a  single  fre¬ 
quency  at  a  time  in  spite  of  the  fact  that  the  entire  matrix  representation  of  the 
electron  propagator  must  be  constructed.  Second,  unlike  the  Rayleigh-Schrddinger 
expansion  (R-S),  each  calculation  must  be  performed  in  a  self-consistent  manner 
due  to  the  frequency  dependence  [5-8].  To  avoid  all  these  problems  one  can  in¬ 
troduce  a  correlation  operator  and  effective  Hamiltonian  (see,  for  example,  [14]) 
that  eliminate  the  frequency  dependence.  Once  the  correlation  operator  corre¬ 
sponding  to  a  selected  part  of  the  spectrum  is  determined,  diagonalization  of  the 
effective  Hamiltonian  provides  all  frequencies  due  to  the  universality  of  the  for¬ 
mulation.  Moreover,  it  can  be  seen  that  truncation  of  the  correlation  operator  at 
some  excitation  level  is  equivalent  to  a  summation  of  i^crtain  types  of  the  B-w 
perturbation  diagrams  to  infinite  order. 

The  electron  propagator  formalism  based  on  the  assumption  that  the  transformed 
Hamiltonian  includes  a  description  of  correlation  effects  for  the  ground  state  and 
on  the  elimination  of  frequency  dependence,  results  in  a  set  of  equations  that  can 
be  found  to  be  exactly  those  given  by  the  Fock-Space  cc  (FSCC)  formalism  for 
ionization  and  electron  attachment  calculations  with  all  orbital  levels  assumed  to 
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be  active  [15-17],  Furthermore,  the  equivalence  of  FSCC  and  the  EOM-cc  method 
for  the  principal  ionization  potentials  (IPs)  and  electron  affinities  (EAs)  has  also 
been  established  [21],  This  shows  some  formal  advantage  of  the  FSCC  and  eom- 
cc  methods  over  the  electron  propagator  formulation.  It  does  not  necessarily  mean 
that  the  numerical  performance  of  the  standard  electron  propagator  approaches 
should  be  worse,  since  the  electron  propagator  furnishes  energy  differences,  and  a 
poor  description  of  the  ground  state  accompanied  by  a  comparably  poor  description 
of  ionized  or  electron  attached  states  can  result  in  a  quite  reasonable  energy  differ¬ 
ences.  Instead,  the  FSCC  or  eom-CC  method  assumes  a  higher  standard  for  the 
description  of  the  reference  state  and  hence  a  comparable  standard  would  be  an¬ 
ticipated  for  the  other  states  to  get  adequate  IPs  and  eas.  In  the  following  we  will 
address  formal  aspects  only.  The  next  section  is  devoted  to  demonstrating  the  sep¬ 
aration  of  the  ionization  and  electron  attachment  problems.  Then  in  the  third 
section  the  problem  of  the  ionization  potential  is  discussed  in  detail.  The  fourth 
section  briefly  addresses  the  electron  affinity  equations,  which  differ  little,  concep¬ 
tually,  from  those  for  ionization  potentials. 

The  Superoperator  Representation  of  the  Electron  Propagator 
with  a  Correlated  Ground  State 

Superoperators  have  been  employd  [  1 ,7  ]  to  recast  the  commutator  and  anticom¬ 
mutator  algebras  involved  in  the  standard  electron  propagator  formulation  into 
matrix  and  vector  space  techniques.  This  has  been  done  by  introducing  a  linear 
space,  elements  of  which  are  linear  combinations  of  the  field  operator  products  [  1  ] ; 

hi  =  {«! },  \^i^N<a,  (la) 

hj  =  \ala,aj}  ,  {a]  aji/,}  ,  l:^i<j<N<a<b,  (lb) 

which  could  be  continued  to  or  higher.  N  is  the  number  of  electrons  of  the 
system.  The  above  are  operators  formed  from  the  Hartree-Fock  basis  where  /,  j, 
•  •  •  indicate  occupied  spin  orbitals  while  *  •  •  are  those  of  unoccupied  orbitals. 

For^  and  B  being  general  elements  of  this  linear  space  the  superoperator  identity 
/and  superoperator  Hamiltonian  //can  be  defined  as  [1,7] 

fA  =  A,  (2a) 

HA  =  [A,H].  (2b) 

The  scalar  product  then  takes  the  form  [1,7] 

{A\B)  =  <4-|[5,  s  {[B,  A^]^)  ,  (3) 

where  1 4>)  is  the  Hartree-Fock  determinant  and  (  )  means  the  Hartree-Fock 
average.  These  definitions  permit  expressing  the  electron  propagator  in  the  compact 
matrix  form 


G(£)  =  (a|(£/-//)-'a), 


(4) 
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where  a  is  a  column  supermatrix  containing  all  annihilation  operators.  Employing 
inner  projections  and  the  partitioning  technique  of  Ldwdin  [  22  ] ,  the  superoperator 
resolvent  becomes  the  matrix  inverse  [1,7] 

G(£)  =  (a|h)(h|(£/-  .W)h)-(hla) ,  (5) 

where  the  inner  projection  manifold  h  consists  of  elements  of  the  linear  space. 
Partitioning  ofh  =  {a;f}with(a|f)  =  0,  where  a  spans  the  model  space  while  f 
is  its  orthogonal  complement  leads  to  [1,7] 

G-'(£)  =  (a|(£/-  //)a)  -  (a|^f  )(f  |(£/-  //)f  )-'(f  |//a) .  (6) 

The  unperturbed  superoperator  Hq  is  chosen  and  then 

H  =  Ho+V,  (7) 

where  F  is  a  perturbation  superoperator.  One  can  express  Hq  in  second  quantized 
form  as 

•fffl  =  2  f^kOk^k  ■  (8) 

k 

Using  the  partition  of  the  Hamiltonian,  we  can  write  the  Dyson  equation  [23] 

G-‘(£)  =  G?'(£)-2(£),  (9) 

where 

Go"  =(a|(£/-//o)a),  (10) 

and  the  self-energy  2(E)  is  defined  as  the  remainder  in  Eq.  (6V  IP*:  and  EAS  are 
given  by  the  poles  of  G(£),  from 

detG‘'(£^)  =  0.  (11) 

In  the  following  we  assume  that  we  are  interested  in  the  eigenvalue  problem 
only.  It  is  easy  to  see  that  iPs  and  eas  are  invariant  with  respect  to  a  similarity 
transformation  of  H 


(12) 

where  the  cluster  oeprator  T  is  defind  as  [9-1 1] 

iV 

7'=  2  2  2  •  "  fl.i  •  (13) 

n-l  ai<'  ■  ■<a„  /|<-  •  •<!„ 

The  T  amplitudes  are  antisymmetric  in  their  upper  and  lower  indices.  According 
to  the  definition  of  the  superoperator,  we  can  also  add  some  constant  term  to  the 
operator  without  causing  any  change.  Hence,  we  define  a  new  superoperator  H 
instead  of  //,  where 


H  =  e-^He^  -  £o  . 


(14) 
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It  is  assumed  that  T  amplitudes  and  Eq  satisfy  the  equations 

<4>1/?1#>  =  0,  (15a) 

<a:a,4>|^|4>>  =  0,  (15b) 

=  0  ,  (15c) 

and  so  on;  hence  T  and  Eo  can  be  obtained  from  these  equations  [9,10].  f’o  is  now 
the  total  ground  state  energy.  Equation  (15)  can  be  equivalently  written  in  the 
form 

^|4>)  =  0.  (16) 

ll  is  expressed  as  a  second-quantized  normal-ordered  (with  respect  to  the  Hartree- 
Fock  determinant)  operator  consisting  of  one-,  two-,  •  •  •  particle  parts  (see,  for 
example,  [24] ).  The  zero-particle  part  does  not  appear  due  to  the  definition  (14). 

Let  us  now  introduce  an  additional  classification  of  the  elements  spanning  the 
orthogonal  space.  Considering  the  action  of  elements  of  f  on  4>,  we  can  obtain  two 
classes  of  elements,  f  =  { X^,  Y  } ,  defined  as 

/€X^  if  <^|/=0,  (17a) 

/eV  if  /|<i.>  =  0.  (17b) 

It  should  be  noted  here  that  elements  of  X  and  do  not  belong  to  the  linear  space 
under  consideration.  The  annihilation  operators  with  respect  to  the  physical  vacuum 
can  be  transformed  into  a  new  set  of  annihilation  and  creation  (quasiparticle  or 
particle-hole)  operators  but  now  with  respect  to  the  Hartree-Fock  determinant  as 
the  Fermi  vacuum  [9]: 

bo  =  aa.  (18a) 

bl  =  a,.  (18b) 

They  fulfill  the  same  anticommutator  relations  as  the  a  and  operators.  Now  it 
can  be  seen  that  each  element  of  X^  is  a  product  of  new  creation  operators  while 
each  element  of  Y  is  a  product  of  the  new  annihilation  operators.  It  can  be  noted 
that  X^  includes  2h-p,  3h-2p,  •  *  •  operators  and  Y^  contains  2p-h,  3p-2h,  •  •  • 
operators. 

Let  us  show  the  consequences  of  using  H  instead  of  H  for  the  structure  of  the 
G  “'(£■)  matrix.  Consider  [G  “'(£^)]ia  elements.  For  the  first  term  on  the  rhs  ofEq. 
(6),  we  have 

{a,  |(£/ -  ff)aa)  =  {Ha^a]  -  +  a^Hoa  -  a]a„H)  =  0  .  (19) 

In  a  similar  way  we  obtain 


(X^IHY)  =  0, 
(Xn/?aj  =  0, 
(a,i^Y)  =  0. 


(20a) 

(20b) 

(20c) 
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Equations  (20a-c)  can  be  used  to  show  that 

(«,|//f)(f|(£/-/7)f)  '(f  |/k)  (21) 

is  also  zero  for  any  /  and  a.  In  order  to  do  this,  one  can  use  the  expansion 

rt 

(f|(/;/-  .  (22) 

A--I 

Because  of  Equations  (20a)  and  (20b).  a  nonzero  contribution  from  the  last  two 
factors  in  Eq.  ( 2 1 )  is 

y 

2  £-^(Y|^Y)^-'(Y  .  (23) 

k-\ 

Finally  we  have  the  following  expression  for  ( 2 1 ) 

(a,|/7Y)(Y](£/-/?)Y)  '(Y|/7«„).  (24) 

but  it  is  equal  to  zero  because  of  Eq.  (20c).  In  this  way  we  have  shown  that 

[G-‘(£)],„  =  0,  (25) 

for  all  /  and  a.  Hence,  G  ■'(£)  has  the  structure 


Because  of  this  we  can  write 

det  G-'(£)  =  det[G-'(£)<,^]  det(G-’(£),J  ,.  (26) 

and  instead  of  Eq.  (II)  one  can  consider  two  equations 

det[G-'(£)„ft]  =  0„  (27a) 

det[G-'(£),J  =  0.  (27b) 

separately.  That  means  that  the  use  of  H  instead  of  //  decouples  the  calculation  of 
ionization  potentials  and  electron  affinities,  or,  in  other  words,  the  inclusion  of 
correlation  effects  for  the  ground  state  leads  to  separation  of  these  two  problems. 
One  should  also  notice  that  the  transformation  H  -*■  iV  provides  the  similar  effect 
of  eliminating  the  other  off-diagonal  block. 

[G-'(£)]«,  =  0.  (28) 

for  all  a  and  /. 

To  arrive  at  Eq.  (25)  we  have  taken  advantage  of  the  fact  that  the  transformed 
Hamiltonian  has  a  very  simple  right  eigenfunction  which  is  the  Hartree-Fock  de¬ 
terminant.  That  leads  to  separation  of  the  eigenvalue  problems  given  by  Eq.  (27). 
However,  one  should  be  aware  that  this  separation  does  not  include  properties 
other  than  energies.  For  the  eigenvalue  problem  knowledge  of  the  right  ground 
state  eigenvector  of  /7  is  sufficient  for  the  separation.  If  one  is  also  interested  in 
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Other  properties  than  IPs  or  eas,  then  again  the  whole  matrix  of  G“'(£)  must  be 
considered.  This  is  because  for  other  properties,  a  knowledge  of  the  left  eigenfunction 
of  H  is  also  necessary .  One  can  introduce  the  left  eigenfunction  of  H  in  the  electron 
propagator  formalism  in  an  explicit  way  since  the  left  eigenfunction  of  7?  can  be 
obtained  as  discussed  elsewhere  [25].  See  also  Arponen  [26].  That  would  lead  to 
vanishing  of  the  second  off-diagonal  submatrix  ofG"'(£')and  complete  separation 
of  both  subproblems.  Very  recently  such  an  idea  has  been  explored  by  Nooijen  and 
Snijders  [27].  In  the  following,  we  shall  concentrate  on  the  eigenvalue  problem 
showing  the  relation  with  other  approaches  designed  to  calculate  IPs  and  eas. 

Ionization  Potentials 

In  this  section  we  consider  the  [G “'(£)],;  submatrix  which  corresponds  to  the 
ionization  potential  calculation.  In  the  following  we  assume  that  we  are  interested 
in  principal  ionization  potentials.  From  Eq.  (6)  we  have 

[G-\L)]„  =  (aMEf-  H)a,) 

-(fl,|//X^)(X*|(£/-7?)X^)-'(X^|/?fl,).  (29) 
where  X’  replaces  f  because  of  Eqs.  (20c)  and  (20a).  Denoting  by 

(X'lXfl;)  =  (X’|(£/-  //)X^)-'(X*|/?a;) .  (30) 

we  have  from  ( 29 ) 

[G-'(£)],;  =  (aMEi- ihaj)  -  (fl,  |/?X^)(XMxa;)  -  (31) 

It  should  be  noted  that  X  is  not  explicitly  defined  by  Eq.  (30)  since  it  gives 
only  the  matrix  representation  of  X.  Multiplication  of  Eq.  (30)  from  the  left  by 
(XM(£/-  //)X")  gives 

(X^|/U)  +  (X*|/?X^)(X^lXa,)  -  2  (XMXfl,)  5„£  =  0  .  (32) 

t 

It  can  be  seen  that  by  introducing  X  the  problem  of  dealing  with  the  inversion  of 
the  matrix  (X^!(£/  -  can  be  avoided.  The  possibility  of  obtaining  the 

inverse  matrix  directly  is  limited  because  the  dimension  increases  rapidly  with  the 
size  of  orbital  basis  [  7  ] .  Some  progress  in  dealing  with  the  matrix  inversion  problem 
has  been  made  by  Baker  and  Pickup  [28];  however,  the  most  frequent  approach 
in  electron  propagator  theory  is  to  express  the  inverse  matrix  through  a  perturbation 
expansion.  From  Eq.  (31 )  we  have 

[G - ' ( £)]„  =  Eb.,  -  (a.  I lia,)  -  (a,  |  /?X^)( X* |  Xa^) .  ( 33 ) 

Lei  us  introduce  the  following  notation: 

+  («.  |/?X')(XMxr/;) .  (34) 

From  (27b)  we  have  that  diagonalization  of  the  energy  poles  gives  the  desired 
frequencies 

I(//f~b„E)c,{L)  =  0.  (35) 
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where  c(  £)  is  an  eigenvector  of  //'"^corresponding  to  the  eigenvalue  E.  Multiplying 
Eq.  (32)  by  Cj{E),  making  the  summation  over  j.  using  (35),  and  finally  taking 
into  account  that  the  c{E)  are  linearly  independent,  one  can  get  an  equation  for 
the  matrix  elements  of  X  which  is  £-independent: 

(X^l^a,)  +  (XM^^)(X^lXa,)-  2  (XMXfl, )//'"  =  0.  (361 

I 

The  set  of  equations  (36;  j*  id  (34)  is  for  //f/  and  matrix  elements  of  the  super¬ 
operator  X.  In  this  way  the  E  dependence  has  been  eliminated  from  the  equations. 
Now  diagonalization  of  //'"  gives  all  principal  ionization  potentials.  By  inserting 
(34)  into  (36),  one  can  see  that  Eq.  (36)  is  nonlinear  in  the  matrix  elements  of 
X,  and  thus  has  more  than  one  solution.  The  other  solutions  provide  a  description 
of  the  satellite  ionization  potentials  (shakeups)  [29]. 

Similar  equations  are  well  known  from  the  theory  of  effective  Hamiltonians, 
where  //'"  is  called  the  matrix  representation  of  the  effective  Hamiltonian  while  X 
is  usually  called  the  correlation  eperator.  X  can  also  be  expressed  as  a  second- 
quantized  operator  ( see,  for  example,  [14]).  Equations  (36)  and  (34)  allow  a  de¬ 
termination  of  ( X^  1  Xfl, )  iteratively.  Equation  ( 36 )  can  be  expressed  in  the  form 

(X^|(c/-//o)Xt)(X^|Xa,) 

=  (X+I^fl;)  +  (X'|//X’)(X'|Xa,)  -  2  (XMXfl,)2,  ,  (37) 

where 

2„  =  =  (aAHa,)  +  (a,  |/;xM(XMxa,) ,  (38a) 

and 

H  =  Ho  ^  Ho -{Ho}.  (38b) 

The  matrix  (X^lcj/  -  //o)X^)  is  diagonal  and  contains  the  R-s  PT  energy  denom¬ 
inators.  For  =  ala/^ai,  one  has  the  energy  denominator  D  =  Cj  +  Ca  -  ci,-  e/. 

One  can  notice  that  the  superoperator  formalism  is  not  too  convenient  once  the 
ionization  and  electron  attachment  problems  are  separated.  Using  the  definitions 
(2)  and  (3)  and  Eqs.  ( 16)  and  ( 17),  we  have 

<X(C;-//o)X'><X[X,aJ> 

=  <X//a,>  +  <X//X'><X(X,a,]>-  2<X[X,a,]>2,.  (39a) 

I 

2„  =  (a.Haj)  +  <ajHX^)(X[X,  a,])  .-  (39b) 

We  assume  that  the  operators  H,  Ho.  and  X  are  in  normal  order  with  respect  to 
the  Hartree-Fock  determinant  [14-17],  Due  to  the  definitions  (14)  and  (38b), 
//  and  Ho  do  not  contain  the  zero-particle  parts  (constant  terms).  Now  we  can 
deine  the  X  operator  explicitly.  We  argue  that  X  can  be  expressed  by  the  linear 
expansion  of  connected  operators 


X  =  2  X(«). 

n=2 


(40) 
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with 


X(«)=  2  2  ZK''''.yal 

a,<-  ■  -<0,-1  I|<-  ■  -<I„  J 


(41) 


where  the  X  coefficients  are  antisymmetric  in  the  a  and  i  indices.  Connected  means 
here  that  the  equation  for  X  contains  connected  terms  only. 

Due  to  the  definition,  Eq.  (41 ),  we  have 

<X[X,aJ>  =  <XXfl,>.  (42) 

To  obtain  an  explicit  form  of  the  equations,  diagrammatic  methods  can  be  used 
[24,  30].  The  rules  for  creating  a  diagrammatic  representation  follow  from  the 
generalized  Wick  theorem.  The  graphical  representation  of  H  is  shown  in  Figure 
1(A).  Equation  ( 15)  is  also  presented  in  diagrammatic  form  in  Figure  1(B).  Dia¬ 
grams  representing  X  are  shown  in  Figure  1(C).  Since  X  contains  one  hole-anni¬ 
hilation  operator,  then  the  X  diagrams  must  have  precisely  one  line  at  the  bottom 
representing  a*  and  an  odd  number  of  lines  at  the  top  representing  the  product  of 
quasiparticle  creation  operators  corresponding  to  elements  of  X^.  The  equation  for 
the  X(2)  operator  that  represents  excitations  to  the  space  created  by  xj  while  acting 
on  4>  [see  Eqs.  (lb)  and  (17a)  for  the  notation]  is  shown  in  Figure  1(D).  The 
equation  is  very  simple  and  identical  with  those  of  the  Fock-Space  cc  method  for 
ionization  potentials  with  all  active  orbital  levels  [15-17]  and  the  equation  of  motion 
cc  [  1 8-20  ] .  One  can  note  that  only  connected  diagrams  can  be  obtained.  This  is 
because  of  the  fact  that  disconnected  diagrams  can  be  constructed  only  with  those 
shown  in  Figure  1  ( B ) ,  but  they  are  equal  to  zero  due  to  Eq.  ( 1 5b ) .  This  observation 
can  be  extended  for  equations  for  all  components  of  X,  making  our  assumption  of 
connectedness  of  the  X  expansion  Eq.  (40)-(41 )  justified.  The  connected  structure 
of  the  equations  for  X  is  usually  sufficient  for  extensivity  ( connected  structure  in 
terms  of  the  perturbation  expansion)  for  the  principal  ionization  potentials  but  not 
for  the  satellite  roots  (for  details,  see  (31  ]). 

However,  the  most  frequent  approach  in  the  theory  of  electron  propagators  is 
not  to  employ  the  correlation  operator  x,  but  instead  keep  the  E  dependence  through 
the  self-energy  term  S(£)  [Eq.  (9)]  [5-8].  In  such  a  case  the  Brillioun-Wigner 
type  of  perturbation  expansion  is  the  most  natural  choice  for  generating  approxi¬ 
mations.  Thus  the  second  order  or  third  order  as  well  as  ;ome  schemes  allowing 
approximate  summation  of  certain  types  of  terms  to  infinite  order  have  been  ex¬ 
tensively  exploited  [  5-8  ] .  The  importance  of  different  types  of  diagrams  has  been 
investigated  showing  that  it  is  essential  to  include  ring  and  ladder  diagrams  in  the 
third  order  [4]. 

The  B-w  expansion  can  be  easily  obtain  within  our  formalism  if  one  uses  Eq. 
(32),  where 

<X(£-  ^o)X^><XXa,>  =  {XHoj}  +  <X/7X+><XXa,>  .  (43) 

Then,  since  (X  (£  -  Hoy'X^}  is  diagonal,  one  can  obtain  the  simple  expansion 


(Xxaj)  =  2  <X(£:-.^o)''//Xt>'^<X(£-//o)-'//a,>,  (44a) 
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Figure  1.  (A)  Graphical  representation  of  H.  (B)  Graphical  representation  of  coupled 
cluster  equations  (15).  (C)  Graphical  representation  of  operator  X.  (D)  Graphical  rep¬ 
resentation  of  equations  for  X  and  2  with  the  orthogonal  space  rcstiicted  to  xj. 


and  finally 

[2(£)]y  =  (a]flaj)  -r  J  <aJ//X^)<X(£  -  Wor'i/X^/<X(£  -  . 

(44b) 

Both  approaches  are  exact  in  the  limit,  but  they  differ  in  truncation  schemes.  For 
FScc  we  consider  types  of  x  operators  that  should  be  included  [15-17]  (or,  equiv- 
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alently,  a  truncation  scheme  for  X^)  while  the  order  of  the  perturbation  expansion 
is  the  natural  choice  as  the  truncation  criterion  for  the  second  t.pproach  [  5-8  ] . 

One  can  notice  that  Eqs.  ( 39a-b)  and  (44b)  are  equivalent  as  long  as  restnctions 
imposed  on  are  the  same  in  both  approaches.  Hence,  with  a  certain  approxi¬ 
mation  for  X^,  FSCC  can  be  regarded  as  a  method  that  sums  contributions  from 
certain  types  of  terms  in  the  perturbation  expansion  to  infinite  order.  For  example, 
if  the  orthogonal  space  is  restricted  to  X3,  FSCC  can  be  considered  as  a  method  that 
sums  all  2h-p  terms  (intermediate  states  are  restricted  to  those  having  two  holes 
and  one  particle)  to  infinite  order.  They  include  the  ring  and  ladder  diagrams  that 
have  been  found  to  be  very  important  in  third-order  calculations  [4]. 

The  energy  can  usually  be  expressed  in  terms  of  the  perturbation  expansion  as 
a  sum  of  connected  terms  (diagrams)  to  infinite  order.  This  takes  place  for  the 
Rayleigh-Schrodinger  (R-S)  perturbation  expansion  but  usually  not  for  the  B-w 
one.  However,  in  this  specific  case  where  the  cluster  expansion  for  X  is  reduced  to 
a  linear  one,  the  B-w  expansion  has  this  property  as  well. 

The  above  considerations  show  how  some  important  contributions  appearing  in 
the  electron  propagator  ionization  potential  perturbation  expansion  are  summed 
to  infinite  order  by  the  FSCC  method. 


Electron  Affinities 


Because  of  the  nonhermicity  of  /?,  the  structure  of  equations  for  electron  affinities 
can  be  different  from  those  for  ionization  potentials.  To  have  equations  for  electron 
affinities  similar  to  those  for  ionization  potentials  requires  use  of  instead  of  B 
for  which  the  left  eigenfunction  corresponding  to  the  ground  state  is  known.  This 
leads  to  the  property  of  Eq.  (28)  and,  again,  separation  of  EAs  and  IPS.  We  have 

[G-(£)U  =  (a.l(£/ - /?')a„)  -  (a„|^^Y)(Y  |(£/- /?t)Y)-'(Y  l/^r/ft) , 

(45) 


where  now  f  is  replaced  by  Y,  because 

<4>17?+  =  0,  (46) 

and  hence 

(aJW^X+)  =  0,  (47a) 

{\\H^X^)  =  0.  (47b) 


Denoting  by 

(Ylx+fl/,)  =  (Y  !(£/ -  fV)\)-'{Y\B^a,) ,  (48) 

we  have  from  Eq.  (45) 

[G-'(^)U  =  E5,f,  -  (a^lVal)  -  <[a,,  X^Y^XY/raJ)  ,  (49) 

and  from  Eq.  (48) 

+  ([a^,  X^]Y^)(Y(B^  -  £)Y^>  =  0  .  (50) 
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Taking  the  Hermitian  adjoint  of  Eq.  (50),  we  get 

<Y7?4>  +  <Y(^-  F)Y-><Y[X,  al]}  =  0  ,  (51) 

and  similarly  for  Eq.  (49) 

[G-'(£)]?„  =  E8„i,  -  (aMal)  +  <  fl«£Y'><Y[x,  nj])  .  (52) 

That  leads  to  the  following  expression  for  the  matrix  representation  of  the  effective 
Hamiltonian 

Hf,  =  (UaHal)  -  (a^HY  ><Y[X.  4]>  .  (53) 

Diagonalization  of  gives  the  d(  -'d  electron  affinities  if  the  energy  dependence 
is  eliminated  from  (51 )  in  the  way  md  cated  in  the  previous  section.  The  resulting 
equations  are  those  of  the  FSCC  [14-17]  and  EOM-cc  [18,21,31,32]  methods  for 
electron  affinities.  Again,  the  discussion  analogous  to  that  for  ionization  potentials 
could  be  presented  showing  similar  relations. 

One  ca""  also  see  that  Eqs.  (51)  and  (53)  can  be  reached  in  a  different  way. 
Starting  wit*’  a  different  definition  of  the  space  under  consideration 


hi  =  {a] 

}.  {fla}  .  1  ^  N  <  fl  . 

(54a) 

-  {al'ila  }, 

(1*  ,  \<.i<'^N<a<b. 

(54b) 

and  highei  .  the  one-elecf' 

-n  p'  'pagator  defined  accordingly. 

C  i 

!  '  ,(£/-//)  a*). 

(55) 

one  can  obtain  t’  cquatio'  ' 

•”  ‘lectron  affinities  in  the  manner  indicated  in  the 

previous  section. 


Conclusion 

The  electron  propa^,  tor  is  potentially  capable  of  simultaneously  describing  ion¬ 
ization  potentials  and  electron  affinities  as  well  as  some  other  properties  of  the 
system.  Also  the  frequency  dependence  that  is  kept  in  many  approaches  complicates 
calculation  since  for  each  of  the  IPs  and  eas  separate  calculations  must  be  performed. 

However,  if  the  correlation  effects  for  the  ground  state  are  known,  a  priori,  then 
the  formalism  can  be  significantly  simplified.  We  have  assumed  that  the  result  of 
a  coupled  cluster  calculation  for  the  ground  state  is  known  and  we  used  it  to  perform 
a  similarity  transf  rmation  of  the  Hamiltonian,  Eq.  (12).  The  transformed  Ham¬ 
iltonian  has,  of  course,  the  same  eigenvalues  in  the  Fock  space,  but  it  has  a  very 
simple  right-hand-side  eigenfunction  corresponding  to  the  ground  state  energy 
which  is  the  Hartree-Fock  determinant.  Use  of  the  transformed  Hamiltonian  fl 
instead  of  //  results  in  vanishing  of  one  of  the  off-diagonal  blocks  of  G  '(£).  This 
asymmetric  behavior  is  caused  by  the  lack  of  Hermicity  of  //;  however,  disap¬ 
pearance  of  one  of  the  off-diagonal  blocks  is  enough  to  decouple  IP  and  EA  calcu¬ 
lations. 

Elimination  of  the  energy  dependence  from  the  self-energy  expression  2(£)  pro¬ 
vides  a  method  that  is  equivalent  to  the  summation  of  certain  types  of  terms  in  the 
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perturbation  expansion  of  2(f)  to  infinite  order.  It  also  makes  the  approach  uni¬ 
versal  in  the  sense  that  it  furnishes  the  set  of  energies  simultaneously.  The  resulting 
equations  are  equivalent  to  those  of  fscc  or  EOM-CC  for  ionization  potentials  and 
electron  affinities  in  spite  of  the  fact  that  both  these  formulations  represent  com¬ 
pletely  different  approaches  to  the  problem.  Using  the  terminology  of  Oddershede 
[5],  FSCC  and  EOM-cc  are  state  function  methods  that  start  from  defining  the 
wavefunctions  of  states  under  consideration.  The  fscc  method  is  characterized  by 
a  valence  universal  strategy  [15]  that  assumes  a  heirarchical  calculation  of  the 
ground  state  energy,  ionization  and  electron  attachment  energies,  single  excitation 
energies,  and  so  on.  In  each  step  results  of  the  previous  steps  are  used  to  isolate 
and  determine  only  the  genuine  changes  in  the  system.  The  electron  propagator 
method  represents  a  different  philosophy  in  that  it  attempts  to  include  the  effects 
of  the  ground  state  correlation,  ionized  and  electron  attached  states  simultaneously. 
The  derivation  presented  here  demonstrates  the  link  between  these  approaches  that 
allows  a  better  understanding  of  the  relation  between  them  as  well  as  the  evaluation 
of  the  quality  of  particular  approximations. 
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Abstract 

A  serious  problem  in  the  ah-iiiitio  Hartree  Fock  calculation  for  extended  systems  is  a  singulanty  in 
the  exchange  contribution  to  the  band  energies  (the  “exchange  pathology").  In  the  homogeneous  electron 
gas  this  pathology  is  exactly  cancelled  by  the  correlation  contribution  ansing  from  the  sum  of  all  time- 
ordered  ring  diagrams  (  RPa  approximation).  In  this  article  e  show  that  this  cancellation  is  more  general 
and  docs  also  occur  in  periodically  extended  systems,  g  IS93  John  Wiley  &  Sons.  Inc. 

Introduction 

One  of  the  problems  associated  with  ab  inilio  calculations  on  extended  systems 
at  the  Hartree-Fock  (  hf)  level  is  a  pathology  introduced  by  the  exchange  interaction. 
This  pathology  has  some  well-known  symptoms:  Bandgaps  and  band  widths  are 
too  large  (both  by  a  factor  or  two  or  more),  and  the  density  of  states  at  the  Fermi 
level  for  metals  vanishes.  As  a  consequence,  results  from  HF  calculations  on  extended 
systems  are  generally  considered  inferior  to  those  obtained  within  the  local-density 
approximation.  The  latter  does  not  suffer  from  the  same  drawbacks  and  gives  more 
realistic  values  of  the  properties  mentioned  above.  The  exchange-correlation  po¬ 
tentials  used  in  local  density  are  all  to  some  extent  based  on  the  recognition  that 
exchange  and  correlation  correcuons  have  to  be  treated  on  an  equal  footing  to 
avoid  the  pathology  introduced  by  exchange  alone.  For  the  electron  gas  Gell-Mann 
and  Brueckner’s  rpa  results  shows  that  this  is  due  to  an  exact  cancellation  of  this 
pathology  [1,2].  .A  similar  result  can  be  obtained  using  the  dielectric  theory  in  the 
RPA  approximation.  This  leads  to  the  formulation  of  a  screened  interaction.  For 
the  homogeneous  electron  gas  this  was  first  worked  out  by  Lindhard  [3].  Quinn 
and  Ferrell  showed  that  the  same  result  can  be  obtained  by  the  study  of  the  self¬ 
energy  at  the  Fermi  surface  [4]. 

In  the  early  1 960s  the  problems  associated  with  the  unscreened  exchange  in  solids 
attracted  considerable  attention  [5-7].  These  efforts  eventually  culminated  in  the 
work  of  Hedin,  who  expanded  the  self-energy  in  terms  of  a  screened  potential.  This 
expansion  is  expected  to  converge  faster  than  the  expansion  in  terms  of  the  un¬ 
screened  Coulomb  interaction  [8,9].  However,  after  the  appearance  of  the  landmark 
articles  by  Hohenberg  and  Kohn  [10]  and  Kohn  and  Sham  [11],  the  attention 
shifted  away  from  calculations  based  on  ah  initio  methods  to  density-functional- 
based  methods  such  as  local-density  and  local-spin-density  methods.  A  more  rigorous 
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approach  starting  from  Hedin’s  quasi-particlt  equation  has  lately  attracted  consid¬ 
erable  attention  and  has  been  applied  to  a  number  of  real  systems  [12]  showing 
great  improvement  of  the  standard  local-density  methods  for  excited-state  properties 
in  semi-conductors. 

For  periodically  extended  systems  the  situation  is  more  complex  than  for  the 
homogeneous  electron  gas.  Monkhorst,  however,  was  able  to  show  that  the  pathology 
of  the  exchange  in  such  systems  has  exactly  the  same  origin  as  the  one  in  the 
homogeneous  system.  In  fact,  the  pathological  behavior  of  the  exchange  is  completely 
due  to  the  homogeneous  part  of  the  charge  density  in  the  crystal  [13].  The  analysis 
of  this  pathology  and  its  origin  is  easiest  expressed  in  the  momentum  space  rep¬ 
resentation  of  all  quantities  involved. 

In  this  article  we  will  revisit  part  of  the  proof  given  m  Ref.  [13]  and  continue 
from  there  to  show  that  the  cancellation  of  pathologies  that  occurs  in  the  homo¬ 
geneous  electron  gas  also  occurs  independent  of  the  shape  of  the  Fermi  surface  and 
consequently  independent  of  the  inhomogeneity  of  a  realistic  system. 

The  current  analysis  will  be  based  on  the  quasi-particle  equation  of  Hedin  [8] 
and  will  be  expressed  in  terms  of  the  self-energy  contributions  from  exchange  and 
RPA-like  terms.  It  could  be  derived  just  as  well  using  the  total  energy  expression 
and  the  Landau  liquid  theory  [14].  We  will  try  to  identify  the  “dangerous”  terms, 
that  consequently  have  to  be  treated  and  calculated  together,  leaving  terms  that  do 
not  introduce  any  pathologies. 


The  Singularity  in  the  Exchange  Self-Energy 

The  exchange  contribution  to  the  self-energy  does  not  depend  on  tlie  energy- 
parameter  and  is  simply  given  by  the  familiar  result: 

=  (I) 

a 


where  8  only  sums  over  occupied  orbitals.  In  an  extended  system  the  orbital  labels 
are  a  combination  of  a  band-index  (^)  and  a  k-vector,  which  lies  inside  the  first 
Brillouin  zone  (BZ).  Eq.  ( 1 )  then  becomes: 

=  -2  «.(k  +  q)<Mk;  Kk  +  q)kr’  li-fk  +  q);  fik)  .  (2) 


Here  we  have  introduced  the  occupation  numbers  of  the  one-particle  states 


«K(q)  = 


1 

0 


if  c,(q)  :S  et : 
ife„(q)  >  c,  , 


(3) 


where  c/.  is  the  Fermi  energy  as  obtained  from  the  Hartree  (or  Hartree-Fock )  cal¬ 
culation. 

In  order  to  analyze  this  expression,  we  express  the  matrix  element  in  momentum 
representation  (see  Appendix  A).  For  the  exchange  integral  we  find 


„(k)  = 


-4ff 


Z  Z  «r(k  +  q) 

i-q  Cp 


^M.(C  +  q.  k)t.^(-G  -  q,  k  +  q) 
(G  +  q)- 


(4) 
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We  now  immediately  realize  that  the  “dangerous”  contributions  in  Eq.  (4)  come 
from  the  terms  with  G  =  0.  We  can  therefore  split  the  self-energy  into  two  contri¬ 
butions: 

2:,\{k)  =  2i;,'(k)  + 2:;--(k).  (5) 

where 


V  >.l 
Hit 


(k) 


— 4;r 


2  /Uk  +  q) 


k)^,;,(-q.  k  +  q) 
•> 


(6) 


and 


-;;;;(k)  =  -^2  2  «,.(k  +  q)  — 

j-q  (f/() 


.,(G  +  q.  k)4>,^(-G  -  q,  k  +  q) 
(G  +  q)  = 


(7) 


Equation  ( 7 )  does  not  contain  any  dangerous  terms  and  can  therefore  be  calculated 
without  any  further  precaution.  We  can  now  analyze  Eq.  (6)  further  to  see  what 
exactly  causes  the  pathology  of  the  exchange. 

For  orbitals  whose  band-energy  c,(k)  >  c/ .  the  summations  will  not  introduce 
any  pathological  behavior,  because  when  q  -►  0.  the  numerator  vanishes  because 
ofEq.  ( A7).  Therefore,  the  exchange  pathology  will  only  occur  for  occupied  bands, 
where  c^(k)  <  c/ .  However,  the  same  argument  as  used  for  unoccupied  states  applies 
to  the  occupied  ones,  unless  n  =  v.  Therefore,  a  singular  behavior  of  the  integrand 
only  occurs  for 


2i;;;'(k) 


-47r 


2  «„(k  +  q) 


<l>„„(q,  k)4>^„(-q,  k  +  q) 


(8) 


Because  the  quantities  <1>  are  continuous  and  smooth  across  the  Fermi  surface,  the 
pathology  must  be  caused  by  the  denominator  and  hence  is  an  end-point  singularity 
associated  with  the  singular  behavior  of  I  jq'  itself  This  behavior  is  independent 
of  the  actual  shape  of  the  Fermi  surface  [  1 3] .  It  leads  to  the  well-known  logarithmic 
behavior  of  the  Hartree-Fock  orbital  energies  at  the  Fermi  surface. 

The  parts  of  the  exchange  self-energy  that  do  not  show  any  pathology  can  be 
calculated  without  any  further  special  treatment.  In  fact,  they  can  be  added  to  the 
Hartree-operator  to  form  a  modified  Hartree-Fock  approximation  that  will  have 
no  exchange  pathology,  but  does  include  most  of  the  exchange  contributions.  The 
pathological  term  in  Eq.  (8)  has  to  be  treated  separately.  As  we  will  see  below,  this 
special  treatment  of  the  singular  exchange  contributions  is  made  redundant  by  the 
inclusion  of  the  rpa  self-energy  into  a  screened  interaction,  that  does  not  show  any 
singularity. 


The  RPA  Self-Energy 

The  starting  point  of  this  work  is  the  quasi-particle  equation  introduced  by  Hedin 
and  Lundqvist  [8,9].  The  single-particle  functions  <j>i,  and  qua.si-particle  energies 
/i*  satisfy  the  equation 
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[/i(.v)  +  r(.v)  -  E,]Mx)  +  f  Z(x,  X',  E,)Mx')  dx'  =  0  ,  (9) 

where  a  and  a'  are  space-  and  spin-coordinates  (r.  ^).  Here  h  is  the  sum  of  the 
kinetic  energy  and  the  external  potential,  t’  is  the  effective  Coulomb  potential,  and 
2  is  the  self-energy,  containing  the  effect  of  exchange  and  correlation  on  the  quasi¬ 
particle  energies.  As  indicated  in  Eq.  (9)  the  self-energy  is  generally  nonlocal  and 
energy-dependent.  Hedin  suggested  to  expand  the  self-energy  in  a  screened  potential 
H' instead  of  the  unscreened  Coulomb  potential  to  improve  the  convergence  of  the 
resulting  perturbation  series  [  8  ] .  The  lowest-order  approximation  to  2  is  then  simply 
given  by  the  product  2  =  GW.  The  simplest  screened  potential  results  from  the 
RPA  screening. 

We  can  define  the  screened  interaction  in  terms  of  a  dielectric  response  function 

H'(  A',.  A'2.  “>)  =  J  ^/.VjUfAi,  A-j)e'‘{.Y3,  .Vi.  o)  .  (  10) 

where  w  is  the  frequency.  Note,  that  in  atomic  units  (/n  =  ft  =  e  =  1 ),  frequency 
and  energy  are  expressed  in  the  same  units.  Note  also,  that  Eq.  (9)  reduces  to  the 
Hartree-Fock  approximation  when  H'is  chosen  identical  to  the  unscreened  inter¬ 
action.  The  dielectric  response  function  can  be  related  to  the  irreducible  polarization 
propagator  by 

e(.V|,  .Vi.  w)  =  6(.V|,  as)  -  f  P(.y,,X},co)v(r3.r2)dX} .  (11) 


The  RPA  approximation  is  made  by  assuming,  that  E  has  the  simple  form 


E(x, 


,  -Vi . 


"T —  (/(A'i,  A'i,  Uj’)C(A'i,  .V|.  CO*  ~  ’’  . 

2ir 


(12) 


We  can  now  solve  the  equation  for  c  by  making  a  quasi-particle  assumption  about 
the  Green's  function  G.  i.e.. 


G(.V|,  .vi.  w) 


2  <t>k{Xt)(t>*(X2) 
«  -  c*.  -  /6a 


(13) 


where  the  one-particle  functions  4>k  are  assumed  to  be  the  solution  of  some  suitable 
one-particle  equation,  and  5a  is  defined  as 


5a  = 


if  ca  ^  Cf  ; 
if  ca  >  Cf  . 


Inserting  this  into  the  polarization  propagator,  we  obtain 

P(.v,.  .vi.  a>)  -  2  ~  ”  -  0a(.v.)</>?(.Vi)</)a'(.Vi)0?'(.v,)  . 

aa'Ca 


(14) 


(15) 


where  Ha  is  the  occupation  number  (either  0  or  1 )  or  of  the  state  4)^. 
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In  order  to  facilitate  further  discussion,  we  again  use  mon'entum  space  repre¬ 
sentation  for  the  quantities  involved.  We  will  therefore  trans'brm  all  quantities 
involved  to  momentum  space.  For  a  solid  the  quasi-particle  states  0a  are  assumed 
to  be  Bloch-functions  and  therefore  have  the  form  given  in  Eq.  (A1 ).  This  allows 
us  to  express  all  resulting  expressions  in  terms  of  reciprocal  lattice  summations 
only,  which  will  facilitate  the  analysis  of  the  final  result. 

Inserting  this  into  the  equations  for  the  dielectric  response  we  find 

e(q,  q  +  K,  oj)  =  5(q,  q  +  K)  -  q  +  K,  o})v{q  +  K)  ( 16) 
where  K  has  to  be  a  reciprocal  lattice  vector,  and  P'  is  given  by 


F'(q,q  +  K,Ao) 


2  y  y  -  /i,(k  -  q  +  K|) 

t  e,(k)-£,(k-q-hK,)-a, 


'J>?.(q,  k)4>^,(q  -I-  K,  k) 


(17) 


The  factor  2  in  the  numerator  is  due  to  spin,  and  the  reciprocal  lattice  vector  K| 
has  to  be  chosen  such  that  k  -  q  +  Ki  is  in  the  first  Brillouin  zone.  Here  we  have 
also  used  the  definition  of  the  Fourier  transform  of  the  interaction  potential 

47r 

i>(q)  =  -^-  (18) 


This  dielectric  function  is  known  as  the  random  phase  approximation  (  rpa  )  re¬ 
sponse  function.  For  the  homogeneous  electron  gas  it  has  a  relatively  simple  form 
due  to  the  fact  that  the  off-diagonal  elements  of  e(k.  k  +  K,  w)  vanish.  In  real 
metals  the  off-diagonal  elements  are  related  to  the  so-called  local  field  effects. 

In  order  to  examine  the  analytic  behavior  of  the  screened  interaction  it  is  sufficient 
to  examine  the  diagonal  approximation  to  this  function  only.  Wiser  has  shown  that 
the  off-diagonal  elements  of  the  rpa  response  function  are  often  negligible  [15]. 
Moreover,  we  are  mostly  interested  in  the  behavior  of  the  dielectric  response  function 
at  the  Fermi  surface,  where  the  structure  effects  are  most  efficiently  screened  out 
bv  the  electron  cloud.  The  diagonal  approximation  to  the  dielectric  response  is: 

c(q.q,w)  =  1  -  P'(q,q,  w)u(q)  (19) 

Using  this  expression  in  the  screened  interaction,  we  find 
W(q,  q,  w)  =  U(q)c~'(q,  q,  os) 

=  y - T-^ -  (20) 

K  ,  47r 

q  -  —  P  (q,  q,  w) 


It  is  clear  that  the  limiting  case  for  both  q  and  w  vanishing  is  dominated  by  the 
behavior  of  P'  in  that  limit.  It  is  now  easy  to  show  that 
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lim  P'(q,  q,  0)  =  lim  2  2 
v-o  «-o47r 


«^(k)  -  w,(k  -  q  4-  K,) 
c„(k)  -  e,(k  -  q  +  K,) 


X  4>*  (q.  k)<>^„(q,  k) 

_L  y  y  «^(k)  -  «;,(k  -  q) 
™o47r'f  f  c,(k)-f,(k-q) 


1  y  q-VA«;;(k) 
47r^  ^  q-V,c^(k) 


(21) 


where  ^  sums  only  over  those  bands  that  cut  through  the  Fermi  surface.  This  last 
expression  on  the  right-hand  side  can  be  evaluated  by  realizing  that  the  gradient  of 
the  occupation  number  vanishes  everywhere  except  at  the  Fermi  surface,  where  it 
has  the  value 


Vni(kf)  =  -Sf,  (22) 

where  S/r  is  a  unit  vector  normal  to  the  Fermi  surface.  Then  the  sum  (in  the  con¬ 
tinuous  limit  an  integral )  of  Eq.  (21)  becomes  a  surface  sum,  which  is  exactly  equal 
to 


—  2  - ^ — 


=  -VMp 


(23) 


where  Nf  is  the  density  of  states  at  the  Fermi  surface.  We  then  finally  find 

lim  c(q,  q,  0)  =  1  +  47rFAf/r/q^  .  (24) 

This  last  result  is  identical  to  the  Thomas-Feimi  screening  formula,  which,  of 
course,  is  exact  in  the  long  wavelength  limit.  However,  it  also  shows  that  the  use 
of  Hartree-Fock  bandstructures  as  a  starting  point  for  correlated  quasi-particle 
band  structures  is  not  a  good  choice,  because  the  density  of  states  at  the  Fermi 
surface  vanishes  in  that  case.  Then,  the  screening  at  the  Fermi  surface  vanishes 
also,  leaving  an  effectively  bare  interaction  at  the  Fermi  surface.  The  singularity 
introduced  by  the  exchange  will  therefore  not  be  lifted,  but  merely  reduced  in 
strength.  It  is  quite  easy  to  prove  that  the  Hartree-Fock  singularity  in  the  density 
of  states  will  in  lowest  order  be  reduced  to  a  In  In  q  singularity  [20).  Moreover, 
the  correction  to  the  band  energy  at  the  Fermi  surface  will  be  related  to  an  effe^'llve 
potential  of  the  form  [21]: 

W'’(q,  q,  0)  ~  (e-'(q,  q,  0)  -  1  )u(q) .  (25) 

This  expression  will  vanish  in  the  long-wavelength  limit,  showing  again  that  little 
improvement  of  the  bandstructure  around  the  Fermi  surface  will  be  obtained. 


Discussion 

In  this  article  we  have  shown  that  the  pathologies  introduced  by  the  exchange 
in  density  of  states  are  removed  by  the  RPa  contribution  to  the  self-energy. 
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We  have,  in  addition,  shown,  that  the  resulting  expression  does  not  contain  any 
pathological  terms.  In  fact,  it  behaves  completely  regular  for  long  wavelengths.  The 
resulting  effective  potential  has  the  form  of  a  screened  exchange.  This  is  similar  to 
the  situation  for  the  homogeneous  electron  gas.  In  fact,  all  the  long  wavelength 
cancellations  that  characterize  the  homogeneous  electron  gas  will  also  occur  in  the 
periodically  extended  systems,  because  in  the  long-wavelength  limit  the  structure 
of  the  lattice  has  no  influence  on  the  expressions. 

Iteration  of  Eq.  (9)  will  not  introduce  any  new  singularities  and  neither  will 
vertex  corrections,  because  the  latter  tend  to  reduce  the  singularities  in  the  potential 
by  softening  the  delta-function  behavior  of  the  vertex. 

In  addition  we  showed  that  the  dielectric  response  thus  calculated  will  only  give 
a  screened  potential  in  the  limits  ^  0  and  co  -►  0  when  non-Hartree-Fock  orbitals 

are  used  as  unperturbed  states.  The  pathology  introduced  by  the  exchange  will  only 
be  lifted  if  correlation  and  exchange  are  combined  at  every  level  of  theory.  This 
result  was  also  anticipated  by  those  obtained  for  the  homogeneous  electron  gas. 

Although  the  results  in  this  article  are  obtained  within  the  quasi-particle  approx¬ 
imation,  we  do  not  believe  this  is  an  important  limitation.  Calculations  on  molecules 
and  extended  systems  have  shown  that  this  approximation  does  not  produce  serious 
errors  in  the  determinatior  of  electron  detachment  energies  [16-18]. 

The  results  derived  in  this  article  do  not  rely  in  any  way  on  the  shape  of  the 
Fermi  surface.  This  result  is  in  line  with  the  earlier  observation  by  Monkhorst  that 
the  singularities  of  the  exchange  interaction  are  related  only  to  the  incomplete 
integration  inside  the  BZ  and  not  to  the  actual  shape  of  the  enclosing  surface(s). 

Another  noteworthy  point  is  the  fact  that  the  results  do  not  in  any  way  depend 
on  the  basis  set  that  is  used  in  obtainmg  the  orbitals.  The  only  restriction  might  be 
the  fact  that  the  basis  functions  (and  hence  the  orbital  set)  must  have  a  Fourier 
transform.  In  most  of  the  ab  initio  band  structure  calculations  the  basis  functions 
are  either  Gaussian  functions  or  Slater-type  functions,  which  both  have  easily  cal¬ 
culated  Fourier  transforms.  The  technique  described  in  this  article  can  therefore 
without  problem  be  applied  to  the  bandstructures  obtained  with  these  basis  sets. 
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Appendix  A:  Momentum  Space  Representation  of  the  Two-Electron  Integrals 

The  solutions  of  the  Hartree  equations  form  an  orlhonormal  set  of  orbitals  for 
an  infinite  system.  We  will  denote  these  orbitals  by  <^>,,k(r)  and  the  associated  orbital 
energies  by  c^(k).  We  can  expand  these  orbitals  in  a  complete  set  of  crystal  plane 
waves: 


<^^k(r) 


iV- 


2  <^.(G  +  k) 

G 


exp(/(G  +  k)-  r) 
y  1/2 


(Al) 
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where  Q  is  the  volume  of  the  unit  cell,  V  =  NQ  {N  is  the  number  of  unit  cells  in 
the  total  volume),  and 

0,(G  +  k)  =  J  .  (A2) 

In  these  expressions  the  vector  G  is  a  reciprocal  lattice  vector,  and  the  sum  in  Eq. 
( A I )  runs  over  all  reciprocal  lattice  vectors.  A  g“neral  two-electron  matrix  element 
between  four  of  these  Hartree  orbitals  can  then  be  written 

(fik;  vk'|/T2  |p(k  +  q);  a{k'  -  q)>  =  y 


X  I 


0?(G,  +  k)<^?(G.  +  k').^,(G,  +  k  +  q).^.(G„  +  k'  -  q) 
(G,  -  G„  +  q)' 


with  the  additional  condition  on  the  summations  that 

G,,  +  G^  =  Gp  -1-  G., . 

Using  this  condition,  and  writing  G  =  Gp  -  Gp,  and  using  the  notation 
k)  =  ^  2  <p*(G  +  k)<PAG  +  Q  +  k) . 


we  finally  obtain 

<pk;  I'k'IrT:  |p(k  +  q);  <r(k'  -  q)> 


Note  that  Eq.  (A5)  implies  that 


_  ^  y  tp„(G  +  q.  k)j>,p(-G  -  q.  k') 

Vn  (G  +  q)  = 


MO.  k)  =  Z  <^p*(G  +  k)<^,(G  +  k)  =  5p.,(k)  =  6p..  (A7) 
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In  Memoriam 
Michael  Andreas  Schliiter 
1945-1992 

Though  not  a  frequent  participant  in  the  Sanibel  meetings,  Michael  Schliiter’s 
work  in  many-fermion  theory,  especially  density  functional  theory  and  studies  of 
complex  systems,  was  so  important  that  he  was  a  presence  in  many  sessions  he  did 
not  attend.  To  give  but  one  example,  his  paper  with  Lu  Sham  on  the  so-called 
band-gap  problem  in  density  functional  theory[  1  ]  was  cited  many  times  in  both 
presentations  and  discussion. 

Two  times  he  gave  plenary  Sanibel  lectures,  in  1984  (“Point  Defects  in  Semi¬ 
conductors  with  Strong  Electron-Lattice  Coupling”[2])  and  1988  (“Electronic 
Structure  and  Superconductivity  of  High  Tc  Cu  Oxides”[3]).  Each  was  a  model  of 
his  clarity  of  thought  and  ability  to  communicate  truly  central  issues. 

Perhaps  as  remarkable  were  his  comments  from  the  floor.  SBT  recalls  the  1984 
Symposium  as  a  particularly  lovely  example.  There  had  been  much  discussion  on 
use,  in  solids,  of  self-interaction  corrections  to  local  density  approximations.  Even¬ 
tually  Michael  asked  for  the  floor,  rose  quietly  to  full  height,  and  gave  a  one-sentence 
diagnosis  of  the  difficulty:  “The  problem  with  self-interaction  corrections  is  that 
there  ;s  no  intrinsic  scale  length  over  which  to  localize  them  in  anything  but  an 
atom.”  Equally  quietly,  he  sat  down. 

Born  in  Germany,  Michael  learned  fluent  French  on  account  of  his  girlfriend, 
then  wife,  now  widow.  HJM  recalls  their  meeting  for  the  first  time  at  the  1971 
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Menton  Conference,  with  hikes  between  sessions  devoted  to  French  practice,  fun¬ 
damental  problems  in  semiconductors,  and  reflections  on  moving  to  the  USA. 
(HJM  had  already  emigrated.  Years  later  he  and  SBT  would  try.  in  vain,  to  persuade 
Michael  to  move  again,  this  time  to  University  of  Florida.)  Michael  was  already 
participating  as  a  full  peer  in  luncheon  discussions  with  frequent  Sanibel  participants 
such  as  Norman  March,  Jan  Linderberg,  Frank  Harris,  Jean-Louis  Calais,  Keith 
Johnson,  et  al.  One  particularly  notable  event  was  a  dinner  in  the  mountains  above 
Menton  that  included  John  Ziman.  Michael  began  his  Diplom  studies  at  University 
of  Karlsruhe  in  LFs  group  in  1968.  Recognized  immediately  for  his  innate  ability 
to  understand  and  communicate  physics,  he  became  a  kind  of  walking  reference. 
LF  recalls  dispatching  Michael  to  brief  various  experimental  groups  and  getting 
back  rave  reviews.  After  the  Diplom,  Michael  confessed  his  chagrin  at  not  speaking 
French.  In  response,  LF  suggested  that  Michael  could  join  a  friend’s  group  at  the 
new  EPFL  in  Lausanne.  He  did  and,  at  breathtaking  speed,  both  picked  up  French 
and  wrote  a  pseudo-potential  code  with  which  he  studied  layered  compounds.  In 
1972,  a  Swiss  Science  Foundation  fellowship  took  him  to  Marvin  Cohen’s  group 
at  Berkeley  as  a  Postdoc.  He  went  to  Bell  Labs  in  1975  and  continued  to  build  the 
record  of  achievement  noted  above. 

The  three  of  us  are  honored  to  have  had  the  opportunity  to  give  memorial  remarks 
at  the  outset  of  the  “Michael  Schliiter  Memorial  Session  on  Density  Functional 
Theory’’  at  the  33rd  Sanibel  Symposium.  We  are  grateful  that  our  lives  were  graced 
by  his  life  and  much  saddened  at  the  premature  loss. 

Lothar  Fritsche 

Institul  Jitr  Theoretische  Physik.  Technische  Universilat  Claiislhal 
Henk  Monkhorst 
Sam  Trickey 

Quantum  Theory  Project.  University  of  Florida 

[1]  L.J.  Sham  and  M.  Schluter.  Phys.  Rev.  Lett.  51,  1884  (1983) 

[2]  G.A.  BarafT  and  M.  Schliiter.  Int.  J.  Quantum  Chcm.  SIS,  737  ( 1 984). 
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Abstract 

In  many  situations,  the  most  long-ranged  parts  of  the  exchange  and  correlation  holes  surrounding  an 
electron  cancel  one  another  Apparently  for  this  reason,  local  spin  density  and  generalized  gradient 
approximations  are  more  accurate  for  exchange  and  correlation  together  than  for  cither  alone.  A  study 
IS  made  of  the  ability  of  these  density  functionals,  and  also  the  unmodified  second-order  gradient  expansion, 
to  describe  vanous  short-range  effects  in  atoms  the  correlation  contnbution  to  the  interacting  kinetic 
energy,  the  antiparallel-spin  correlation  energy,  and  the  correction  to  ihc  random  phase  approximation. 
Generalized  gradient  approximations,  constructed  with  no  adjustable  parameter  from  the  electron  gas 
of  slowly  varying  density,  are  found  to  give  results  of  useful  accuracy  for  real  atoms  Prospects  arc 
discussed  for  use  of  the  new  functionals  to  improve  the  accuracy  of  electronic-structure  calculations 
€/  1993  John  Wiley  &  Sons.  Inc. 


Introduction 

The  noninteracting  kinetic  energy,  the  exchange  energy,  and  the  correlation  energy 
are  all  functionals  of  the  electron  density  [1,2],  The  first  two  anse  from  the  exchange 
hole  surrounding  an  electron,  while  the  third  arises  from  the  coupling-constant- 
averaged  correlation  hole.  The  holes  may  be  expanded  in  powers  of  the  density 
derivatives  at  the  position  of  the  electron..  The  resulting  gradient  expansions  have 
a  characteristic  structure:  When  the  density  does  not  vary  too  rapidly  over  space, 
the  addition  of  each  successive  order  in  V  improves  the  approximate  hole  close  to 
its  electron,  while  worsening  it  far  away  [3,4], 

From  this  analysis,  the  local  spin  density  ( LSD),  second-order  gradient  expansion 
(GEa),  and  generalized  gradient  (gga)  approximation, 

£[rtt,  nj]  =  J  t/V e(«j,  n*.  Vwf ,  V«j) ,  (1) 

are  expected  to  be  relatively  more  accurate  for  energies  that  sample  the  more  short- 
ranged  features  of  the  holes.  For  example,  the  noninteracting  kinetic  energy  samples 
only  derivatives  of  the  exchange  hole  at  the  position  of  the  electron;  its  GEA  makes 
typical  errors  of  less  than  1%,  and  is  its  own  GGA  [5], 
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This  analysis  suggests  why  it  is  often  better  to  treat  exchange  and  correlation 
together  in  the  same  density  functional  approximation  than  to  combine  an  exact 
( Hartree-Fock )  description  of  exchange  with  a  density  functional  approximation 
for  correlation.  A  striking  example  is  the  atomization  energy  of  Ci  [6],  which  is 
0.7  eV  in  Hartree-Fock,  2.3  eV  when  GGA  correlation  is  added,  and  6.55  eV  when 
both  exchange  and  correlation  are  treated  in  GGa;  the  experimental  value  is  6.36 
eV.  The  explanation  seems  to  be  that  the  exact  exchange  and  correlation  holes  have 
long-ranged  components  which  tend  to  cancel  one  another.  The  same  cancellation 
between  the  most  long-ranged  components  of  exchange  and  parallel-spin  correlation 
occurs  in  the  electron  gas  of  uniform  density;  each  hole  has  a  nonoscillatory 
tail,  where  R  is  distance  from  the  electron  [3,7].  The  residue  of  this  cancellation 
is  plasmon-dominated  [3,7]  at  large  R  and  falls  off  like  R~^. 

In  previous  work,  accurate  generalized  gradient  approximations  for  the  exchange 
energy  (£J  and  correlation  energy  (£,)  have  been  developed  from  first  principles 
by  cutting  off  the  spurious  long-range  parts  of  the  second-order  gradient  expansions 
for  the  holes  [  8,9  ] .  From  the  gga  for  E, ,  a  GGA  for  the  correlation  contribution 
r<  to  the  interacting  kinetic  energy  has  been  developed  [5,10]  and  shown  [10]  to 
be  independent  of  the  most  long-ranged  part  of  the  correlation  hole. 

The  aim  of  this  work  is  to  develop  and  test  density  functionals  for  two  other 
short-range  properties;  the  antiparallel-spin  correlation  energy  ( next  section )  and 
the  correction  to  the  random  phase  approximation  (third  section).  Table  I  compares 
GGA  and  nearly  exact  values  [  1 1  - 1 5  ] ,  where  available,  and  confirms  the  good  success 
that  GGA  has  had  for  the  total  correlation  energy.  Prospects  for  the  practical  use  of 
the  new  functionals  are  discussed  in  a  later  section. 

Table  II  compares  LSD,  gea,  and  gga  results  for  the  neon  atom.  Note  that  gea 
is  typically  very  unrealistic  due  to  its  violation  of  a  sum  rule  on  the  correlation  hole 


Table  1  Comparison  between  generalized  gradient  approximation  (gga)  and  exact  values  for 
(minus  the  correlation  energy).  E  (the  correlation  conlnbution  to  the  kinetic  energy),  -lEj* 
(the  antiparallel-spin  contribution  to  -EV).  and  EV-E'?'’*  (the  correction  to  the  random  phase 

approximation.* 


Atom 

~h 

'•< 

£ 

-i 

t-n 

"  c 

EVE'?’’* 

gc;a 

Exact 

GGA 

Exact 

GGA 

Exact 

GGA 

He 

0.046* 

Be 

Ne 

— 

0  306 

•Ref.  1 10) 
"Ref.  (11). 
'Ref.  (12) 
■’Ref.  [13]. 
'Ref.  (14). 
'Ref.  [15] 
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Table  II  Companson  among  local  spin  density  (LSD),  second-order  gradient  expansion  (gea).  and 
generalized  gradient  approximation  (gga)  results  for  the  neon  atom  (hartree).* 


Property 

LSD 

GEA 

GGA 

Exact 

T, 

117.76 

127.83 

127.83 

128.55 

-E, 

11.033 

11  775 

12.115 

12.108 

-Ec 

0.743 

-0.780 

0.382 

0.39 

Tc 

0.496 

0.530 

0.313 

— 

-EV 

0.434 

-0.217 

0.306 

0.31 

E.-Ef’’'' 

0.201 

0.174 

0.188 

— 

*  (the  noninteraciing  kinetic  energy)  and  E,  (the  exchange  energy)  from  Refs  [5]  and  [6]. 


respected  by  LSD  and  restored  tn  GGA.  LSD  is  rather  unrealistic  for  the  total  cor¬ 
relation  energy,  but  less  so  for  the  short-range  contributions,  as  expected. 

Antiparallel-Spin  Correlation  Energy 

At  least  in  the  uniform  electron  gas  [7],  and  perhaps  more  generally,  the  most 
long-ranged  part  of  the  correlation  hole  around  an  electron  ( which  cancels  the  most 
long-ranged  part  of  the  exchange  hole)  arises  from  parallel-spin  correlation  alone. 
The  GGA  for  the  loial  correlation  energy  is  [6.9]  (in  atomic  units) 

£'<  [«t’  «*]  =  J  n[c,.(/y,  f)  +  //(f,  c<(r„  f),  C<.(r,))] .  (2) 

where  n  =  +  tii  -  %7rrf  =  tt/c?/  192,  ^  =  (n^  -  ni)/n, 

l=\Vn\/2gk,n  (3) 

and 

[(1 +n'''  +  (l -n’'']/2.  (4) 

Note  that  e,,  //,  and  g  are  even  functions  of  f.  For  a  slowly  varying  density  (;  -► 
0),  the  right-hand  side  of  Eq.  (2)  becomes 

J  d^rncc{r„  f)  +  J  </V(Co(g)  -  'iCJl]g\Vn\~ +  •  •  ',  (5) 

where  =  -0.001667.  The  first  term  of  Eq.  (5)  is  LSD  and  the  sum  of  the  first 
two  terms  is  GEA .  The  second  or  gradient  term  depends  weakly  upon  C  since  g 
varies  from  1  at  f  =  0  to  2'^’  at  f  =  1,  and  neglects  small  [16]  Vf  contributions  to 
the  first-principles  gea  .  The  gradient  coefficient  C(  r,)g  is  doninated  by  long-wave¬ 
length,  plasmonlike  excitations  [  3  ] .  In  a  plasma  oscillation,  all  electrons  participate 
equally  without  regard  to  their  spins.  This  observation  suggests  that  the  parallel- 
spin  contribution  to  C{r,)g  is  proportional  to  the  probability  that  two  electron 
spins  are  parallel,  [( 1  -E  (■)/2]'  -E  (( 1  -  f)/2]^,  which  varies  from  1/2  at  f  =  0  to 
1  at  C=  ±1. 
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The  //|  term  oi  H  =  Ho  +  is  an  anomalous  [9]  parallel-spin  contribution 
which  gives  rise  to  the  -3C\n  piece  of  the  gradient  term  in  Eq.  ( 5 ).  Thus,  without 
the  painstaking  derivation  required  by  the  geas  for  £,  and  T, ,  a  guessed  GGa  for 
antiparallel-spin  correlation  is 

=  J  dh  «[£<V(r„  n  +  HoU.  rs,  r.  f).  f))] ,  (6) 

with  P  -  V  ClHi'i,  f)  in  Eq.  ( 13)  of  Ref.  [6].  This  reduces  to 

J  d^rnclHrs,  f )  +  J  dhCl^in,  Vn| +  •  •  •  (7) 


in  the  slowly-varying  limit.  From  the  high-density  limit  of  the  uniform  electron 
gas,  the  antiparallel-spin  correlation  energy  per  electron  is  approximately*  [7] 


n  =  e,(rs,  f)  -  e<(r^,  1) 


(8) 


For  f  =  0  and  0.01  <  rj  <  2,  Eq.  (8)  deviates  by  no  more  than  about  ,5%  from  the 
estimate  of  Nozierer  and  Pines  [18]  who  started  from  second-order  perturbation 
theory  and  cut  off  the  unconverged  small-wavevector  contribution;  at  r,  =  5,  the 
deviation  is  still  only  12%.  From  the  argument  at  the  end  of  the  preceding  paragraph. 


n  =  C,(r,) 


1  - 


(9) 


For  f  =  ±1.  Cl*,  and  fl*  all  properly  vanish.  The  integrand  of  Eq.  (6),  like 
that  of  Eq.  (2),  is  everywhere  properly  nonpositive  and  vanishes  as  /  -►  oc.  The 
antiparallel-spin  correlation  potential  5£'l*/6n„(r)  can  be  found  from  Eq.  (.33)  of 
Ref.  [9]. 

While  Eq.  ( 2 )  is  derived,  Eq.  ( 6 )  is  only  an  obvious  guess  which  could  be  improved 
by  refining  the  ingredients  cl*  and  Cl*,  and  by  making  a  real  space  cutoff  [8,9]  on 
the  gradient  expansion  of  the  antiparallel-spin  correlation  hole. 

LSD,  GEA,  and  GGA  results  for  1 7  spherical  atoms  or  ions  with  Hartree-Fock 
densities  [  19]  are  presented  in  Table  III.  Note  that  the  contribution  of  antiparallel- 
spin  correlation  to  -E,  varies  from  about  90%  for  few-electron  spin-unpolarized 
atoms  to  74%  for  the  54-electron  atom  Xe.  Contrast  this  with  the  spin-unpolarized 
uniform  electron  gas  of  high  density,  in  which  only  60%  of  the  correlation  energy 
arises  from  antiparallel  spins.  Preferential  suppression  of  long-range  parallel-spin 
correlation  is  a  characteristic  feature  of  small  systems  like  atoms. 

There  are  two  reasons  to  hope  that  this  gga  for  £j*  might  be  more  accurate 
than  the  gga  for  £, :  ( 1 )  GGas  are  better  suited  to  more  short-ranged  effects.  ( 2 ) 
This  GGA  for  £j*  is  self-interaction-free,  i.e.,  it  properly  vanishes  for  a  one-electron 
system. 


•  The  model  pair  distnbution  function  which  implies  this  result  for  the  uniform  electron  gas  is  supported 
by  results  of  Pickett  [17] 
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Tabi.1.  111.  Comparison  of  LSD.  gea.  and  gga  results  (hartree).' 


Atom 

-i 

:U 

•'  C 

A>i 

,*RPA 

<• 

LSD 

GhA 

GGA 

LSD 

GEA 

GGA 

% 

H 

0.000 

-0.000 

0.000 

0.014 

He* 

0.000 

-0.000 

0.000 

204% 

He 

-0.035 

88% 

0.037 

67% 

Li* 

-0.089 

94% 

0.039 

0.020 

0.033 

65% 

Be=* 

-0.147 

99% 

0.041 

0.022 

0.034 

64% 

Li 

-0.068 

0.050 

86% 

0.054 

0.037 

0.045 

78% 

Be* 

0.091 

-0.108 

0.057 

91% 

0.057 

0.040 

0048 

77% 

Be 

0.130 

-0.099 

0.084 

89% 

0.072 

0.049 

0  062 

66% 

Neo* 

0.195 

-0.454 

0  123 

100% 

0.083 

0  063 

0072 

59% 

N 

0.227 

-0.145 

0.166 

83% 

0.136 

0.111 

0.124 

62% 

Ne 

0  434 

-0.217 

0.306 

80% 

0.201 

0.174 

0.188 

49% 

Ar 

0  8.32 

-0.432 

0.606 

0.338 

45% 

Zn'-* 

1.057 

-1.243 

0.762 

82% 

0.393 

41% 

I.5II 

-0.714 

1.134 

mSm 

0.598 

0.579 

0.580 

39% 

Zn 

1.556 

-0.636 

1.167 

mSM 

0.630 

0.614 

0.613 

Kr 

1.916 

-0.777 

1.450 

Hi 

0.760 

0.742 

0.743 

39% 

Xe 

3.038 

-1.183 

2.344 

1.153 

1.145 

1.135 

36% 

*  The  column  displass  the  ratio  of  the  gga  for  a  given  property  to  the  gga  for  . 


The  self-interaction  error  of  the  LSD  or  GOA  approximation  for  the  total  exchange- 
correlation  energy  has  several  undesirable  consequences,  which  are  avoided  by  ap¬ 
proximating  only  the  antiparallel-spin  correlation  energy.  One  of  the  most  striking 
is  the  self-interaction  error  of  the  exchange-correlation  potential  Sf,,  /  bn„{  r ) ,  which 
is  roughly  a  positive  constant  over  the  interior  of  an  atom,  but  decays  like  +c^/r 
as  r  -*•  X .  This  error  of  the  potential  is  qualitatively  similar  in  the  neon  [20]  and 
hydrogen  atoms.  Another  error  of  the  GGA  potential,  its  divergence  (like  V*n)  as 
r  -*■  0,  is  probably  reduced  but  not  eliminated  by  approximating  only  antiparallel- 
spin  correlation. 


Correction  to  the  Random  Phase  Approximation 

The  random  phase  approximation  [3]  (RPA,  sum  of  ring  diagrams,  or  time- 
dependent  Hartree  approximation )  is  exact  for  long-range  correlation,  because  it 
constructs  the  correlation  hole  from  a  density-response  function  X(r,  r',  w)  which 
becomes  exact  in  the  limit  |r'  -  rl  -►  oo.  However,  RPA  is  poor  for  short-range 
correlation.  For  the  uniform  election  gas  at  metallic  densities,  it  produces  a  cor¬ 
relation  hole  at  the  position  of  the  electron  (r'  =  r)  that  is  much  too  deep. 

The  GGA  of  Eq.  ( 2 )  is  constructed  beyond  the  random  phase  approximation 
(RPA).  However,  it  is  easy  [6]  to  make  an  RPA  version  by  using  RPA  results  for  the 
ingredients  c<(r,,  f)  and  Q(r,).  The  cutoff  parameter  [9]  in  H  should  remain 
unchanged,  since  RPA  should  become  relatively  exact  in  the  high-density  limit. 
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Table  IV.  Parameters  of  Eq.(  10)  for  the 
second-order  gradient  coefficient  of  the 
correlation  energy,  withtn  and  beyond  the 
random  phase  approximation  (rpa) 


RPA 

Beyond* 

a 

0.001254 

0.023266 

n 

2.6  X  10-’ 

7.389  X  10-’ 

7 

0.444 

8.723 

0.0071 

0.0472 

•  Ref  [22], 


The  Perdew-Wang  analytic  representation  [21]  for  e,(r,,  f)  works  equally  w'ell 
w-ithin  or  beyond  RPA;  only  the  parameters  change,  as  showin  in  Table  1  of  Ref. 
[21],  The  Rasolt-Geldart  analytic  representation  [22] 


C(r,) 


0.002568  +  ar,  + 

1  +  +  hr-  +  \0^^rl  ’ 


(10) 


may  also  be  used  within  rpa.  Table  IV  shows  the  parameters  determined  here 
from  a  fit  to  the  rpa  C<(r,)  from  Ref.  [3]  at  r,  =  2,  4.  6.  and  12. 

The  difference  between  Eq.  (2 )  evaluated  beyond  and  within  rpa  is  the  gga  for 
£<  -  £f ,  the  correction  to  the  random  phase  approximation.  LSD,  gea,  and  gga 
results  for  17  spherical  atoms  and  ions  are  presented  in  Table  111.  [An  alternative 
GGA  for  fJf''  -  £,  could  be  constructed  in  the  form  of  Eq.  (6),  but  with  “rpa 
minus  exact”  inputs  for  c,  and  C< .] 

£,  -  £f'’^  is  the  only  correlation  energy  studied  here  for  which  gea  provides  a 
reasonable  estimate,*  and  for  which  lsd  is  close  to  gga  .  Note  that  it  is  an  important 
part  of  the  total  correlation  energy  (67%  of-£,  for  He  and  36%  for  Xe).  Moreover, 
(£<  -  £”‘’'^)/A’  (the  contribution  per  electron)  is  typically  close  to  0.024  hartree, 
the  second-order  exchange  constant  which  corrects  the  RPA  e,(r,,  f)  in  the  high- 
density  limit  r,  0. 

Table  III  shows  that,  for  spin-unpolarired  atoms  within  LSD,  £<  -  is 
typically  between  40%  and  60%  of  -  £**.  This  result  is  not  unexpected;  wavevector 
analysis  of  the  RPA  correlation  energy  for  the  uniform  electron  gas  shows  [18] 
that  RPA  is  exact  in  the  small-wavevector  limit  (long-range,  parallel-spin  correla¬ 
tion),  but  IS  too  negative  by  a  factor  of  two  in  the  large-wavevector  limit  (short- 
range.  antiparallel-spin  correlation). 


Prospects 

LSD  and  GGA  density  functionals  have  been  constructed  and  tested  for  two  short- 
range  contributions  to  the  correlation  energy:  antiparallel-spin  and  non-RPA.  It 

Vosko  and  Langrclh  ( 2.t  ]  have  suggeslcd  that  gea  may  suffice  for  the  second-order  exchange  energy, 
the  leading  correction  to  ri’a  in  the  high-density  limit. 
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appears  that  functionals  of  this  kind  are  potentially  more  accurate  for  the  short- 
range  contributions  than  for  the  total  correlation  energy.  How  could  these  new 
functionals  be  used  to  improve  the  accuracy  of  electronic-structure  calculations? 

The  GGA  for  antiparallel-spin  correlation  might  be  combined  with  some  more 
accurate  description  of  exchange  plus  parallel-spin  correlation  energy .  Quantum- 
chemical  methods  like  “coupled  cluster"  [24]  are  well  suited  for  this  purpose,  since 
they  would  not  be  required  to  build  up  the  cusp  of  the  correlation  hole  at  the 
position  of  the  electron,  which  is  carried  by  other  electrons  v/ith  antiparallel  spins 
[7],  This  hybrid  approach  is  similar  to  that  of  Savin  [25],  who  made  a  different 
separation  between  long-range  and  short-range  correlation.  Alternatively,  a  fully 
nonlocal  weighted  density  approximation  [26]  might  be  used  for  exchange  and 
parallel-spin  correlation  [27,28]. 

The  GGA  for  £,  —  might  be  combined  [27]  with  a  full  rpa  calculation  of 
the  exchange  and  correlation  energies.  RP.A-like  calculations  have  been  earned  out 
for  finite  systems  [29.30]  and  for  metallic  hydrogen  [31],  but  usually  in  versions 
of  RPA  more  elaborate  than  the  simple  ring  sum  needed  here. 

The  generally  good  results  obtained  from  a  gga  with  no  adjustable  parameter 
should  finally  answer  the  old  question;  “What  does  the  electron  gas  of  slowly- 
varying  density  have  to  do  with  real  atoms  and  molecules?"  It  is  only  human  beings 
who  draw  a  distinction  between  condensed  matter  physics  and  quantum  chemistry; 
to  Nature,  electronic  systems  form  a  seamless  whole. 

Acknowledgments 

This  work  was  supported  in  part  by  the  National  Science  Foundation  under 
Grant  DMR92-13755.  The  author  thanks  A.  Savin  and  J.  Oddershede  for  helpful 
references. 


Bibliography 

(1 1  W.  Kohn  and  L.  J.  Sham.  Phvs  Rc'.  I-(0.  A1 13.t  ( 196.“'). 

[2]  R.  O  Jones  and  O.  Gunnarsson.  Rev.  Mod.  Phvs  61. 689  ( 1989) 

(.3)  D.  C  Langrelh  and  J.  P  Perdew.  Phvs  Rev.  B  21 . 5469  (  1980) 

(4)  V.  Wang.  J.  P.  Perdew.  J.  ,A.  Chevarv.  L.  D  Macdonald,  and  S.  H  Vosko.  Phvs.  Rev.  .A4I.  78 
( 1990). 

(5)  J.  P.  Perdew.  Ph>s.  Lclt.  A 165,  79  { 1992) 

(6)  J.  P.  Perdew.  j.  A.  Chevarv.  S  H.  Vosko.  K  A  Jaekson.  M  R.  Pederson.  D.  J.  Singh,  and  C. 
Fiolhais.  Phvs.  Rev.  B46,  6671  ( 1992). 

(7)  J.  P  Perdew  and  Y  Wang.  Phvs.  Rev.  B46.  12947  ( 1992). 

(8)  J  P  Perdew  and  Y  Wang.  Phvs.  Rev.  B-T3.  88(H)  ( 1986) 

(9)  j  P  Perdew.  in  l.lalwnn  Sinulurv  o!  Soluh  VI.  P.  Ziesche  and  11.  Lschrig.  i  ds.  (.Akademie 
Vcriag.  Berlin.  1991 ).  p.  II. 

( 10)  A.  Gorhng.  M  l.cvv.  and  J  P.  Perdew,  Phvs.  Rev.  B47.  1 167  ( 199.3 ). 

(11)  .A  Veillard  and  E  Clementi.  J.  Chem.  Phvs.  49,  2415  ( 1968).  H.  .Stoll  and  A  Savin,  private 
eommunieation. 

(12)  J.  B  LagowskiandS  H.  Vosko.  J.  Phvs.  B21 , 20.3  ( 1988). 

1 1.3)  C.-O.  Almbladh  and  A.  C.  Pedro/a.  Phvs.  Rev.  .A29.  2.322  ( 1984 ) 

1 14)  R.  K  Nesbet.  Phvs.  Rev  I.S5.  51  ( 1967). 

(15)  R.  K  Nesbel.  T  L  Barr,  and  I;  R  Davidson.  Chem.  Phvs  I  ell  4,  20.3  ( I9('9) 


100 


PERDEW 


[  16]  Y.  Wang  and  J  P  Perdew,  Phys.  Rev.  B  43.  891 1  ( 1991 ), 

[17]  W.  Pickett  and  J.  Q.  Broughton,  quantum  Monte  Carlo  results  (unpublished). 

[18]  P.  Noziefes  and  D.  Pines.  Phys.  Rev.  111.  442  (1958).  See  Eq.  ( 5 1 ) . 

[ 19]  E.  Clement!  and  C.  Roetti.  At.  Data  NucI  Data  Tables  14.  177  ( 1974). 

[20]  M  Teter.  Int  J.  Quantum  Chem.  Quantum  Symp.  27;  C.  J.  Umngar  and  X.  Gonze.  Bull.  Am. 
Phys  Soc.  38.  381  (1993). 

[21]  J.  P.  Perdew  and  Y.  Wang.  Phys.  Rev.  B4S.  13244(1992). 

[22]  M.  Rasolt  and  D.  J.  W.  Geldart.  Phys.  Rev.  B34.  1325  ( 1986). 

[23]  S.  H.  Vosko  and  D.  C.  Langreth.  private  communication. 

[24]  G.  Stollhoff  and  A  Hcilingbrunner.  Z.  Phys.  B83.  85  ( 1991 ). 

[25]  A  Savin.  Int.  J  Quantum  Chem.  Quantum  Symp.  22. 59 ( 1988).  J.  Chim  Phys.  86.  757  ( 1989). 

[26]  O.  Gunnarsson.  M.  Jonson.  and  B.  I  Lundqvist.  Phys.  Rev.  B20.  3136  ( 1979). 

[27]  J.  P.  Perdew.  in  Ek-uronu  Dciisin  Fiimitonal  Ttwon  of  Mokxules.  Clusters,  and  Sahds.  D.  E. 
Ellis.  Ed.  (Kluwer  Academic.  1993). 

[28]  A  Rubio  and  J.  P.  Perdew,  work  in  progress. 

[29]  A  Szaboand  N.  S.  Ostiund.  J.  Chem.  Phys.  67,4351  ( 1977);  J.  Oddershede,  Adv.  Quant.  Chem. 
11.275  (1978). 

[  30  ]  F.  E  Hams.  H.  J  Monkhorst.  and  D.  L.  Freeman.  Algehrau  and  Diugraininalic  Methods  in  Many- 
Fermion  Theon  (Oxford  University  Press,  New  York,  1992). 

[31]  H.  J  Monkhorst  and  J  Oddershede,  Phys  Rev.  Lett.  30.  797  ( 1973) 


Received  April  21.  1993 


Asymptotic  Structure  of  the  Kohn-Sham  Effective 
Potential  at  Metal  Surfaces 


MANOJ  K.  HARBOLA*  and  VIRAHT  SAHNI 

Department  of  Physics.  Brooklyn  College  of  the  City  University  of  AVii  York.  Brooklyn.  .Veil  York 
1 1210.  and  The  Graduate  School  and  University  Center  of  the  City  University  of  New  York. 

33  HVw  42nd  Street.  New  York.  New  York  10036 


Abstract 

In  this  article  we  calculate  the  asymptotic  structure  of  the  exchange  potential  for  the  Fermi  level 
electron  in  the  vacuum  region  outside  a  metal  surface,  and  show  it  to  be  the  image  potential.  In  this 
asymptotic  region  the  exchange  potential  due  to  the  Fermi  level  electron  is  equivalent  to  the  optimized 
potential  of  exchange-only  density-functional  theoi^.  Thus,  the  asymptotic  Kohn-Sham  effective  potential 
at  a  metal  surface  is  the  image  potential,  and  arises  solely  due  to  correlations  which  result  from  the  Pauli 
exclusion  principle  ©  1993  John  W;ley  &  Sons.  Inc 

Introduction 

In  density-functional  theory  [1,2],  the  electrons  of  a  nonuniform  system  in  the 
presence  of  an  external  potential  are  treated  as  noninteracting  quasi-particles  moving 
in  a  local  effective  potential.  This  effective  potential  is  the  sum  of  the  classical 
Hartree  electrostatic  and  exchange-correlation  potentials,  with  all  the  many-body 
effects  including  those  due  to  the  correlation  contribution  to  the  kinetic  energy 
being  incorporated  in  the  latter.  The  many-body  potential  is  derived  by  application 
of  the  variational  principle  for  the  energy  to  be  the  functional  derivative  of  the 
exchange-correlation  energy  functional  of  the  density.  The  asymptotic  structure  of 
the  effective  potential  in  the  classically  forbidden  region  is  that  of  the  exchange- 
correlation  potential  since  the  Hartree  potential  decays  exponentially  as  the  density 
in  this  region.  In  this  article  we  address  the  issue  of  the  structure  and  physical  origin 
of  the  effective  or  equivalently  the  exchange-correlation  potential  for  asymptotic 
positions  of  an  electron  far  in  the  vacuum  outside  a  metal  surface. 

In  the  literature,  semiclassical  calculations  [  3-6  ]  obtain  the  asymptotic  structure 
of  the  exchange-correlation  potential  at  a  metal  surface  to  be  the  image  potential 
- 1  /4x  for  an  electron  at  a  distance  x  from  the  metal  surface.  In  these  calculations 
the  asymptotic  electron  is  treated  as  an  external  test  charge  distinct  and  independent 
from  the  inhomogeneous  electron  gas  at  the  metal-vacuum  interface.  As  a  con¬ 
sequence,  the  asymptotic  image-potential  structure  of  the  exchange-correlation  po¬ 
tential  is  attributed  to  being  due  to  Coulomb  correlation  effects. 
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The  asymptotic  electron,  however,  is  not  separate  from  but  part  of  the  interacting 
nonuniform  electronic  system  at  the  surface.  In  recent  work  [7,8]  the  integral  equa¬ 
tion  [6]  relating  the  functional  derivative  of  the  exchange-correlation  energy  to  the 
electron  self-energy  of  many-body  perturbation  theory  has  been  solved  self-consis- 
tently  within  two  approximation  schemes  for  the  self-energy.  The  asymptotic  struc¬ 
ture  of  the  resulting  approximate  exchange-correlation  potential  in  each  case  is 
image-potential-like,  and  in  the  latter  calculation  [  8  ]  this  structure  is  shown  to  be 
due  to  the  Coulomb  correlation  contribution  to  the  self-energy.  In  this  calculation 
[8]  the  self-energy  is  replaced  by  the  first  term  in  an  expansion  [9]  in  powers  of 
the  dynamically  screened  electron-electron  interaction.  Further  approximations 
invoked  are  the  replacement  of  the  response  function  by  that  within  the  random 
phase  approximation,  and  the  assumption  that  the  Kohn-Sham  and  Dyson  equation 
Green  functions  are  the  same.  On  the  other  hand,  the  -1/r  asymptotic  structure 
of  the  effective  potential  in  atoms,  as  obtained  via  the  integral  equation  by  employing 
[6]  the  ♦act  that  only  the  highest  occupied  orbitals  contribute  to  this  structure  is 
shown  [6]  to  be  a  consequence  of  Pauli  correlations.  Thus  within  this  framework, 
the  asymptotic  structure  of  the  effective  potential  of  the  nonuniform  electron  gas 
in  atoms  and  molecules  is  due  to  Pauli  correlations  whereas  that  at  metal  surfaces 
is  a  result  of  Coulomb  correlations. 

In  contrast,  by  application  of  a  physically  based  many-electron  theory  [10,11] 
developed  by  us,  we  have  shown  that  both  the  asymptotic  image  potential  at  a 
metal  surface  [  1 2, 1 3  ]  as  well  as  the  - 1  /  r  asymptotic  potential  in  atoms  [  1 0, 1 1 , 1 4  ] 
are  due  to  Pauli  correlation  effects.  In  this  theory  the  local  potential  representing 
the  correlations  due  to  the  Pauli  exclusion  principle  and  Coulomb’s  law  is  obtained 
as  the  work  done  to  move  an  electron  in  the  force  field  of  its  Fermi-Coulomb  hole 
charge  distribution.  Since  the  total  Coulomb  hole  charge  is  zero,  Coulomb  corre¬ 
lation  effects  do  not  contribute  to  the  asymptotic  structure  of  this  potential.  As 
such  the  physical  origin  of  the  asymptotic  effective  potential  of  all  nonuniform 
electronic  systems  is  universal  in  that  it  is  due  only  to  correlations  arising  from  the 
Pauli  exclusion  principle.  Thus  the  asymptotic  structure,  which  arises  as  a  result 
of  the  Coulomb  interaction  between  the  asymptotic  electron  and  the  Fermi  hole 
charge  distribution,  is  exactly  known.  In  the  theory  the  asymptotic  electrons  are 
explicitly  treated  as  part  of  the  nonuniform  electronic  system,  and  no  approximations 
are  made  in  determining  the  asymptotic  structure  of  the  effective  potential.  The 
asymptotic  image  potential  structure  at  a  metal  surface  is  obtained  from  the  delo¬ 
calized  [15]  Fermi  hole  spread  through  the  crystal,  v'hereas  the  - 1  /r  potential  in 
atoms  is  derived  from  the  Fermi  hole  localized  [\6]  about  the  nucleus  of  the  atom. 

In  the  present  work  we  show  by  calculating  the  asymptotic  structure  of  the  rig¬ 
orously  defined  [  1 7,6]  exchange  potential  of  exchange-only  density-functional  theory 
that,  as  is  the  case  for  atoms,  the  asymptotic  effective  potential  at  metal  surfaces  is 
an  exchange  effect.  Furthermore,  this  structure  is  that  of  the  image  potential.  Once 
again,  no  approximations  are  made  in  arriving  at  these  conclusions.  The  calculations, 
performed  within  the  context  of  density-functional  theory,  are  distinct  from  those 
of  our  previous  work  [  1 2, 1 3  ] .  However,  the  conclusions  of  the  two  independent 
formulations  are  entirely  consistent. 
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Exchange-Only  Density-Functional  Theory 

The  exchange  potential  /i,{r)  in  exchange-only  [17]  density-functional  theory  is 
defined  as  that  obtained  by  the  optimized  potential  method  [18]  (OPM).  In  the 
OPM,  the  electrons  in  an  external  potential  K„,(r)  are  assumed  to  move  in  a  local 
effective  potential  r°'’'^(r)  so  that  the  differential  equation  governing  the  electronic 
orbitals  0,(r)  is 

[-  K„,(r)+  FO'‘M(r)].^,(r)  =  £,</., (r).  (1) 

These  orbitals  are  then  employed  to  determine  the  Hartrce-Fock  theory  energy 
which  is  the  expectation  value  of  the  nonrelativistic  Hamiltonian  taken 

with  respect  to  a  Slater  determinant  of  the  orbitals  (^,(r).  The  potential 
is  varied  until  a  minimum  of  the  energy  is  achieved,  i.e.,  the  variation 

5<«>HF/5KOP^(r)  =  0  (2) 

is  performed.  The  resulting  potential  is  the  optimized  potential.  The  exchange  part 
of  this  potential  is  obtained  by  subtracting  from  it  the  Hartree  potential  f  rfr'p(r')/ 
Ir  -  r'l,  where  the  density  p(r)  =  Z,  l</>,(r)|  *.  The  variation  of  Eq.  (2)  leads  [18] 
to  the  following  integro-differential  equation  for  the  exchange  potential: 

2  J  dr'lnAr')  -  v,,(r')]G,(r T)4>,(r')<pt  (r)  =  0  ,  (3) 

where  G,(r,  r')  is  the  Green  function 

G,(r,r')=  2>/r)0;(r')/c,-e,,  (4) 

j 

and  where  u,.,  (r)  is  the  Hartree-Fock  theory  orbital-dependent  potential  [19,20] 
v.x..(r)  =  -J  r')/|r  "  r^r' ,  (5) 

with 

P.x.,(r,  r')  =  2  4>J{r')(Pj(r)(t>,{T')/(t>,{r)  (6) 

J 

being  the  orbital-dependent  Fermi  hole  charge  at  r'  for  an  electron  at  r.  The  OPM 
has  been  applied  to  the  nonuniforrn  electronic  system  at  metal  surfaces  [21]  and 
atoms  [22],  and  leads  to  properties  that  are  essentially  equivalent  to  those  of  Har¬ 
tree-Fock  theory  [23,24].  The  OPM  calculations  [21]  at  a  metal  surface  were,  how¬ 
ever,  not  carried  out  for  asymptotic  positions  of  the  electron  in  the  vacuum.,  Such 
a  self-consistent  solution  of  the  integro-differential  equation  Eq.  ( 3 )  for  the  deter¬ 
mination  of  the  asymptotic  structure  of  the  exchange  potential  would  be  a  numerical 
lour  deforce. 

Now  it  can  rigorously  be  shown  [25]  that  in  the  asymptotic  region  (as  in  the 
vacuum  outside  a  surface  or  far  from  the  nucleus  of  an  atom)  where  the  only 
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contributions  to  the  density  come  from  the  highest  occupied  orbital  0„,(r),  the 
exchange  potential  is  equivalent  to  the  highest  occupied  orbital-dependent 
potential  oC  Hartree-Fock  theory  (see  Eq.  (5)].  In  other  words,  the 

asymptotic  structure  of  the  exchange  potential  at  a  metal  surface  arises  from  the 
Fermi  level  electrons.  We  note  that  it  is  only  in  the  asymptotic  region  where  the 
orbitals  decay  exponentially  is  the  orbital-dependent  potential  e.x,,„{r)  physically 
meaningful.  In  other  regions,  the  potential  can  be  singular  as  at  the  nodes  of  the 
orbitals.  From  the  above,  and  the  fact  that  the  orbital-dependent  Fermi  hole  charge 
integrates  to  unity,  we  see  that  the  asymptotic  effective  potential  in  atoms  goes  as 
- 1  /  rand  is  purely  an  exchange  effect.  In  order,  therefore,  to  determine  the  asymp¬ 
totic  structure  of  the  exchange  potential  iu»(r)  at  a  metal  surface,  we  calculate  the 
asymptotic  structure  of  the  orbital-dependent  potential  v^.,„(r)  corresponding  to 
the  Fermi  level  [19], 


Application  to  the  JelHum  Metal  Surface 


In  the  jellium  model  of  a  metal  surface,  the  Hamiltonian  of  the  system  is 


//=-5ZV?  +  2  Kcx.(r,)  +  |2'- 


r,  -  r,| 


(7) 


where  the  external  potential  f  «i(r,)  =  -/  (/rp+(r)/|r  -  r,  |  is  due  to  the  uniform 
positive  background  ''harge  p+fr)  =  p(?(-.v  +  a)  beginning  at  the  surface  at  .v  = 
a.  and  p  the  bulk  density  of  the  metal.  The  constant  term  corresponding  to  the 
self-energy  of  the  jellium  charge  is  not  included  in  the  Hamiltonian. 

At  a  jellium  metal  surface,  the  translational  invariance  in  the  plane  of  the  surface 
requires  the  wavefunction  of  an  electron  with  momentum  k  to  be  of  the  form 
'l'k(r)  =  'k,,(X|)4>A(.v).  'I'k.fxJ  =  (I/\Ci)exp(/k,|.x,].<l>^(.v)  =  (-V27Z)0a.v), 
where  0a  (a)  is  the  component  of  the  wavefunction  perpendicular  to  the  surface,  k 
and  A  the  momentum  and  position  vectors  in  that  direction,  and  k„  and  Xn  the 
corresponding  components  in  the  plane  parallel  to  the  surface.  This  wavefunction 
is  then  substituted  into  Eq.  (5)  for  the  orbital-dependent  potential  which  in  k- 
notation  is 


i’v,k(r) 


=  -f  -Jl— 

J  |r  -  r’l 


^ir(r)^f'(r')>k,-(r)>k,(r') 


(8) 


where  we  have  multiplied  the  numerator  and  denominator  by  'I'f  (r).  However, 
since  it  is  only  the  component  of  the  electron  momentum  perpendicular  to  the 
surface  that  contributes  to  the  asymptotic  structure,  we  Slater  average  [20]  the 
potential  with  respect  to  the  component  of  the  wavefunction  in  the  plane 

of  the  surface.  This  leads  to  the  potential  i',.a(.\  )  which  is  dependent  only  on  the 
component  k  of  the  electron  momentum.  Thus 
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k;, 

I  ^f,.(X||)^k„(X|,) 

1  r  dr' 

4>?(a-)4>a(.v)  J  |r-r'l 

x(<J>?(.v)4>;(.v')  Z  'I'ir„(x,|)'I'k„(xli)  Z  'I'f-(r')'I'k'(r) 
Z  'I'f„(X„)^-k,/X|,) 

k|, 


(9) 


(10) 


On  solving  the  momentum  space  integrals  analytically  to  the  degree  possible,  and 
rewriting  in  terms  of  the  variables  j-  =  /c/.-x,  y'  =  ki-x',  y||  =  kp\\^,  R  =  I  yJi  I ,  ^  = 
k/k,.-,  q'  =  k'lkf,  Q  =  (I  -  (/-)''*  and  Q'  =  (  \  -  q'^)'^‘  where  ky  =  \laria~'  = 
(97r/4)'^^]  is  the  Fermi  momentum  and  the  Wigner-Seitz  radius,  the  above 
expression  reduces  to 


^.\.Q(.y)  _  _ 

Okr/lw)  ~  3QU,{y} 


8  _  p 
-•)  J-c 


¥My')  f 

Jo 

X  f  dq'4>p{y')<i>Ay)Q'MQ'R)  ^ 

Jo 


QMQR) 


R^iy-Vf  +  R^ 


(11) 


where  Ji(r)  is  the  first-order  Bessel  function.  At  the  Fermi  level  q  =  1,  so  that  Eq. 
(11)  becomes 


v^Ay) 

(3A.>/2ir) 


=  -  777  .  f  dy'My')  {  dq'(i>^iy')(f>g’{y)Q' 


MQ’R) 


^iy  -  v')'  +  R‘ ' 


(12) 


where  we  have  employed  limy^o  Ji(QR)  =  QR/^-  Finally,  since  the  R  integral 
can  be  done  analytically,  we  have  the  orbital-dependent  potential  corresponding 
to  the  Fermi  level  electron  with  momentum  perpendicular  to  the  surf!''’e  [19]  to 


be 


v>.i(y) 

Ok,./2w) 


4  r 

3<^i  (.v) 


dy 


,  4>dy') 
\y-y'\ 


J' 


dq<i>ny')My)i^  - 


(13) 


We  have  calculated  the  potential  u».i(>  )  for  the  semi-analytical  wavefunctions 
generated  by  the  finite  linear  potential  model  [26]  of  a  metal  surface.  These  wav¬ 
efunctions  are  highly  accurate  and  reproduce  (see  the  Appendix  of  Ref.  [13])  the 
results  of  fully-self-consistent  calculations  [  3  ]  for  the  surface  energy  and  work  func¬ 
tion  of  simple  metals.  The  wavefunctions  which  are  exponential  in  the  classically 
forbidden  region  and  oscillatory  in  the  metal  bulk  are 
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(t>kix)  =  sin[A:x  +  5(/c)]0(-x)  +  [B^Aii^)  +  Q5;(f)] 

X  [6(.x)  -  d(x  -  Z>)]  +  Dk  exp{-Kx)6(x  -  b),  ( 14) 

where  k  =  ilE,  k  =  S2{W  -  E),  f  =  (x  -  £'/F)(2F)‘''^,  E  is  the  energy,  Fthe 
field  strength,  W  the  barrier  height,  and  Ai(t)  and  Bi(t)  are  the  Airy  functions. 
The  phase  factor  5(k)  and  the  coefficients  Bi,  Q,  and  A  are  determined  by  the 
requirement  of  continuity  of  the  wavefunction  and  its  logarithmic  derivative  at 
a:  =  0  and  x  =  b.  The  calculations  are  performed  for  a  metal  of  density  corresponding 
to  r,  =  2.0.  The  wavefunction  parameters  are  the  same  as  those  employed  in  our 
previous  work  [12,13]  and  once  again  taken  from  the  Appendix  of  Ref  [13]. 

In  Table  I  we  quote  the  values  of  the  universal  function  i;j  j(y)/(3^/r/27r)  as  a 
function  of  electron  position  y  for  distances  outside  the  surface  ranging  from  y  = 
10  to  >'  =  20.  We  also  quote  in  Table  I  the  values  of  the  function  —  1  /2y  n;  -0.24/ 
X  for  these  positions  of  the  electron.  A  plot  of  these  functions  is  given  in  Figure  1 . 
The  jellium  surface  edge  is  at ;;  =  1 .33.  Note  that  >’  =  lir  corresponds  to  a  Fermi 
wavelength.  Numerical  calculations  beyond  an  electron  position  of  >  =  20  are 
difficult  due  to  the  delocalized  and  highly  oscillatory  nature  of  the  dynamic  orbital- 
dependent  Fermi  hole  charge.  The  further  out  beyond  the  surface  the  electron,  the 
deeper  within  the  metal  this  charge  distribution  is  spread.  For  a  study  of  the  orbital- 
dependent  Fermi  holes  at  a  metal  surface,  we  refer  the  reader  to  Ref  [  1 3  ] .  In  the 
present  calculations,  the  spatial  integral  has  been  performed  up  to  approximately 
75  Fermi  wavelengths  inside  the  metal.  It  is  evident  from  the  results  that  the  potential 
merges  with  the  function  -l/2>'.  This  means  that  for  asymptotic  electron 
positions,  the  total  exchange  potential  which  is  equivalent  to  i^a.i(>’)  there,  will 
go  as  the  image  potential.  Thus,  when  the  asymptotic  electrons  are  considered  as 
part  of  the  nonuniform  system  at  the  metal  surface,  the  asymptotic  effective  potential 
arises  solely  due  to  exchange  effects  and  is  the  image  potential. 


Table  I.  Values  of  the  universal  exchange 
potential  function  tJ,,i(>’)/(3A>/2jr)  and  the 
function  l/2>’  for  asymptotic  electron 
positions  y  (in  units  of  X//2)r)in  the  vacuum 
region. 


.V 

-VxAy)IOkr/2T) 

-\/2y 

10 

0.0592 

11 

0.0524 

12 

0.0470 

13 

0.0421 

14 

0.0384 

15 

0.0352 

16 

0.0326 

17 
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0.0294 
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19 

0.0263 

20 

0.0251 
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y  (units  of  X,f/2ji) 

Figure  1.  Variation  of  the  universal  exchange  potential  function  Vx,i(y)l('ikrl2ir)  as  a 
function  of  electron  position  y.  Here  is  the  orbital-dependent  exchange  potential 
corresponding  to  the  Fermi  level.  The  function  - 1  lly  is  also  plotted. 


We  also  note  that  although  the  exchange  potential  u^,i(y)  due  to  the  Fermi  level 
electron  merges  with  -\l2yhy  y=  19,  it  does  not  mean  that  the  effective  potential 
is  the  image  potential  from  that  point  on.  Rather  it  indicates  that  since  the  potential 
t^x.i(>')  has  the  image  potential  structure,  the  asymptotic  structure  of  the  total  ex¬ 
change  potential  will  be  the  image  potential.  Near  the  surface  other  high  energy 
electrons  which  lie  below  the  Fermi  level  also  contribute  to  the  total  exchange 
potential  so  that  Hx  i^x.i  ty)  there.  Therefore,  in  this  region  the  Kohn-Sham  effective 
potential  will  not  be  image-potential-like.  It  is  only  by  approximately  y  =  40  that 
the  effective  potential  at  the  surface  becomes  [12,13]  the  image  potential. 

Conclusions 

To  summarize,  we  have  shown  in  this  article  that  the  asymptotic  structure  of  the 
density-functional  theory  effective  potential  at  a  metal  surface  is  the  image  potential, 
and  that  it  arises  solely  due  to  the  effects  of  Pauli  correlations  between  the  electrons. 
The  calculations  are  rigorously  quantum-mechanical  in  that  the  asymptotic  electrons 
are  treate.’  as  part  of  the  nonuniform  system  with  the  electronic  wavefunctions 
being  explicitly  employed  to  determine  the  potential.  These  results  are  entirely 
consistent  with  the  conclusions  of  our  previous  work  [  1 2, 1 3  ]  in  which  the  asymptotic 
effective  potential  was  obtained  by  calculating  the  work  required  to  move  an  electron 
in  the  force-field  of  the  (orbital-independent )  Fermi  hole  charge  distribution.  Thus, 
we  note  that  two  distinctly  different  quantum-mechanical  formulations  lead  to  the 
same  conclusions. 
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We  reiterate,  however,  that  there  are  two  schools  of  thought  on  the  fundamental 
issue  as  to  the  physical  origin  of  the  asymptotic  image  potential  at  a  metal  surface. 
Whereas  our  present  and  previous  work  [12,13]  shows  this  asymptotic  structure  to 
be  due  to  Pauli  correlations,  the  work  of  Eguiluz  et  al.  [7,8]  indicates  it  to  be  a 
Coulomb-correlation  effect.  Further  research  is  evidently  required  to  better  under¬ 
stand  the  source  of  this  difference,  and  on  this  we  are  in  agreement  with  these 
authors  (see  footnote  23  of  Ref.  [8]).  On  the  other  hand,  the  conclusions  of  sem- 
iclassical  theories  have  little  bearing  on  this  issue.  This  is  because  the  asymptotic 
electron  is  treated  in  these  calculations  as  an  external  test  charge  that  is  distinct 
from  the  inhomogeneous  electron  gas  at  the  metal  surface.  The  resulting  image 
potential  structure  of  the  asymptotic  effective  potential  can  therefore  only  arise  as 
a  consequence  of  correlations  due  to  Coulomb’s  law. 
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Correlation  Corrected  Band  Structures  of  Quasi  ID 
and  2D  Periodic  Systems  and  Level  Distributions  of 
Disordered  Chains;  New  Method  with  Correlation 
for  Dynamic  Nonlinear  Optical  Properties 
of  Periodic  Polymers 
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Abstract 

In  this  paper,  methods  of  correlation  corrections  for  ID  and  2D  periodic  svstcms  as  well  as  for  ID 
nonperiodic  poNmers  arc  reviewed  These  procedures  include  not  onl)  the  correlation  correction  of  the 
total  energy  per  unit  cell,  but  also  the  recalculation  (using  the  generalized  electronic  polaron  model)  of 
the  band  structures  themselves  in  the  periodic  case,  as  well  as  the  density  of  states  (txis)  in  nonperiodic 
chains  In  both  cases  the  inverse  Dyson  equation  was  used  in  an  iterative  way  in  the  mp/2  and  mp/3 
level.  The  programming  of  the  same  procedure  for  the  coupled  cluster  method  is  in  progress  In  a  number 
of  examples  (ground  state  properties  of  (SN)*.  the  gap  of  alternating  (ru/u-polyacctylene.  the  exciton 
spectrum  of  diacetylene.  polyglycine  and  polyalaninc.  of  a  cyiosinc  stack,  the  bulk  modules  of  polyethylene, 
the  hopping  conductivity  of  insulin ).  we  have  obtained  quite  good  agreement  with  expenment,  if  a  good 
basis  set  ah  iniiio  calculation  was  supplemented  of  the  correlation  correction  at  the  bands  or  DOS  curves, 
respectively.  In  the  second  part  of  ihe  paper  a  new  method  is  developed  to  take  into  account  the  simul¬ 
taneous  effect  of  a  stati '  and  a  periodic  time-dependent  electric  field  on  the  one-electron  wave  functions 
and  on  the  total  electronic  energy  of  a  penodic  polymer.  The  method  uses  a  variational  technique 
(coupled  Hartree-Fock  equations)  instead  of  perturbation  theory  for  the  time-dependent  one-electron 
wave  functions.  The  proper  treatment  of  the  potential  caused  by  the  field,  which  in  this  way  does  not 
destroy  the  periodicity  of  the  polymer,  as  well  as  the  inclusion  of  correlation  at  the  mp/2  level  of  the  1- 
and  w-dependent  quasienergy  per  unit  cell  is  part  of  the  method  Finally,  it  is  shown  how  one  obtains 
in  the  usual  way  the  different  polarizabilities,  second-  and  third-order  hyperpolarizability  tensor  elements 
with  the  help  of  the  derivatives  of  the  quasiencrgics  according  to  the  different  field  (both  static  and  time- 
dependent)  components,  c  I99.t  John  Wiley  &Sonv.  Inc. 


Introduction 

The  ah  mil  to  Hartree-Fock  (FIF)  theory  (SCF  lcao)  crystal  orbital  (CO)  theory 
has  been  known  since  the  early  work  of  Lowdin  [1].  Our  work  and  the  work  of 
Andre  et  al.  for  ID.  2D.  and  3D  periodic  systems  was  first  published  in  1967  [2]. 
and  It  has  become  since  that  time  a  standard  procedure  [3(b).(c)].  Closely  related 
work  was  undertaken  also  in  1969  by  Kunz  [3(a)].  Computer  codes  also  for  3D 
systems  have  been  developed  by  Pisani’s  group  (Crystal  80.  90.  92)  [4]. 
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As  the  next  step,  methods  based  on  the  generalization  of  the  electronic  polaron 
idea  [5]  were  applied  to  correct  the  band  structures  themselves  in  the  MP/2  [6], 
mp/3  [7],  and  in  the  coupled  cluster  (CCDD)  approximations  [8]  (with  localized 
orbitals  [9]).  For  these  calculations  the  inverse  Dyson  equation  [10]  (see  below) 
was  used. 

For  the  determination  of  the  electronic  density  of  states  (DOS)  of  disordered 
polymers,  the  so-called  negative  factor  counting  (  nfc  )  procedure  ( based  on  Dean’s 
negative  eigenvalue  theorem  [11])  was  applied  both  in  its  simple  [12]  and  matrix 
block  form  [13],  The  latter  method  has  also  been  developed  in  a  form  that  also 
takes  into  account  correlation  [14]  and  has  been  applied  for  several  disordered 
polymers  [15], 

In  all  cases,  when  the  calculations  could  be  performed  with  a  good  enough  basis 
set  and  correlation  corrections  were  applied,  for  unrelated  completely  different 
physical  properties,  results  in  quite  good  agreement  with  experiments  were  obtained 
(for  several  examples,  see  the  next  Section). 

Short  Review  of  the  Treatment  of  Correlation  in  Periodic 
and  Disordered  Polymers 

We  can  write  the  Dyson  equation  in  a  matrix  form  as 


G  =  Go  +  GqVG  ,  ( 1 ) 

where  G  is  the  Green’s  matrix  of  the  perturbed  and  Go  the  same  quantity  for  the 
unperturbed  system  while  F  is  the  perturbation  matrix.  Expressing  G  from  (1), 
one  obtains 


G  =  (1 -GoV)-Go.  (2) 

The  poles  of  G  give  the  quasiparticle  (QP)  (correlation  corrected  energies)  of  the 
system: 

0  =  [(1 -GoV)-'Go]-'  (3) 

Writing  out  the  inverse  matrix  results  in 

0  =  Go-'(1  -GoV)  =  G„-‘-V.  (4) 

In  the  diagonal  approximation  [10],  one  can  write  for  a  state  I  {i,k) 

0  =  01, -c'jHF,,  01,  =c7+K„.  (5) 

which  is  the  inverse  Dyson  equation  where  oi,  is  the  QP  and  c,  the  HF  energy, 
respectively,  and  the  form  of  I',,  has  to  be  determined  by  a  (size-consistent)  ap¬ 
proximation  used  for  the  correlation,  that  is,  mp[,^*(oi,).  mp,7’(oi,)  +  MPj‘’'(oi,) 
cesD,  ccdd(2^[;(oi,)  +  2jj;(oi,)),  etc.  Here  mp,’,‘’(oi,),  mp,?'(oii)  are  the 
Moeller-Plesset  [16]  expressions  for  state  I,  respectively,  2^(oi,)  +  2^[|(o},)  the 
electron  and  hole  self-energies  obtained  for  the  state  1  in  a  given  approximation  of 
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the  coupled  cluster  theory  (see  Eqs.  (9),  ( 10),  (21 ).  and  (22)  of  Ref.  [9]  for  the 
CCDD  case  [17]). 

To  keep  the  discussion  simple,  we  write  down  explicitly  only  the  expression  for 
MPfi(wi)  (mp/2): 


MPu’(ai,)  + 


2 

JGocc 

/l.iiGunocc 


^'/J4b{  2  Vij  IB  ~  I ) 

w,  +  ef- e ef 


+  z  •••• 

J..H.B 

yeunocc 


(6) 


Here  Vu  ,b  is.  in  the  Bloch  orbital  basis,  the  usual  two-electron  integral 

VuAB  =  1  HA2)\\lr,2\4>A  1  )<i>B(2))  (6') 

One  has  to  substitute  (6)  into  (5)  and  iterate  until  SCF  aj|  values  are  obtained. 

In  this  way.  for  the  fundamental  gap  of  alternating  /ran.s-polyacetylene  in  the 
MP**’  approximation  1.8  eV  [7]  and  in  the  mp*’’  +  mp'^’  approximation  2.2  eV 
were  obtained  [7]  in  quite  good  agreement  with  experiment. 

One  should  mention  that  if  one  is  interested  only  in  ground  state  properties  of 
a  polymer  near  to  its  equilibrium  geometry,  one  obtains  quite  good  results  already 
with  a  better  basis  set  HF  calculation.  Thus  we  have  obtained  good  agreement  with 
experiment  in  the  case  of  (SN)^  [18]  for  the  effective  mass  and  density  of  states  at 
the  Fermi  level  as  well  for  the  charge  transfer  from  S  to  the  N  atom  using  a  Gaussian 
lobe  valence-split  basis  set  [19]. 

Using  the  above  described  formalism  in  the  mp  level  as  starting  point.  Suhai  has 
also  obtained,  with  the  aid  of  a  generalized  ah  initio  form  of  the  intermediate 
exciton  theory  [  20.2 1  ] ,  quite  good  agreement  with  experiment  for  the  exciton  ener¬ 
gies  and  intensities  for  polydiacetylenes,  polyethylene,  and  a  cytosine  stack  (showing 
the  experimentally  well-established  hyperchromicity  of  DNA)  for  polyglycine  and 
polyalanine  [22]. 

We  have  performed  also  qp  band  structure  calculations  for  different  periodic  2D 
systems  (Hi  lattice,  graphite,  BN  layer)  [23].  The  obtained  gaps  seem  to  be  again 
close  to  the  experimentally  measured  ones  (in  graphite  the  part  of  k  space  is  meant, 
where  the  gap  is  not  close  to  zero). 

In  the  case  of  polyethylene  a  good  basis  (6-31G**  )  -F  correlation  (MP/2)  cal¬ 
culation  has  provided  also  very  good  agreement  with  experiment  for  the  bulk  mod¬ 
ulus  (theoretical  value  303  GPa,  experimental  ones  250-350  GPa)  [24]. 

Turning  now  to  disordered  quasi  ID  polymers,  the  most  accurate  method  seems 
to  be  the  NFC  one  [12].  To  describe  this  briefly  one  starts  from  the  tridiagonal 
Hiickel  determinant  of  a  finite  but  long  disordered  chain  {N  =  300-10000  units) 


|H(X)|  = 


«!  —  X 
1^2 
0 


1^2 

CC2  —  X 


Hi 


0 

Hi 

wj  —  X 


0 

Ha 


0 


=  0.  (7) 


I  /3.v«.v-Xi 

The  Hiickel  parameters  a  and  H  can  be  determined  on  the  basis  of  the  band  positions 
(a)  and  bandwidths  (d)  of  ah  mil  toco  calculations  of  the  components  of  the  chain. 
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repeating  them  periodically.  The  determinant  (7)  (if  we  would  know  its  roots  X) 
could  be  written  in  the  form 

A 

|H(X)|  (8) 

y=i 

On  the  other  hand,  with  the  help  of  a  successive  Gaussian  elimination,  ( 7 )  can  be 
written  in  a  didiagonal  form.  Then  we  have,  instead  of  (8), 

A 

|H(X)1  =  Z  e,(X).  (9) 

1=  I 

where  the  diagonal  elements  of  the  determinant  £(  X)  (at  a  parameter  value  X)  can 
be  computed  with  the  help  of  the  recursion  relation  ( which  follows  from  the  Gauss¬ 
ian  eliminations) 

c,{X)  =  a,  -  \  .  (10a) 

Ci(X)  =  «|  —  X  .  ( 10b) 

This  means  that  if  we  choose  a  certain  parameter  value  Xj  in  any  case  when  an 
eigenvalue  Xj  <  X|,  one  obtains  a  negative  factor  in  (8).  Since  Eqs.  (8)  and  (9)  are 
equal,  the  number  of  negative  factors  for  the  same  X  has  to  be  the  same  also  in  (9) 
(which  can  be  computed  easily).  Choosing  another  trial  X2,  one  obtains  another 
number  of  negative  factors  in  (8)  and  (9).  In  this  way  one  can  determine  the 
number  of  eigenvalues  Xj  which  lie  between  the  two  trial  values.  Scanning  in 
this  way  the  whole  spectrum  and  taking  a  dense  grid,  one  obtains  a  good  DOS 
histogram  [12]. 

One  can  prove  [13]  that  if  one  has  matrix  blocks  instead  of  simple  matrix  elements 
in  (7).  relations  ( 10a)  and  (10b)  still  hold,  if  we  put  into  them  instead  of  elements 
the  corresponding  blocks: 

V,(X)  =  A,  -  XS,  ~(B;  -  XQ;)U-',(X)(B,  -  XQ,),  (lla) 

U|(X)  =  A, -XS,  (11b) 

Here  the  A,  -  s  are  the  diagonal  blocks  which  occur  in  the  determinant  (since  in 
this  case  we  have  an  ah  mitio  formulation  both  at  the  diagonal  and  off-diagonal 
part  of  the  determinant  not  simply  X  but  its  product  with  the  corresponding  overlap 
matrices  S,  and  Q,,  respectively,  occurs)  and  the  off-diagonal  matrix  blocks  B, 
describe  the  interaction  with  a  unit  with  its  first  neighbor.  For  actual  calculations 
in  the  first  neighbor’s  interactions  approximation,  one  has  to  perform  dimer  cluster 
calculations.  For  instance,  if  the  chain  starts  with  the  units  AB,  one  has  to  perform 
an  ah  initio  HF  calculation  for  it: 

/  -  X.S'"  F-*"  -  XS-'"\ 


(12) 
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Here  F '  and  are  A,  and  A^.  respectively.  F =82,8^  and  S'*  stand  for  S|  and 
82,  lespectively.  and  finally  8  '"  =  Q2. 

If  the  chain  continues  with  a  unit  C.  one  has  to  perform  a  cluster  calculation  for 
the  dimer  BC.  One  can  build  up  in  this  way  the  full  matrix  of  th.  whole  chain 
from  the  matrix  blocks  belonging  to  the  «uosequent  dimers.  One  should  mention 
that  to  join  at  B  the  AB  and  fiC  clusters,  one  has  to  take,  for  F"  -  X8®,  F^  -  X8^, 
where  F"  =  ^(Ff  +  F?)  (the  Fock  matrix  blocks  corresponding  to  the  unit  B  in 
the  two  different  clusters  .-t/i  and  BC,  respectively,  which  are  not  the  same).  Further 
if  the  chain  does  not  show  a  simple  translational  symmetry  (like  in  a  polynucleotide 
or  polypeptide  helix),  one  has  to  be  careful  to  perform  all  dimer  calculations  in  a 
common  coordinate  system. 

Having  built  up  the  |F  -  X8|  determinant  in  the  described  way  of  the  whole 
chain,  one  can  show  that  [12.13] 


IF  -  X8| 


(X,-  X) 


r 


n  »a(X) 


(13) 


Here  the  s,  are  the  eigenvalues  of  the  matrix  8.  the  >  those  of  8  '''F8'''',  A'  is 
the  number  of  unit  cells,  the/,  a,  c  the  dimensions  of  the  matrices  U,(X).  and  finally 
iiikiX)  stands  for  the  kih  eigenvalue  of  the  matrix  block  U, .  Forming  the  matrices 
C  with  the  help  of  relation  (11).  one  can  diagonalize  them  and  count  the  number 
of  those  t<,K  for  a  given  X  that  are  negative.  In  this  way.  one  obtains  the  number  of 
states  between  two  trial  X  values  originating  from  all  bands.  Scanning  again  the 
whole  spectrum,  one  obtains  now.  in  contrary  to  the  previous  band  case,  the 
total  DOS. 

To  test  this  NCF  matrix  block  method,  first  a  periodic  Gly-Gly-Ser  chain  (in 
1.1.1  composition )  was  calculated,  once  with  the  help  of  the  CO  method  with  periodic 
boundary  conditions  and  once  with  the  matrix  block  NFC  method.  The  DOS  his¬ 
tograms  obtained  have  shown  a  remarkably  good  agreement  [25].  After  this,  the 
sequence  of  the  Gly  and  Ser  units  was  taken  randomly  using  a  Monte  Carlo  program. 
In  this  case  the  DOS  histogram  broadened  in  a  very  significant  amount  [25]. 

Similar  calculations  were  performed  for  aperiodic  DNA  sequences  [26]  (including 
a  part  of  an  oncogene )  and  also  for  a  larger  number  of  aperiodic  polypeptides  [  27  ] . 
If  one  has  at  least  three  or  four  rather  different  units,  the  DOS  curves  of  the  random 
chains  show  in  all  cases  a  very  large  broadening.  This  is  different  to  the  periodic 
case  (like  (ABCD)\).  when  very  narrow  nearly  6-function-type  peaks  occurred 
[  26.27  ] .  The  explanation  of  this  phenomenon  is  that  if  one  has  a  random  sequence, 
a  certain  unit  has  four  different  left  and  four  different  right  neighbors  ( 1 6  possibilities 
with  16  different  perturbations,  including  the  two  cases  when  one  neighbor  is  the 
same  as  the  reference  unit  and  in  one  case  both  neighbors  of  the  unit  are  the  same, 
causing  a  rather  larger  perturbation  of  its  level  scheme).  On  the  other  hand,  if  one 
has  an  (ABCD)\-hkc  system  r  '•wen  unit  sees  always  the  same  two  (and  different) 
neighbors. 

In  the  case  of  the  four-v.<.,ipjncnt  Giy(glycinc)-Ser(scrine)-Cys(cysteine)- 
Asn( asparagine)  300  units  long  polypeptide  [27],  we  have  determined,  with  the 
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help  of  the  inverse  iteration  method  [28],  the  physically  interesting  lowest  25  unfilled 
levels.  Namely,  in  a  DNA-protein  complex,  due  to  the  negative  charge  on  the  PO4 
units  and  to  some  positively  charged  subunits,  like  arginine,  in  nucleohistone,  most 
probably  there  is  a  charge  transfer  from  DNA  to  the  protein.  The  results  have 
shown  that  all  the  states  being  close  to  the  lower  edge  of  the  “conduction  band” 
(and  therefore  below  the  mobility  edge)  were  completely  Anderson-localized  on 
single  amino  acid  residues. 

Generalizing  the  approximate  expression  given  for  the  primary  jump  rates  of 
hoppings  (hopping  frequencies)  by  Mott  and  Davis  [29]  for  the  case  if  there  are 
more  than  one  orbitals  per  site,  one  can  write 

C,''..,C;"<x;Mxf>)- exp 

where  A£„  =  Cj  -  c,  are  the  one-electron  levels  of  the  orbitals  between  w  hich  the 
hopping  occurs.  Finally  v  is  the  frequency  of  acoustic  modes  which  have  values  in 
proteins  between  10  "  and  10  '■  s ;  these  modes  in  a  polypeptide  chain  correspond 
to  the  relative  vibrations  of  the  side  chains.  Taking  hoppings  within  the  same  residue 
and  between  first  and  second  neighbors,  we  have  obtained  values  in  the  order  10 
10^-10'*,  and  10^  respectively  [28].  They  have  the  same  order  of  magnitude  as 
the  estimated  hopping  frequencies  in  good  amorphous  conductors  (silicon  or  chal- 
cogenides  [30]). 

We  have  repeated  the  whole  described  calculation  (nfc  in  matrix  block  form, 
inverse  iteration  for  the  eigenvectors,  and  determination  of  the  hopping  frequencies), 
taking  into  account  also  the  disulfur  bridges  and  its  experimentally  determined 
geometry  for  a  small  protein,  the  hormone  insulin  [31].  The  hopping  frequencies 
were  in  order  of  magnitude  the  same  as  for  the  previously  investigated  four-com¬ 
ponent  random  (artificial)  system.  Applying  these  hopping  frequencies  as  input 
for  the  random  walk  theory  of  Lax  and  coworkers  [  32  ]  ( after  generalizing  it  again 
to  arbitrary  number  of  orbitals  per  site),  we  have  determined  the  frequency- 
dependent  hopping  conductivity  1  (t(w)|  of  this  system.  We  have  found  that  in  the 
frequency  range  betw'een  w  =  10''-10'*  (where  measurements  were  performed) 
|(t(w)|  has  values  between  10  ’  '  cm  '  and  10  '  '  cm  '  [33],  which  fall 

between  the  experimental  values  found  for  the  different  chalcogenides  [34]. 

One  should  mention  that  no  correlation  corrections  were  taken  into  account  as 
first  approximation  in  the  hopping  conductivity  calculation  of  the  disordered  poly¬ 
peptide  insulin  (the  treatment  of  correlation  in  disordered  chains  see  below).  It 
looks  like  that,  since  in  the  <7(0-)  calculation  only  hoppings  between  different  excited 
states  was  taken  into  account  (the  cross  gap  hoppings  would  be  extremely  small 
due  to  the  very  small  Boltzmann  factor  in  this  case),  though  their  spacing  without 
correlation  may  be  too  large,  the  overall  effect  is  not  very  large.  Anyxvay,  one  expects 
that  if  one  takes  into  account  in  a  subsequent  calculation  also  correlation,  the  level 
spacing  would  be  generally  smaller  and  therefore  the  Boltzmann  factors  larger.  This 
could  cause  easily  an  increase  of  |a(u!)l  at  a  given  w  value  of  1  or  2  orders  of 
magnitude.  Even  if  this  were  the  case,  one  would  still  remain  in  the  upper  range 
of  the  measured  values  of  chalcogenides  [34], 


A£„ 
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In  this  way.  we  could  prove  that  there  is  a  nonnegligible  electronic  conduction 
along  a  disordered  polypeptide  chain,  if  it  is  doped  ( which  in  vivo  easily  can  happen ) . 
This  was  postulated  by  Szent-Gyorgyi  [35,36]  already  in  1941  to  explain  different 
biolo^cal  phenomena  (electron  transport  through  proteins  at  oxidation  metabolism 
and  by  photosynthesis,  fluorescence  of  a  chromophore  in  proteins  which  lies  far 
away  from  another  chromophore  which  has  primarily  absorbed  light,  etc.). 

We  can  close  this  review  with  the  observation  that  if  one  performs  a  good  basis 
HE  CO  calculation  for  a  periodic  system,  correcting  well  enough  the  resulting  bands 
for  correlation,  practically  all  kinds  of  physical  properties  of  all  types  of  polymers 
can  be  computed  ( using  for  these  properties  the  right  theory )  in  good  agreement 
with  experiment.  This  opens  the  possibility  of  predicting  polymers  with  three  to 
four  different  unrelated  optimal  properties  ( “tailor-made”  polymers),  which  is,  of 
course,  of  great  practical  importance.  We  are  working  now  in  cooperation  with 
computer  scientists  on  the  acceleration  of  the  different  programs  (direct  SCF  + 
parallelization  +  vectorization  and  some  other  procedures).  In  this  way.  we  shall 
also  be  able  to  perform  the  above-described  calculations  for  polymers  with  larger 
unit  cells. 

Returning  to  the  case  of  nonperiodic  polymers,  one  can  combine  the  matrix- 
block  NFC  method  also  with  the  inverse  Dyson  equation.  Namely,  if 

FU  =  SU£.  (15) 

where  the  unitary  matrix  U  contains  the  vectors  u,  which  fulfill  this  equation  and 
the  diagonal  matrix  e  its  eigenvalues,  one  can  write 

(F  +  A)U  =  SUw  (16a) 

with 

A  =  SUVU^S  (16b) 

Here  the  diagonal  elements  of  w  are  the  correlation  corrected  QP  energies  of  the 
disordered  system.  To  prove  the  correctness  of  Eqs.  ( 16),  let  us  substitute  ( 16b) 
into  (16a),  multiply  from  the  left  by  ,  and  take  into  account  the  normalization 
condition  U''SU  =  1.  In  this  way  we  obtain 

UTU  +  U^SUVU^SU  =  U^SUco  (17a) 

c  +  V  =  aj,  (17b) 

which  is  the  well-known  inverse  Dyson  equation,  for  which  we  can  write  in  the 

diagonal  approximation 


I  n  -  wi  (1  -  l<i)  ■ 
This  provides  in  the  mp/2  approximation 


(18) 


-  e'"+  Mp!i'’(a)|)  . 


(19) 
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For  the  actual  calculation  of  aperiodic  polymers,  one  has  to  perform  ab  initio 
cluster  calculations  for  dimers.  For  instance,  for  an  A B  dimer,  we  have  now  to 
write  instead  of  Eq.  ( 12) 

F(AB)  +  MAB)  -  \S{AB) 

_  /  F'*  +  A''  -  AS'*  F'*"  +  A  ■'®  -  XS-'"\ 

~  F"  +  A^-\S«  /■ 

From  the  dimer  clusters  matrices,  one  can  construct  again  the  determinant  of 
the  whole  chain,  and,  for  its  diagonal  blocks  (after  successive  Gaussian  elimination ), 
one  has  now  with  the  notations  A,  +  A,  =  A,.  B,  +  A,+i  =  B, ,  A,  =  F',  B,  = 
F'-'"'.  A,*,  =  A'-'-", 

U,(X)  =  A,  -  XS,  -  (B/  -  XQDUr-'.CB,  -  XQ,) .  (21) 

Diagonalizing  the  blocks  0,  ( X ),  we  can  count  again  the  negative  eigenvalues  ( X ) 
at  a  given  X.  Taking  a  fine  grid  in  X,  we  can  scan  again  the  whole  spectrum  and 
find  out  again  the  total  DOS,  but  in  this  case  with  correlation. 

The  described  method  has  been  successfully  applied  to  treat  several  disordered 
polymer  chains  (like  Li — H  chains,  disordered  polypeptides)  with  correlation,  which 
resulted  again  in  the  decrease  of  the  gap  between  the  “valence  levels”  and  “con¬ 
duction  levels”  region  and  decr(.:.sed  also  the  width  of  the  allowed  energy  regions 
in  the  DOS  histogram. 

A  New  Correlation  Corrected  Variation-Perturbation  Method 
for  the  Calculation  of  Dynamic  Polarizabilities  of  Polymers 

General  Formalism 

In  a  previous  paper  [37],  a  theory  for  the  interaction  of  a  laser  pulse  with  a 
periodic  quasi  1 D  polymer  was  developed  on  the  basis  of  the  coupled  Hartree- 
Fock  equations.  This  method  has  considered  the  simultaneous  effect  of  the  electric 
and  magnetic  field  of  the  laser,  and  therefore  the  resulting  equations  have  become 
rather  cumbersome.  Furthermore,  the  perturbation  term  in  the  Hamiltonian  due 
to  the  electric  field  {E{r,  t  )7)  was  not  partitioned  into  a  part  which  shows  lattice 
symmetry  and  to  another  one  which  causes  change  of  H  of  the  electrons  [38,39]. 
This  method  has  been  already  successfully  applied  for  the  calculation  of  the  static 
(hyper)polarizabilities  of  different  infinite  polymers  ((H2)a(H20)a  and  (LIHIa  ) 
chains  [40]. 

The  other  problem  that  was  not  addressed  in  the  previous  paper  was  the  inclusion 
of  correlation  effects  in  the  laser  light-polymer  interaction.  To  solve  this  formidable 
problem,  we  return  to  the  case  of  the  perturbation  of  a  periodic  polymer  by  a  time- 
dependent  periodic  homogeneous  electric  field* 


*  To  simplify  the  expressions,  first  we  shall  not  consider  in  addition  to  the  time-dependent  field  an 
additional  static  electnc  field. 
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=  -2  1  £o(c""“'  +  ,  (22) 

m=l 

but  put  the  magnetic  vector  potential  A  =  0. 

It  should  be  pointed  out  that  Rice  and  Handy  [41.42]  have  already  treated  the 
effect  of  correlation  on  dynamic  polarizabilities,  but  only  for  molecules.  Further¬ 
more.  instead  of  applying  a  variational  approach,  they  have  used  perturbation  theory 
for  the  determination  of  the  field-  and  time-dependent  one-electron  orbitals,  which 
is  certainly  correct  in  the  case  of  weak  fields,  but  becomes  questionable  if  one  thinks 
on  a  later  extension  of  the  method  for  interactions  with  intense  laser  pulses.  On 
the  other  hand,  their  perturbation  theoretical  expansion  of  the  one-electron  orbitals 
probably  gives  more  insight  into  the  physical  nature  of  the  response  at  different 
orders. 

The  total  Hamiltonian  of  an  «-electron  system  can  be  written  as 

//  =  //o  +  //'.  (23) 

where  is  the  usual  unperturbed  Hamiltonian  of  a  system  with  n  electrons  and 
//'  will  become,  in  the  case  of  the  expression  ( 22)  of  E  and  A  =  0, 

n  A 

//'(  r . .  £o,  a.,  0  =  - 1  !c|  Z  Z  S(.(f"’’“'  +  C''''^')  ^ 

/-I 

«  A 

= -kl  Z  Z  cosmulEo'r,.  (24) 

1  I 

As  is  well  known  due  to  the  form 

=  e'^'u„[k,r)  (25) 

of  a  Bloch  orbital,  where  k„(  k,T)  shows  lattice  periodicity,  -\e\E7  can  be  rewritten 
as [38-40] 

-  U'l  E7  =  -/jt'j  +  /|c|  EVi .  (26) 

Here  the  first  term  on  the  rhs  is  invariant  under  transformations  of  the  symmetry 
group  of  the  lattice  ( it  can  be  sho"'  i  that  it  does  not  mix  states  with  different  k 
vectors),  while  only  the  second  ont  which  has  no  7-dependence  can  have  nonzero 
matrix  elements  between  Bloch  orbitals  with  different  k  vectors  [39].  Thus,  if  one 
is  interested  only  in  the  polarization  effect  of  the  field  (which  is  usually  the  case  in 
nonlinear  optics),  one  has  to  include  in  /?'  only  the  first  term  of  the  rhs  of  (26). 
Thus  we  can  write 

n  A 

//'=  -/U’l  Z  Z  cosmw/ .  (27) 

*=  I  m=  I 

We  can  now  proceed  in  the  usual  way  applying  Frenkel's  variational  principle 
[43]*: 


•  Lowdin  and  Mukhcrjcc  (44)  have  shown  that  by  the  variation  of  the  expression  in  (27).  one  really 
obtains  a  stationary  state  if  the  norm  of  <b  exists. 
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Ip  {^\H-dldt\^)di 

~  rJo  <<i'|<i>> 


(28) 


Here  //  is  defined  through  Eqs.  ( 23 )  and  (27)  and  for  the  field-  and  time-dependent 
total  n-electron  wave  function 


. r„.  £o.  w.  /  ) . 


we  apply  the  Ansatz 

. O..  £o,  w.  /)  =  n  /.  £■(>•  w)  (29a) 

1=1 


with 

^,(  r,.  /,  %.  (ji) 

\ 

=  V5<(7,)  +  2  [-^V!;.,„(7,.  £o,  w)t>''"'*'  +  AvJ-,„(r,.  aOe''"’-']  -  (29b) 

/IJ*  1 


Finally.  H  i,  is  the  eigenvalue  of  Ho- 

Substituting  (29a)  with  (29b)  into  (28)  and  performing  all  the  operations  and 
the  variations  according  to  the  unknown  one-electron  functions  and 

AipZm-  respectively,  one  obtains  the  following  coupled  Hartree-Fock  (rpa)  equa¬ 
tions  (up  to  quadratic  terms  in  Stpf,,,,)  [45]  for  their  determination 

(f„  -e,±  /??w)|A^,-,„(T,.  £o.  w)>  +  h\(p,{r,)) 


n/2 
+  1 

L 


<Pj('r2) 


2  - 


ri2 


A<p,'.,„(  y 


2,  £o.  w)| 


+ 


/ 

1  -  P,^2 

\  M 

<PAr2)) 

\ 

fi; 

/’/J 

/  =  1 .  2 . n  /«  =  1 ,  2 . N  .  ( 30 ) 


Here  £|„.2  is  the  exchange  operator,  c,  the  eigenvalues  of  the  unperturbed  Fock 
operator  of  the  problem. 


n/2 

fi>  =  2 

I  I 


2  -  £,„= 


T|2 


^/(tz))  +  H ' 


(31a) 

(31b) 


({2A'  +  1 )  is  the  number  of  unit  cells  in  the  polymer.  A’„  the  n.  ..iber  of  nuclei  in 
the  unit  cell,  r,.  the  charge  of  the  oth  nucleus  and  Ri,  its  position  vector  in  the  /th 
cell.]  Furthermore,  in  this  case. 


QUASI  ID  AND  2D  PERIODIC  SYSTEMS 


121 


Introducing  an  lcao  expansion  for  the  functions 


^<ph„=  2  2  Cl,(2o.aj)X?(r). 


where  m  is  the  number  of  basis  functions  in  the  unit  cell,  X?  is  the  ith  AO  in  the 
qXh  cell.  Substituting  (33)  into  (30),  one  easily  arrives  at  the  matrix  equation 


/A-„,  B,  /D,\ 

\b.  ArJ\c:J  IdJ' 


=  <x?i(A  -  c,  ±  /»ai)|x;>  +  b»,i. 


=  2  X?(T,)X?{T:) 


12  - 


>P,(ri)<p,('r2) 


Dl  =  (x‘i\h\<p.). 


Equation  (34)  can  be  rewritten  as 

A-„,Cr.m  +  =  D,  .  (36a) 

+  Alrndj,,  =  D,  .  (36b) 

Since  the  matrices  Al,„  and  B,  are.  in  the  case  of  a  linear  chain  with  periodic 
boundary  conditions,  cyclic  hypermatrices,  they  can  be  block-diagonalized  with 
the  help  of  the  unitary  matrix  U  (the  p,  ^th  block  of  U  is  =  exp[/27rpqr/ 
(2(V+1)]1[1] 

U*A;,,A„,UU^C,-,„,  +  U"B,UU*C;,,„  =  U^D,  .  (37a) 

U"B,UUX;.„,  +  UA;.„,UU*C;.,„  =  U^D,  .  (37b) 

Using  the  notations  A,.””  =  U^A,  „,U,  etc..  Ff_,„  =  U^C^,„,  and  G,  =  U'^D, ,  we 
can  write 

A,.r „  +  B?‘^F;,„  =  G,,  (38a) 

Br‘’F-,„  +  A;.»'’F;,„  =  G,  .  (38b) 


This  system  of  equations  can  be  reduced  easily  to  such  matrix  equations  in  which 
each  matrix  has  only  the  order  ifi  X  ifi.  For  A  -►  x ,  one  can  introduce  the  con¬ 
tinuous  variable  k  =  2Trp/a(2N  +  1 )  (ifp  runs  from  -N  to  N,  the  quasicrystal 
momentum  k  will  have  the  range  —irla  <  A  <  ir/a  [1,2]  if  a  is  the  elementary 
translation).  In  this  case  we  can  write  [1,2] 


A, ,,„(A)F,.„,(A-)  +  B,(A)F;,,„(A)  =  G,(A) . 

B, (A)F,.„,(A)  +  A;.„,(A)/-V.„.(A)  =  G,(A) . 


(-39a) 

(39b) 
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Putting  back  the  two  equations  in  the  hypermatrix  form,  one  obtains 


/A-„,(*) 

B,(A')  UFr,„(A)\  _ 

_  /G.{k)\ 

1  B,(A:) 

Kn.(k)l\¥Uk)l 

\G.(k)l 

which  can  be  solved  at  each  point  k  and  values  Eo  and  w  to  obtain  the  coefficients 
C^%(k,  Eo,  w).  For  this  we  need  the  back  transformation* 

C,>,(A-,  Eq,  oi)  =  UF,o„(A:,  £0.  ■ 

Furthermore,  from  the  unitary  transfo  nations  preformed  in  Eqs.  (37),  it  follows 
[  1  ]  that 

Af.„.(A:)=  2  (41a) 

S 

B,{k)=  2  e'‘«"B,(<?),  {41b) 

q=-S 

S 

G,(A')=  2  (41c) 

-,v 

Finally  the  lcao  crystal  orbitals  can  be  written  in  the  presence  of  the  electnc 
field  £„  as 

!p,{k,  Eo,  u),  t,  r) 

«  S  m  S 

=  2  2  c'‘‘"'[Q.(A-)+  2  [C,t„,,(A:,  £0, 

m.| 

+  Cl„,Ak,  Eo,  a))e-'''’“']]X?(  r) , 

(42) 

where  the  coefficients  C.Jk)  belong  to  the  lcao  expansion  of  the  unperturbed 
crystal  orbital  <p,(,k,r). 

We  can  define  £  instead  of  ( 23 )  in  the  more  general  form 

yv 

£  =  £„  +  £„  =  £5.  +  I  2  £o(c""“'  +  ,  (43) 

m-l 

including  the  Static  field  £s,.  ^ 

For  the  determination  of  the  effect  of  £5,  on  the  one-electron  wave  functions, 
one  can  apply  perturbation  theory  [40]  with  the  perturbation  operator 

*  Actually  Cl  and  Cl*  are  those  blocks  of  U  and  U* ,  respectively,  that  are  necessary  for  the  transformations 
of  these  vectors.  For  instance,  in  the  equation 

u*c;^(A)  =  . 

2A<I 

Ci*c, ■;„(*)-  2  u;(A-)c, (A)  =  F, *.„(;■) , 

(-1 

where  U/  ( A )  stands  for  the  A .  /th  block  ( considenng  A  as  a  discrete  variable  which  has  2N  +  I  possible 
values)  of  the  big  matrix  U* 
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//„  =  -/ k  1 (44) 

If  we  apply  again  an  lcao  expansion  for  the  part  of  the  one-electron  wave  functions 
due  to  this  perturbation  (44)  can  be  written  as 

A*  m 

£s.)  =  Z  C'*""  2  C.Ak,  £..)X?(7,) .  (45) 

</*-A  «=I 

We  can  apply  again  the  same  formalism  for  a  periodic  chain  as  before  to  find  out 
the  coefficients  C(A:)  [40], 

Thus  we  can  write  now  for  the  one-electron  wave  functions 
ip,  =  ^,(7,,  u  w,  £5,,  £0)  =  ¥’,(7i,  k)  +  (pr(7|,  £si,  k) 

A 

+  Z  (A^;,„,(7,,  £0,  a>,  A.ic""'"  +  A^-„,(7,,  £0,  w,  A:)^-"™') .  (46) 


Thus  we  can  finally  write 


<p,{r,  /,  «,  k,  £s„  £0) 
A  m 


1 


2A'+  1 


Z  Z^' 

(fs  -A  1 


C.Ak)  +  C,Ak,  £s,) 


+  Z  [C,t„,,(Ar,  £0,  a. )£"’“'  +  C,:,„(/c,  £0,  co )?-""“']  ] 

m=l 

(47) 


Definition  of  the  Quasi- Hart  ree-Fock  Equations 

We  can  build  up  the  total  wave  function  of  the  polymer  now  with  the  help  of 
the  generalized  crystal  orbitals  (46)  as 

n 

^(7| . 7„,  k,  t,  w,  £s,,  £0)  =  n  k,  co,  £5,,  £0) .  (48) 

1=1 


With  the  help  of  (48)  we  can  now  define  a  quasienergy  of  the  polymer  in  the 
presence  of  the  static  and  time-dependent  field  as 


W{k,  t,  w,  £si,  £0) 


(^{k)\imk)) 

(Mk)\m)) 


(49) 


*  One  should  point  out  that  if  one  takes  into  account  also  the  effect  of  Ej,.  in  the  coupled  Hartree- 
FocK  equations  (30).  one  has  to  put  instead  of  and  c,,-  respectively,  E„)  and 

c,  +  Aff.  respectively.  Here  Acf  is  the  shift  of  the  eigenvalues  of  the  unperturbed  Fock  operator  c,  due 
to  the  perturbation  [45] 
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Here  the  brackets  mean  only  integrations  according  to  the  spatial  variables  of  the 
/!  =  (2A'  +  1  )/7  electrons  («  is  the  number  of  electrons  in  the  unit  cell). 

Since  the  crystal  orbitals  <^,  still  have  to  be  orthonormal,  (ip,  |<p;)  =  5,,,*  we  can 
write  the  variational  equation 

/.  <0.  /:«)  +  I  |^,>]  =  0  .  (50) 

‘■j 

where  in  (49)  the  total  Hamiltonian  is  defined  in  the  general  case  as 

H  =  //„  +  H\. 

with 

II  .< 

//'  =  -i\e\  2  2  [  £'s.  +  \Eo(e"'-  +  .  (51 ) 

iii^l 

and  7/(1  is  unperturbed  many-electron  Hamiltonian  of  the  poly  mer. 

With  this  definition  of  77,  one  can  obtain  with  the  help  of  the  usual  variational 
procedure  forgiven  1.  uj.  and  k  values  and  for  vectors  Tfj,  and  7f(,.  respectively,  the 
quasi  Hartree-Fock  equations 

f(  £(,.  w.  /.  k)\p,  =  3/  £'s,.  £■(,,  w,  /,  k)^j(7.  u>,Lk,  £s,..  £o)  (52a) 

[ipj  =  ip-,  see  Eq.  (46)]  with  quasi-Fock  operators 

"/2 

F(  Zo.  CO.  /,  k)  =  //•'(  7f„.  CO.  /.  r, )  +  2  (27;  -  A;) .  (52b) 

Here  the  quasi-Coulomb  operators  Jj  and  quasiexchange  operators  K,  are  defined 
in  the  usual  way  but  not  with  the  help  of  the  unperturbed  crystal  orbitals  <Pj,  but 
with  the  perturbed  ones  ip,.  Furthermore. 

^  A* 

fF  =  7/''  2  (  Am  +  jZofe"""'  +  (52c) 

in-  I 

(77''  is  the  one-electron  operator  of  the  unperturbed  polymer). 

Substituting  the  lcao  expression  of(46)  into  (52a)  one  obtains  in  the  usual  way 
the  matrix  equation 

F(A:.  CO.  /,  ^si,  7:o)dj(A.  <o,  t,  %)  =  l,(k.  co.  l,  Tf,,.  7f(i)Sd; .  (53a) 

Here  matrix  elements  of  F  are  defined  as 

A 

[F(A-)]...=  2  (53b) 

0=  A 


*  Even  if  the  oribtals  and  ip  are  oilhonormal.  =  h,,.  their  sum  ip,  docs 

not  form  an  orthonormal  orbital  set  We  can  apple  Schmidt’s  orthogonalization  pren  edure  and  renormalize 
also  the  orbitals  ip, .  In  this  way  we  can  also  write  (v’l  Ip;)  =  ,■ 
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with  /•'given  in  (52b).  S(k)  has  the  elements 

;V 

[S(A-)],,,=  2  (53c) 

It-  “A 

and  the  vectors  d,  are  defined  as 

d,{k,  •••)-C/A:)  +  C,(A-,£„) 

,v 

+  Z  IC;.„,(A,  £o,  a.)c""“'  +  C;,„(A.  £o.  0))^-""“'] .  (53d) 

m-  1 

With  these  definitions  one  can  solve  Eq.  (53a)  for  given  A.  co,  /,  Ej,.  and  Eo  in  the 
same  iterative  way  as  one  does  in  the  unperturbed  case.  In  this  way  one  can  obtain 
the  SCF  quasi-Fock  operator  and  with  the  help  of  (52b)  the  SCF  quasicrystal 
orbitals 

1  ^ 

'^7.  •  •  •  )^'  =  1  e‘^‘<‘‘djAk,  t,  CO,  E„.  /■o)X?( 7 )  (54) 

and  the  scF  quasi  one-electron  energies  These  quantities  are  needed  in  the 
subsequent  treatment  of  correlation. 

One  should  mention  that  in  the  possible  case  if  the  iteration  procedure  for  F 
does  not  converge,  one  can  still  use  the  orbitals  i,  [see  Eq.  (46)]  (after  orthogonal- 
ization  and  renormalization )  as  one-electron  orbitals  for  an  MP/.2  calculation,  and 
one  just  can  approximate  the  quasi  one-electron  energies  e  as 

c,  =  {4'j\hh)-  (55) 


Mceller-Pk’sset  Perturbation  Theory 

After  having  computed  the  quasi  one-electron  orbitals  and  quasi  one-electron 
energies  C;,  one  can  apply  the  MP/2  expression  for  the  second-order  correlation 
correction  of  the  quasitotal  energy  (49)  for  given  A,  cv,  /.  Esi,  and  E,,.  In  this  case 
the  Moeller-Plesset  perturbation  operator  will  be 

//mp  =  2  -  -  Z  Hf,  /.  k.  E,„.  E„)  =  I  -  -  «E  (56) 

KJ  OCC  !</  ^'1 


(n  =  h{2N  +  1 ).  With  the  definition  (56),  one  can  derive  in  the  standard  way  [46] 
the  second-order  correction  to  the  quasitotal  energy  (for  given  /,  cv,  Ej,,  and  Eo) 


MP  Z/ 

I.J.  I.ll 

Here,  of  course. 


KiA/Iti.  •  •  ■K;{r,  •  •  •)|(l/ri2) 

X(1  -.P,...;)|^,(7,.  ■  •  •)^/,(7,.  •  •  •)>! 


Cl  +  Cj  ~  C  ,  -  Cl, 


(57) 


MP^  *  ' 


CO.  E„.  E„),  because  the  perturbed  crystal  orbitals  iA/It, 
•  •  • )  and  quasi  one-electron  energies  depend  on  the  same  variables  ( this  is  meant 
in  the  argument  of  the  lA/,  etc.  by  the  three  points).  The  A-dependence  is  cx- 
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pressed  by  the  combined  indices  /  =  i,  k,,  etc.  Therefore,  the  fourfold  summation 
includes  the  threefold  integration  over  k  (because  of  the  conservation  of  mo¬ 
menta  A',  +  kj  =  A„  +  kh)  (for  the  details,  how  one  calculates  the  matrix  elements 
('Pi'PA  I'I'.ti'n)  occurring  in  (57)  see  again  [46]). 

After  having  calculated  £*„‘p,  we  can  write  for  the  total  energy  with  second-order 
correlation  correction 

E(i.  «.  £„)  =  ^  fr[A-,  £  o),  £s„  £o)r/A-  +  £‘„V(/,  a,,  £„,  £o) .  (58) 

irr  J-,/« 

The  development  of  the  program  package  for  the  rather  complicated  formalism 
developed  here  and  in  the  previous  two  subsections  is  in  progress  in  our  laboratory. 

The  Calailaiion  of  the  Polarizahtlities  and  Hyperpolarizabililies 

Having  the  quasi  total  energy  (58),  we  can  formally  express  it  in  the  presence 
of  the  electric  field 

S  A 

£  =  £„  +  £,,  =  £m  +  i  >3  £o(c"”“'  +  =  £s,  +  2  Eo  cos  wwt  (59) 

m»  I  1 

as 

£(/.  01,  £„.  £o)  =  £'“>+  £*;;+  +  E^a(o>)E 

+  |E^i8EE  +  ^E^7J!i-E+  •  •  •.  (60) 

where  £‘”’and  Ewpare  the  Hartree-Fock  and  mp/2  energies  of  the  polymer  in  the 
absence  of  the  field  and 

a.\  ~  dsiEsi  +  2  Eji^sjEst  +  —  EjjYsiEsiEsi  +  •  •  •  (61 ) 

is  the  part  of  the  induced  dipole  moment  due  to  the  static  field.  It  can  be  easily 
calculated  with  the  aid  of  the  expression 

«)  =  2:pZ  f  /l<P,(r„A')  +  v’fCr,.  £51,  A-)] 


X  2  ^  £,A  )  +  (/?]'(£.  E'si-  ^') 


dk. 


(62) 


where  the  dipole  moment  operator  p,  =  U'|  r, . 

Using  express'  ns  (49),.  (58).  and  (59),  one  can  apply  the  time-independent 
Hcllmann-Feynmann  theorem  [46] 


dE 

d£M 


<$!d///d£;,l<i>) 


(63) 
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(for  the  conditions  of  its  validity,  see  [42.44]).  K  re  E  is  the  total  energy  [see  Eq. 
(59)],  4>  the  many  electron  wave  function  [see  Eq.  (48)]  and  //  the  full  perturbed 
Hamiltonian  [see  Eq.  (51 )]  (if  we  neglect  overtones  in  the  expression  of  that 
is,  we  put  m  =  1 ).  On  the  basis  of  Eq.  (63).  we  can  write  following  [42]  but  using 
our  notation 


Mo  = 


dE 

BE. 


a"'‘(0,  0)  =  - 


St.X 


d-E 


/3"''‘'(0;  0,0)  =  - 


d£'s,.xd£'si.M  t-:=,o  ' 
d^E 

dEa.xdEa.^dE^,,,. 


to)  cos  U)t  =  - 


d-E 


dEa_xdE^„ 


^/3^‘“'(0;  «)  +  ^/3^'"  (-2co;<o,  co)cosco/  =  - 


d^E 


dE,,xdE^,dE^„. 


(64a) 

(64b) 

(64c) 

(65a) 

(65b) 


l-o 


and  finally 


0,  <t>)  cos  (0/  = 


d^E 


dEa.xdE^.^dE^_,. 


i/;-3 


(65c) 


In  these  equations  £si.a  and  respectively,  are  the  Xth  component  of  £„  and 
the  nth  one  of  E„,  respectively.  Furthermore,  we  can  determine  /J^^XO,  -  to.  w)  and 
-2tt),  to,  to )  separately  from  Eq.  (65b)  if  we  compare  the  coefficients  of  the  terms 
containing  as  factor  cos  to/  and  those  of  the  constant  terms.  One  can  deduce  also 
the  symmetry  relation 

i8"'“(0;to, -to)  =  r""{w;0.  w)  [42].  (66) 


There  are,  of  course,  more  combinations  in  to  in  the  possible,  but  they  are  not 
measurable  at  the  present  time  [42]. 

Formally  one  can  write  also  for  the  elements  of  7  similar  expressions  as  fourth 
derivatives  of  E.  For  instance. 


7 "‘■'■‘(-to;  0,  0,  to)  cos  to/ 


d*E 

dE^t,^  dE^._ , 


(67) 


Conclusion 

It  was  shown  that  if  the  coupled  Hartrce-Fock  equations  based  on  Frenkel’s 
variational  principle  are  applied  for  the  determination  of  the  to  depending  part  of 
the  one-electron  wave  functions  in  the  presence  of  E.,,  one  can  easily  define  a  /  .  to, 
and  field-dependent  quasienergy  as  Rice  and  Handy  [41,42]  did  in  their  per- 
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turbational  theoretical  approaeh  to  the  determination  of  the  orbitals.  One  can 
also  add  to  a  static  field  Es,  and  define  the  quasienergy  with  its  inclusion. 
From  this  .quasienergy  quasi  Hartree-Fock  equations  were  derived  with  Fock 
operators  F(l.  w,  Ej,.  Eo)  which  give  rise  also  to  one-electron  energies  c,,  which 
depend  on  the  same  quantities  as  F.  The  whole  formalism  was  extended  from 
molecules  also  to  periodic  polymers.  Finally  the  quasi  MP/2  energies  for  polymers 
were  also  derived.  Adding  those  to  the  SCF  quasitotal  energies,  one  obtains  the 
tot  il  energy  of  a  polymer  per  unit  cell  £(a),  t,  Ej,.  Eo)/(2N  -t-  1 ). 

With  the  aid  of  these  quantities  the  polarizability  and  hyperpolarizability  tensor 
elements  can  be  expressed  in  the  usual  way  [41,42].  Very  probably  for  the  necessary 
numerical  derivatives  the  same  procedures  can  be  applied  as  before  [41,42].  The 
programming  of  our  coupled  HF  equations  in  the  presence  of  E  =  £„  +  E„  for  a 
linear  polymer  as  well  as  the  correction  of  its  quasitotal  energy  per  unit  cell  for 
correlation  is  in  progress. 

As  next  step  we  plan  to  apply  a  laser  pulse  instead  of  E(/)  which  means  that  the 
magnetic  vector  potential  A,.,  +  0,  but 

A 

A.  =  Z  [Aoc"""'  +  Ajc-'"’“'] 

111=1 

(as  it  was  done  before  without  correlation  and  without  the  correct  treatment  of 
lattice  symmetry  in  [37]).  Finally,  we  plan  to  extend  the  theory  to  2D  and  later 
also  to  3D  periodic  lattices. 
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Abstract 

The  structural  and  electronic  properties  of  polvsilane.  {S1H2),  and  polvsilene.  (SiH ),  will  be  compared 
with  their  carbon-based  analogues  polvcthvlene.  (CHj),.  and  polyacetylene.  (CH),,  aS  investigated  by 
the  ah  intlio  crystal  orbital  method  using  several,  partly  highly  polanzed.  atomic  basis  sets  of  increasing 
size  at  the  Hartree-Fock  ( lir )  level  and  by  including  electron  correlation  effects  up  to  the  third  order  of 
Moller-Plessct  perturbation  theory  The  single-particle  energy  bands  have  also  been  corrected  for  cor¬ 
relation  effects  applying  the  electron  polaron  quasiparticlc  (qp)  method  Election  correlation  plays  an 
essential  role  in  stabilizing  the  guiuhe  conformation  of  polysilane  against  the  Ira  is  form  While  the 
guiiJw-trans  energy  difference  is  about  0.25  mH  at  the  Hr  level,  it  will  be  reduced  to  about  -0.8  mH 
due  to  correlation  Also  the  energy  bands  and  band  gaps  will  be  substantially  modified  in  going  from 
the  HF  to  the  QP  description,  resulting  in  A/-',ap  ^  9.5  and  1 1.8  eV  for  traits  and  leiiiii/it' polysilane  at  HF, 
and  5.12  and  5.56  cV  at  the  qp  level,  respectively.  Polysilene  turns  out  to  be  a  semiconductor  (similar 
to  polyac^tylenc I  for  which  bond-alternation  energetically  stabilizes  the  semiconducting  phase  (as  com¬ 
pared  to  the  equidistant,  metallic  one)  by  1.4  mH  at  the  Ht  and  by  0  6  mH  at  the  .Mp2  levels,  with  bond 
alternations  of  IR  =  0.1327  and  0.0865  A.  respectively,  c  iv9.t  John  Wilcy  &  .Sons.  Inv. 

introduction 

Experimental  and  theoretical  research  of  the  past  decade  convincingly  demon¬ 
strated  several  interesting  physical  and  chemical  properties  of  carbon-based  organic 
polymers  like  polyacetylenes,  (CH),,  with  a  wide  range  of  new  technological  po¬ 
tentials  [  1  ] .  The  experience  with  this  type  of  materials  has  led  to  the  common  belief 
that  the  efficient  charge  and  energy  transport  phenomena  in  organic  polymers  should 
be  related  to  their  extended  rr-conjugated  double-bond  system.  Some  early  band 
structure  calculations  on  the  p-bonded  backbone  of  poly  peptides  [2.3]  indicated, 
however,  that  efficient  electron  delocalization  may  also  result  without  ir-electrons. 
In  fact,  theoretical  effective  mass  and  mobility  values  for  both  electrons  and  holes 
indicated  more  pronounced  semiconductive  properties  in  a  two-d.mensional  poly¬ 
peptide  sheet  along  the  p-bonded  backbone  of  the  amino  acid  residues  than  in  the 
perpendicular  direction,  betw'een  the  jr-electron  systems  of  the  peptide  units. 

Recent  experimental  progress  in  investigating  a  relatively  new  class  of  silicon- 
based  materials,  the  polydiorganosilyienes  (polysilanes).  (SiRiRy),,  with  a 
backbone  comprised  entirely  of  Si  —  Si  ir-bonds.  has  shown,  on  the  other  hand. 


IiUcrnalionalJourna!  of  Quantum  Chemistry.  QuaiiUim  Chemistry  Symposium  27.  131-146  (1993) 

■c  1 993  John  Wiley  &  Sons.  Inc.  CCC  0020-7608/93/0 10131-16 


132 


SUHAI 


that  (T-bonded  polymers  are  capable  of  delocalizing  their  electrons  much  in  the 
same  manner  as  normally  ascribed  to  the  more  familiar  ir-conjugated  materials 
like  pol> acetylenes,  polydiacetylenes,  etc.  It  is  a  striking  observation  that,  though 
polysilanes  are  the  structural  analogues  of  polyolefins  (e.g..  polyethylene),  their 
electronic  and  optical  properties  resemble  more  closely  those  of  quasi-one-di- 
mensional  7r-conjugated  polymers  (like  polyacetylenes).  They  exhibit,  for  ins¬ 
tance,  a  strong  and  conformationally  dependent  optical  absorption  in  the  near 
uv  [4-17],  possess  relatively  low-lying  ionization  potentials  [18.19],  and  show 
an  excellent  nondispersive  photoconductivity  [20,21],  Whereas  the  use  of  poly¬ 
silanes  as  photoconductors  depends  on  their  ability  to  transport  charge  (they 
can  be  doped  to  about  1  S/cm  [22]),  their  application  in  nonlinear  optical 
devices  [23-25]  takes  advantage  of  their  unusually  high  capacity  to  delocalize 
electronic  excitations.  They  proved  to  be  valuable  as  microlithographic  contrast 
enhancement  layers  in  submicron  lithography  [26]  and  may  replace  silicones 
in  some  cases  [27]  due  to  their  higher  thermal  stabilities  and  better  strength. 
Unlike  many  of  the  carbon-based  polymers,  polysilanes  are  almost  invariably 
air-stable  and  processible  by  standard  industrial  techniques.  Two  recent  reviews 
provide  further  details  on  the  properties  and  of  the  potential  applications  of 
these  exciting  polymers  [28-30], 

In  view  of  these  experimental  developments,  it  seems  to  be  an  attractive  theo¬ 
retical  problem  to  systematically  compare  the  structural  and  physical  properties 
of  analogous  silicon-  and  carbon-based  materials,  respectively,  and  try  to  trace 
back  the  similarities  and  differences  in  their  properties  to  their  electronic  struc¬ 
ture.  As  a  first  step  towards  this  goal,  the  theoretical  work  to  be  reported  here 
has  mainly  concentrated  on  the  Si  and  C  backbones  of  the  corresponding  poly¬ 
mers  themselves  (taking  always  hydrogen  atoms  as  side  groups  R)  and  R2,  re¬ 
spectively  ).  There  are,  however,  several  convincing  experimental  evidences  for 
the  importance  of  the  detailed  chemical  structure  of  these  side  groups  in  gov¬ 
erning  the  physical  properties  of  polysilanes.  On  the  basis  of  some  preliminary 
calculations  applying  methyl  groups  as  substituents,  we  have  also  obtained  some 
hints  for  their  expected  influence  through  the  Si  —  C  orbital  mixing  but  their 
more  accurate  investigation  has  to  be  the  subject  of  further  elaborate  investi¬ 
gations.  Besides  polysilanes,  for  which  a  large  volume  of  experimental  data  are 
available,  we  will  also  report  theoretical  results  on  the  hypothetical  /ran.v-poly- 
acetylcne  analogue  (SlH)^,  both  in  its  equidistant  and  (single-double  bond) 
alternating  conformations,  respectively.  Though  it  seems  to  be  very  intricate  to 
sy  nthcsize  stable  compounds  containing  silicon-silicon  double  bonds  [31],  the 
first  successful  synthesis  of  such  a  compound  [32]  induced  theoretical  work  on 
silicon-containing  poly  mers  containing  unsaturated  bonds  [33-36]. 

Most  of  the  earlier  theoretical  investigations  aiming  at  the  prediction  of  the  struc¬ 
tural  parameters  of  polysilanes  used  small  ohgosilanes  ( up  to  pentasilane)  as  model 
compounds  apart  from  empirical  force  field  calculations  [37-40]  that  could  use 
chains  up  to  30  Si  atoms.  Among  the  scmiempirical  methods.  CNDO/2  has  been 
u.sed  to  compute  the  structure  and  ionization  potentials  [41]  as  well  as  low-lying 
excitation  energies  [42],  mndo  [4.3,44]  and  mndo/2  [45]  calculations  predicted 
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reasonable  Si  —  Si  bond  lengths  for  different  trans  and  gauche  conformations,  re¬ 
spectively,  while  AMl  and  PM3  [46]  have  been  used  to  analyze  the  conformational 
effect  of  various  substituent  groups.  In  the  first  ab  initio  hf  cluster  calculations, 
Mintmire  and  Ortiz  [47,48]  utilized  a  polarized  (3-21G*)  atomic  basis  set  and 
obtained  reasonable  bond  lengths  and  valence  angles,  respectively,  and  provided 
vertical  ionization  potentials  of  the  first  four  oligosilanes  in  good  agreement  with 
experiment.  At  the  same  level  of  theory.  Nelson  and  Pietro  [49]  studied  the  evolution 
of  the  valence  band  of  polysilanes  from  cluster  energy  levels,  while  Balaji  and  Michl 
[50]  performed  Cl  calculations  with  a  double-zeta  +  d  +  Rydberg-type  basis  set  to 
obtain  the  singlet  excitation  energies  of  the  Si„H2„+2,  n  =  2  -  5  series.  Finally, 
Kirtman  and  Hasan  computed  in  a  recent  study  [51]  linear  and  nonlinear  static 
electronic  polarizabilities  of  H3  -  (SiH2)„  -  SiH3  oligomers  until  «  =  13  in  the 
trans  conformation. 

The  most  sensitive  part  of  the  abovementioned  conformational  studies  is  the 
computation  of  the  dihedral  angles  of  the  Si  backbone  that  play  a  decisive  role 
in  determining  the  photophysical  properties  of  these  materials  [52].  trans  and 
gauche  conformations  have  generally  been  identified  as  local  minima  of  the 
potential  energy  surface  but  the  location  of  the  absolute  minimum  remained, 
due  to  the  small  energy  differences  obtained,  mostly  beyond  the  credibility  limit 
of  the  methods  applied.  Methods  applying  empirical  potential  functions  usually 
favored  the  gauche  over  the  trans  form,  while  single  determinant  MO  methods 
(neglecting  dispersion  forces)  predicted  mostly  the  less  compact  trans  form  as 
the  absolute  minimum.  A  very  important  hint  for  the  need  of  including  electron 
correlation  in  these  investigations  came  from  the  Mp2/6-31G*  calculation  of 
Mintmire  and  Ortiz  [47]  who  found  that  the  mp2  energy  (computed  for  the  HF 
optimized  geometry)  slightly  inverts  the  stability  order  in  tetrasilane  (favoring 
the  gauche  form  by  0.04  kcal/ mol). 

Theoretical  work  on  infinite  polysilanes  concentrated  mostly  on  the  computation 
of  the  band  structures  by  using  semiempirical  [41,53,54]  or  local  density  functional 
[55-58]  methods  and  standard  geometrical  parameters.  Ab  initio  crystal  orbital 
geometry  optimizations  have  been  performed  at  the  HF  level,  however,  for  the  trans 
conformer  using  a  minimal  basis  set  [59],  for  both  of  the  trans  and  gauche  con- 
formers,  respectively,  with  a  double-zeta  basis  [60],  and  for  different  mixed  com¬ 
posites  of  the  two  structures  again  at  the  minimal  basis  level  [61],  The  linear  po¬ 
larizability  of  /rani-polysilane  has  been  investigated  with  sto3G  and  4-3 IG  basis 
sets  recently  [62]  without  geometry  optimization. 

The  present  article  will  give  a  short  review  of  a  first  pnnciples  method  for  the 
calculation  of  electron  correlation  effects  on  the  total  and  single  particle  energies 
in  infinite  polymers  and  it  will  present  new  results  concerning  the  geometrical 
structure  and  the  electronic  band  indices  of  several  infinite  polyene  models.  The 
next  two  sections  will  summarize  the  methodological  aspects  related  to  the  com¬ 
putation  of  the  conelated  ground  state  in  the  second  order  of  PT  and  of  its  hf  part, 
respectively,  while  the  fourth  section  will  provide  the  necessary  working  expressions 
to  compute  (correlated)  quasiparticle  energies.  Finally,  we  will  present  our  new 
results  for  five  different  infinite  polyene  models. 
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Computational  Method:  Hartree-Fock  and  MoIler-PIesset 
Schemes  for  Infinite  Regular  Polymers 

Our  ah  initio  computational  procedure  to  calculate  electron  correlation  effects 
for  infinite  polymers  has  been  recently  described  [63,64],  Here,  we  only  summarize 
the  basic  expressions  to  define  the  various  theoretical  levels  as  applied  to  polysilanes. 
The  zeroth-order  (Hartree-Fock)  ^electron  wavefunction,  4’o,  will  be  written  as 
a  Slater-determinant  built,  utilizing  the  translational  symmetry  of  the  polymer, 
from  doubly  filled  Bloch  functions,  (f)(7),  in  the  form 

$0  =  (fV!)“'^Met[-  •  •(t)l{r,)a{ff,)<j)l(7,-^,)0[<j,+x)‘  •  •] .  (1) 

(f)J(  7)  stands  here  for  a  one-electron  state  with  quasi-momentum  k  in  band  n.  The 
Bloch  functions  themselves  will  be  expressed  as  linear  combinations  of  symmetry 
adapted  Bloch  basis  orbitals  in  the  form 

V 

0«(7)=  2  cL^5(7),  (2) 

<3-1 

where  v  is  the  number  of  basis  orbitals  per  elementary  cell  of  the  polymer  and  the 
Bloch  orbitals  if  J(  7 )  are  themselves  linear  combinations  of  (Gaussian-type )  atomic 
orbitals: 


A'c 


ii{7)  =  Nc''-  2  e\p{flcRj}xi,{7) . 


(3) 


Ni  stands  here  for  the  number  of  elementary  cells  and  the  atomic  orbital  Xi(7)  = 
Xa(7  -  Ra-  Rj)  is  centered  in  the  cell./  at  Ra  +  Rj.  The  nonrelativistic  electronic 
Hamiltonian  of  the  polymer  (in  atomic  units)  has  the  form 


//  = 


"A 

-22 


A^\ 


+  2 
l<tn 


(4) 


is  here  the  number  of  atoms  per  unit  cell  and  is  the  core  charge  of  the  atom 
A  at  position  ,  respectively.  Variation  of  the  expectation  value  (4>o|7/|4>o)  leads 
to  the  polymer  HF  equation 

F(7,)<t)l(7,)  =  cl(j>J(7,) ,  (5) 

with  the  Fock-operator 


1  ^  .  r  Pi'^n,.7„ 


P(7i,7,„) 
\7i-  7„,\ 


H7h7m)  , 


(6) 


where  the  permutation  operator  P  interchanges  the  variables  7/  and  7,„,  respectively. 
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before  integration  for  the  function  standing  behind  him.  The  first-order  density 
matrix  p(7/,r„,)  is  constructed  from  the  occupied  Bloch  orbitals  as 

occ  { B/.  i 

P(r/,r„,)=2  Z  <^>«(r/)[0«(r„,)]*  .  (7) 

n  X 

For  the  computation  of  the  correlation  energy,  we  will  apply  the  Moller-Plesset 
partitioning  scheme  [65]  of  the  Rayleigh-Schrodinger  many-body  perturbation 
theory  (rs-mbpt )  [66]  by  writing  the  full  many-particle  Hamiltonian  (4)  in  the 
form 

H  =  H„+Q.  (8) 

fl,=  'Z.eic]c,  (9) 

/ 

Q  =  1/2  z  Z  2  Z  (IJ/LM)c]cU-mC,. 

I  J  L  At 

-lll(LI\\U}n,.c]cj  (10) 

I  j  I. 

c]  and  Cl  are  here  creation  and  annihilation  operators,  respectively,  referring  to  the 
Bloch  spin  orbitals  0/  (the  compound  index  I  refers  to  the  band  index  /  and  quasi¬ 
momentum  X, ), «/,  is  the  occupation  number,  and  the  four-center  repulsion  integrals 
are  defined  by 

(IJ\LM)  ^  j  drt  j  ^/r2(/.f■(7|)vJ^(r2)*r^2  0f'(7,)0L"(72)  (11) 

Integrals  of  the  type  <  •  •  •  ll  *  •  • )  are  in  their  structure  analogous  to  Eq.  (11)  with 

the  difference  that  the  Coulomb  operator  is  multiplied  m  them  by  the  factor  ( 1  - 
P12)  to  take  account  of  exchange.  The  zeroth  order  many-electron  wavefunctions 


are  eigenfunctions  of  //„: 

//,A  =  £,<i>x.  (12) 

The  working  expressions  of  mbpt  can  be  written  in  a  compact  form  using  the 
reduced  resolvent  of  H„  belonging  to  its  lowest  eigenvalue.  E„: 

y. 

G,.=  Z  (£'«-^x)-’|-J>x><4>xl  -G  (13) 

\=t 

In  terms  of  G,  the  first  three  orders  of  the  rs-mbpt  are  given  by 

it',  =  <#„|(?|4>„>  (14) 

A'2  =  <^,,l(2G0|<i>„>  (15) 

=  <^,.I(2G<2G<21^„>  -  <«>..l<3GG<2l<l‘»><<t>J<2l«J>»>  ( 16) 


Since  £nt  =  E„  +  // , ,  the  correlation  energy,  defined  as  the  difference  between  the 
exact  eigenvalue  of  U  and  £,,1  >  can  be  obtained  as  the  sum  of  all  higher  order  terms 
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in  the  perturbation  series  starting  by  £2-  If  we  take  into  account  the  translational 
symmetry  of  the  polymer,  substitute  the  resolvent  operator  from  Eq.  ( 13)  and  make 
use  of  diagrammatic  techniques  [66],  we  straightforwardly  arrive  at  the  expressions 


i<  j  a<  b  X,  Xf  q 


(cf'  +  e]>  -  -  ejr^) 


(17) 


£2=1111111111 

i<  J  a<  h  <<  </  X,  Xj  q  r 

(c^-,  +  e}'  -  cy'^  -  el'-^) 

(ef-  +cf'-c5'""-4'‘') 

+  ZZ2Z2Z22 

(,pr  ^  ^r,-7  _  ^yr,  _  ^r,-5) 


+ZZZZZZZZZZ 


ic}‘  +  ej’  -  c5''’  -  cl'-^Xel'  +  cf'  -  ep"  -  cj^) 


(18) 


The  notation  follows  here  the  convention  that  indices  /.  /,  •  •  •  denote  energy 
bands  occupied  in  the  ground  state  and  a,  ft,  •  •  •  stand  for  virtual  bands. 

The  termination  of  the  perturbation  expansion  at  the  second  order  has  two  major 
advantages.  At  first,  in  this  case  only  matrix  elements  between  the  ground  state  and 
doubly  excited  configurations  will  have  to  be  computed.  (Starting  at  third  order, 
also  matrix  elements  between  doubly  excited  states  will  be  required,  whose  list 
substantially  exceeds  that  of  the  former  elements,  since  even  with  the  use  of  basis 
sets  of  moderate  size  the  number  of  v  irtual  bands  exceeds  by  far  that  of  the  occupied 
ones.)  At  second,  £2  can  be  thought  of  as  a  sum  of  separate  electron-pair  contri¬ 
butions  if  it  is  written  in  the  form 


£2  ~  1  1 '' «//  •' 

/  J 


(19) 


where  the  pair-correlation  energies  c/./arc  defined  by  the  comparison  of  Eqs.  (17) 
and  (19).  In  this  way.  the  particle  concept  can  also  be  preserved  beyond  the  nr 
theory,  and  one  can  define  quasiparticle  states  (and  energy  bands)  that  incorporate 
correlation  effects  at  the  given  level  of  approximation. 

The  HF  method  is  especially  efficient  if  the  Bloch  orbitals  are  calculated  in  the 
form  of  a  linear  combination  of  atomic  orbitals  (1.CA0)  [67,68]  since  in  this  case 
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the  large  amount  of  experience  collected  in  the  field  of  molecular  quantum  me¬ 
chanics  can  be  used  in  polymer  studies.  In  post-HF  computations  (like  correlation 
and  excitonic  effects,  impurity  levels,  etc.),  however,  the  transformation  from  the 
Bloch  basis  to  that  of  Wannier  functions  [69]  turned  out  to  be  a  very  efficient 
procedure.  The  connection  between  the  two  basis  sets  is  given  by  the  Fourier  trans¬ 
formation 

H'i  =  u„(  r  -  Ri)  =  'Z  exp(-/X^/)</>J(7) .  (20) 

I 

where  the  Wannier  function  =  u'„(7  -  R/)  is  centered  around  the  monomer 
at  Ri.  From  the  point  of  view  of  the  accuracy  and  economy  of  the  above  mentioned 
calculations,  the  optimal  ( minimal )  extension  of  ii  „  over  neighboring  unit  cells  in 
the  polymer  is  of  great  importance.  The  originally  undetermined  phase  of  the  Bloch 
functions,  represented  by  the  transformation  ^J(7)  =  exp[i\„(X)]  (where 

\„(k)  can  be  any  analytic  function  of  k  possessing  the  symmetry  of  the  Bz)  can 
be  utilized  to  determine  the  phase  factor  exp[  /X„(  /c ) )  in  such  a  way  that  the  Wan¬ 
nier  functions  will  be  localized  [  70  ] .  After  transformation  ( 20 ),  the  Wannier  func¬ 
tions  are  obtained  in  the  same  ao  basis,  in  which  the  Bloch  functions  are  calculated; 

iv(,(7  -R,)  =  ZZ  c/j;„X„(r  -R,-  R,) .  (21 ) 

A  /> 

The  variational  optimization  of  the  phase  factors  reduces  first  the  sum  k  over  a  few 
neighboring  cells.  For  a  given  cell,  the  Wannier  function  components  Zp  dp„(T  - 
Ri  -  R/,)  are  similar  to  molecular  orbitals  ( MOs).  It  is  not  surprising,  therefore,  that 
methods  worked  out  for  mo  localization  may  also  provide  excellent  tools  in  polymer 
theory.  The  Wannier  transformation  is,  unfortunately,  not  feasible  in  the  case  of 
metallic  systems.  For  this  case,  a  special  numerical  procedure  has  been  developed 
[71]  that  starts  from  Bloch  functions  in  tight-binding  form 

A'c 

0«(r)  =  A7'^'  2  exp{ikR,}Al{r  -R,).  (22) 

with 


A^y -Rj)=  2  clxHT),  (23) 

«-  I 

and  expands  the  A  factors  into  a  series  of  trigonometrical  functions  according  to 
k  within  the  BZ.  In  this  way,  the  four-index  transformation  will  have  to  be  performed 
only  on  the  expansion  coefficients  (and  not  on  the  A  functions  themselves  at  a 
large  number  of  k  values  in  the  BZ).  It  turned  out  that  also  for  metafile  polyene 
(as  found  previously  for  simpler  model  systems  [71])  the  leading  terms  of  the  A- 
expansion  until  second  power  provide  enough  numerical  stability  to  compute  the 
correlation  terms. 

Electron  correlation  influences,  besides  the  ground  state  total  (binding)  energy 
of  the  system,  other  observables  as  well.  Within  an  approximae  physical  model. 
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the  so-called  electron  polaron  theory  [72,73],  one  can  reasonably  correct  for  such 
effects,  for  instance,  the  single-particle  energies  of  the  valence  and  conduction  bands, 
respectively.  The  physical  meaning  of  the  energy  bands  is  given  by  Koopman’s 
theorem  stating  that  the  band  energies,  as  calculated  by  Eq.  (5),  are  in  fact  the 
momentum-dependent  electron  affinities  (eas)  and  ionization  potentials  (IPs)  of 
the  HF  A'^-electron  system,  respectively.  Generalizing  this  statement  to  the  correlation 
problem  (at  second  order  mp  perturbation  theory  [71]),  the  corresponding  quasi¬ 
particle  (QP)  energies  can  be  obtained  as  eas  and  IPs,  respectively,  of  the  correlated 
A^-particle  wavefunction 

Cf(QP)  =  ec(HF)  +  2:‘c^^'’(e)  +  ,  (24) 

er(QP)  =  cr(HF)  +  S^A^)  +  •  (25) 

The  correction  terms  appearing  in  Eqs.  (24, 25),  in  addition  to  the  HF  band  energies, 
are  the  electron  and  hole  self-energies,  2(e)  and  2(/i),  respectively.  The  physical 
origin  of  these  self-energy  corrections  is  a  cloud  of  virtual  excitons  accompanying 
the  HF  particles  in  analogy  to  the  lattice  polaron  problem  where  virtual  (optical) 
phonons  dress  the  polarizing  particle.  A  more  detailed  analysis  of  the  structure  of 
the  corresponding  expressions  reveals  [71-73]  that  the  various  2(t)-type  terms 
tend  to  stabilize  and  the  2(/i)-type  terms  tend  to  destabilize  the  corresponding 
quasiparticle  states,  respectively.  As  a  net  effect,  the  correlated  IPs  and  EAs  will  be 
reduced,  as  compared  with  their  HF  values;  the  valence  bands  will  move  upwards 
and  the  conduction  bands  downwards,  respectively.  Since  the  upper  and  lower 
band  limits  will  be  usually  influenced  to  a  different  extent,  a  Franck-Condon  type 
band  narrowing  effect  can  also  be  observed  (as  in  the  case  of  phonon  polaron 
states). 


Conformational  Properties:  A  Comparison  of  Carbon- 
and  Silicon-Based  Polymers 

The  conformational  parameters  of  the  simplest  silane  polymer,  (SiH2)A,  have 
been  investigated  both  at  the  hf  and  mp2  levels  of  theory,  respectively.  To  under¬ 
stand  the  dependence  of  the  physical  properties  of  these  polymers  (especially  those 
strongly  depending  on  correlation  effects)  on  the  size  of  the  atomic  basis  set,  we 
have  employed  in  these  studies  several  such  sets  starting  from  the  minimal-type 
STO-3G,  through  double-zeta  (DZ)  [74]  up  to  se^’eral  polarized  sets  like  DZfp), 
DZ{d,p),  DZ{2d,p),  and  DZ(2dfp).  The  exponents  and  contraction  coefficients 
of  these  basis  sets  have  been  taken  from  the  Gaussian  suite  of  quantum  chemical 
programs  [75].  Our  crystal  orbital  program  package  also  uses  several  parts  of  the 
Gaussian  system  including  all  integral  routines.  Table  1  summarizes  the  structural 
parameters  obtained  for  the  three  basic  rotamers  of  polysilane,  the  trans,  eclipsed, 
and  gauche  structures,  respectively.  Small  cluster  calculations  [47,48  ]  also  indicate 
the  presence  of  a  fourth  ( eclipsed)  conformer  at  the  dihedral  angle  0  =  0°  but  our 
scf  program  could  not  converge  to  this  solution. 

We  can  see  that,  apart  from  trivial  deviations,  all  basis  sets  for  both  methods 
properly  locate  these  structures  around  the  0  values  of  180°,  120°,  and  60°,  re- 
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Table  I.  Structural  parameters  of  three  polyhydrosilanc.  (SiH^),,  structures  (trans,  gauche,  and  eclipsed, 
respectively)  as  optimized  at  the  Hartree-Fock  (hf)  and  second-order  Moller-Plesset  (mp2)  levels  of 
theory  using  different  atomic  basis  sets.  These  structures  are  characterized  by  the  dihedral  angle  6  as 
defined  by  four  consecutive  Si  atoms  of  the  polymer  (the  trans  form  corresponds  to  the  fully  extended 
chain).  The  potential  surface  of  the  polymer  has  been  scanned  for  8  values  around  the  expected  local 
minima  of  the  trans  and  jgaHc/ic  structures  {6  =  180°  and  8  ^  60°.  respectively),  and  in  the  neighborhood 
of  the  expected  maximum  for  an  eclipsed  structure  {8  =  120°).*  (Distances  are  given  in  A,  angles  in 

degrees.) 


Method/basis  set 

R(Si— SI) 

R(Si— H) 

a(Si — Si — Si) 

/){Si— Si— H)" 

8{Si — Si — Si — Si) 

HF/DZ 

2.3917 

1.4848 

111.93 

109.83 

180. 

HF/DZ  (d,p) 

2.3793 

1.4835 

112.04 

110.12 

180. 

HF/DZ  {2d/,p) 

2.3728 

1.4805 

112.14 

110.16 

180. 

MP2/DZ 

2.3977 

1  4984 

112.02 

109.61 

180. 

mp2/dz  {d,p) 

2.3591 

1.4813 

111.92 

110.28 

180. 

MPllDZ  {2df,p) 

2.3559 

1.4782 

111  92 

110.28 

180. 

HF/DZ 

2.4012 

1.4849 

112.02 

109.89 

120.29 

HF/DZ  {d,p) 

2.3912 

1.4830 

112.38 

110.18 

120.09 

HF/DZ  {2dJ,p) 

2.3878 

1.4802 

112.35 

1 10.23 

120.15 

MP2/DZ 

2.4070 

1.4985 

110.85 

109.76 

120.84 

mp2/dz  (d.p) 

2.3734 

1.4809 

111.75 

109.88 

121.21 

MP2/DZ  {2df.p) 

2.3706 

1.4776 

111.78 

1 10.07 

120.66 

HF/DZ 

2.3925 

1.4849 

111.79 

109.65 

64.50 

HF/DZ  {d,p) 

2.3807 

1.4801 

112.18 

109.96 

61.29 

HF/DZ  t2df,p) 

2.3736 

1.4782 

112.24 

109.98 

60.65 

MP2/DZ 

2.3986 

1.4982 

111.38 

109.50 

61.63 

mp2/dz  (d,p) 

2.3610 

1.4812 

111.06 

110.18 

58.13 

MP2/DZ  (2df,p) 

2.3582 

1  4786 

111.04 

110.39 

60.24 

“  There  is  another  expected  local  maximum  for  an  eclipsed  structure  with  9  =  0°.  Due  to  convergency 
problems  of  the  scf  procedure  for  small  9  values,  our  program  could  not  reach  this  point  on  the  potential 
surface. 

^  The  H — Si — H  plane  is  perpendicular  to  the  Si — Si — Si  plane  and  bisects  the  corresponding  backbone 
angle  a. 


spectively.  The  bond  angles  obtained  for  the  polymer  are  also  in  good  agreement 
with  those  obtained  earlier  for  small  clusters  using  3-21  G*  and  6-3 IG**  basis 
sets,  respectively  [47-49,51].  The  Si — Si  bond  length  is,  however,  very  sensitive 
to  both  basis  set  and  correlation  effects  varying  by  0.03  to  0.04  A  from  the  DZ  to 
the  v>z(2dj\p)  levels.  Since  the  binding  energy  of  the  conformers  very  sensitively 
depends  on  this  distance,  even  its  small  variations  as  functions  of  the  dihedral  angle 
are  important.  It  elongates  by  about  0.015  A  (both  for  HF  and  Mp2)  in  going  from 
the  trans  to  the  eclipsed  conformation  and  relaxes  nearly  back  to  its  trans  value  at 
the  gauche  conformation. 

The  energetic  changes  accompanying  the  torsional  motion  of  polysilane  are  shown 
in  Table  II.  It  is  interesting  to  note  first  that  even  the  use  of  the  DZ(  d,p)  basis 
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(comparable  to  the  6-3 1  G**  basis  set)  does  not  completely  saturate  the  hf  energy 
per  SiH;  unit  in  polysilane.  The  additional  set  of  d-  and  ./-functions  further  decreases 
the  HF  energy  by  20  and  8  millihartrees.  respectively.  This  is  in  striking  difference 
to  its  structural  carbon  analogue,  polyethylene,  for  which  the  DZ(  d,p}  basis  is  very 
close  to  the  estimated  hf  limit  (Table  III).  The  relative  HF  energies  are,  however, 
less  sensitive  to  polarization  functions  also  for  polysilane.  The  height  of  the  eclipsed 
tnrs'onal  barrier  (1.7- 1.8  millihartree)  is,  for  instance,  reasonably  deseribed  by 
using  one  set  of  ^/-functions  for  the  Si  atom  [dz(  d)].  This  situation  is  similar  to 
polyethylene  tnough  the  barrier  height  is  substantially  larger  for  the  carbon  backbone 
(6. 8-6.9  millihartree).  Since  the  correlation  contribution  is  very  similar  in  the  irons 
and  echpwd  conformers  of  polysilane,  respectively,  the  mp2  level  barrier  values  are 
nearly  equal  to  the  HF  ones  ( 1.6- 1.7  milihartree).  Furthermore,  though  the  third- 
order  correlation  contribution.  £3,  substantially  lowers  the  total  energy  (its  value 
is  about  46-43%  of  £2  in  polysilene  and  37-44%  in  polyethylene,  respectively),  its 
influence  is  relatively  small  on  the  conformational  energy  difference  since  it  changes 
quite  slowly  as  a  function  of  6. 

Since  the  gauche  conformation  is  much  more  compact  than  the  two  previously 
discussed  ones,  dispersion  interactions  become  more  important  for  it.  In  polysilane, 
the  Irons- gauche  barrier  is  about  0.25  millihartree  at  the  HF  level  but  it  diminishes 
at  til?  mp2  and  mp3  levels,  respectively,  and  the  gauche  structure  will  be  stabilized 
by  about  -0.8  millihartree  relative  to  the  irons  conformation.  This  differs  again 
from  polyethylene  where,  though  the  corresponding  barrier  is  also  reduced  by  cor¬ 
relation,  It  seems  to  be  stabilized  at  about  1 .4  to  16  millihartree,  i.e.,  the  irons 


Tabli  III  Hartrec-Fock  (/-hU-  second-  and  third-order  coirelation  (/.;  and  £0,  and  total  energies 
(/.Mp;  and  /.'mpi).  respectively,  obtained  (using  different  atomic  basis  sets)  per  unit  cell  of  polyethylene 
in  different  conformations,  truin  (dihedral  angle  0  -  180°).  aain.lw  (H  -  60°).  and  eclipsed  (t)  =  120°) 
The  absolute  energies  for  the  tra/n  conformcr  are  given  in  hartree.  the  relative  energies  for  the  gituihe 
and  rt/ipvrt/ structures,  jpectively,  are  measured  in  millihartree  from  the  corresponding  (ra«s-valuc 


Energy 

STO-3G 

DZ 

DZ  (r/) 

t>Z  {(l,p) 

DZ  {2d.p) 

DZ  {2d/.p) 

LiiPiliuns) 

-.ys  580150 

-.39.022111 

-.39  0.39101 

-39  042697 

-39  043195 

-39.04.3797 

{trails) 

-0  052657 

-0.101050 

-0.14207.3 

-0.155058 

-0.166158 

-0.178008 

£3'  {trails) 

— 

-0.045211 

— 

-0.057358 

— 

— 

l-Mr:''  {trails) 

-38.6.52689 

-39.12.3008 

-39.181045 

-39  197612 

-.39.209208 

-39  221652 

/i'mp/  (trails) 

— 

-.39.168219 

— 

-39.254970 

— 

— 

A£|ii“  (eclipsed) 

6.250 

6.482 

6.864 

6  921 

6.874 

6.862 

■iAivip;'’  (eclipsed) 

— 

5.7.36 

5.981 

6.030 

6.152 

6  068 

A£mpi°  (eclipsed) 

— 

5  642 

— 

5.881 

— 

— 

A£|,i°  {ftamlw) 

1.617 

1.599 

1.657 

1  691 

1.624 

1.3.34 

4£mp;'’  {ftaudic) 

— 

0.377 

0..381 

()..35.3 

0.328 

0.3.30 

— 

0.386 

— 

0  247 

— 

— 

°  Energy  at  the  optimized  lir  geometry. 

Energy  at  the  optimized  mp2  geometry 
'  Single-point  calculation  at  the  mp2  optimized  geometry. 
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configuration  remains  the  energetically  favored  one.  Since  the  conformation  energy 
differences  obtained  here  are  comparable  to  the  thermal  energy  at  room  temperature 
(kT  is  about  1  millihartree  at  300°K),  these  polymer  backbones  will  be  extremely 
flexible  against  conformational  excitations  at  higher  temperatures.  Furthermore, 
since  interchain  interactions  in  the  corresponding  polymer  ci'vstals  will  also  play 
an  important  role  in  determining  the  most  stable  single  chain  structure,  a  final 
conclusion  on  relative  stabilities  will  be  possible  only  after  their  inc  usion.  The 
available  experimental  structural  parameters  also  refer  to  polysilanes  with  more 
complicated  side  chains  so  that  a  more  serious  comparison  with  experiments  will 
have  to  await  their  inclusion  into  the  computations.  ( More  details  on  these  potential 
surfaces  and  on  the  corresponding  zero-point  vibrational  corrections  will  be  pub¬ 
lished  elsewhere.) 

Table  IV  presents  the  hf  and  mp2  energies,  respectively,  for  the  hypothetical 
polymer  /ran.r-(SiH),,  optimized  first  as  a  bond-alternating  and  then  as  an  equi¬ 
distant  zig-zag  chain.  The  development  of  the  bond  alternating  geometry  in  poly- 
silene  is  very  similar  to  the  case  of  /ram-polyacetylene  [64.7 1].  While  the  increasing 
quality  of  the  HF  description  [STO-3G,  DZ,  and  DZ(  d,p)  basis  sets,  respectively], 
produces  bond  alternation  values  (difference  in  the  lengths  of  single  and  double 
bonds,  respectively )  of  \R  =  0.1438, 0.1074,  and  0.1 167  A  for  polyacetylenc  [71], 
the  corresponding  \R  values  for  polysilene  are  0.1529.  0.1 159.  and  0.1327  A.  As 
correlation  reduces  Ai?  to  0.1 105.  0.0834.  and  0.0816  A  for  polyacetylene,  it  has 
the  same  effect  in  polysilene  leading  to  0. 1 1 2 1 . 0.0990,  and  0.0865  A,  respectively. 
Though  the  energy  barrier  of  about  1 .4  millihartree  between  the  alternating  and 
equidistant  chains  at  the  HF  level  will  be  reduced  to  approximately  0.6  millihartree 
at  the  mp2  level,  it  definitively  persists  and  favors  the  bond-alternating  (semicon¬ 
ducting)  phase  as  the  energetically  stable  one. 


Tabu.  IV  Hartree-Fock  (Tin),  second-order  correlation  and  lolal  energies  (T’mp:).  respectiveK. 
obtained  (using  different  atomic  basis  sets)  per  SiH  unit  of  pobhvdrosilcnc.  (SiH),.  in  alicrnultng  irons 
(alt )  and  viiunlistoiu  irons  (cqu.)  conformations,  respecticeh.  The  absolute  energies  (/.)  ar-  given  in 
hartree.  the  relative  energies  (A/;)  in  millihartree. 


STO.tG 

DZ 

I5Z  (d,p) 

DZ  (2dl.p) 

Till'  («/.■•) 

-286.154454 

-289.446521 

-289  476705 

-289.498665 

T':"  (oil ) 

-0.049050 

-0.114584 

-0.156216 

-0.172212 

f-MP:*'  Ulll ) 

-286.20.t000 

-289.560313 

-289.476016 

-289.670768 

Till'  («/» ) 

-286.152922 

-289.445485 

-289.475306 

-289.497253 

(<’</» ) 

-0.050179 

-0.115073 

-0.157010 

-0.172788 

) 

-286.201808 

-289  559785 

-289.631699 

-  289.670200 

A/;„i*  (oh  -lyii ) 

-2,180 

-I  016 

-1.399 

-1  412 

A /imp;*’  (alt  -equ  ) 

-1.192 

-0.528 

-0.533 

-0.568 

*  Unergv  at  the  optimized  Ml  geomelrv. 

*’  Fnergv  al  the  optimized  mi>2  geomelrv. 
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Tablc  V.  HF  and  quasiparticle  (Qp)  energv  levels  at  the  limits  of  the  valence  and  conduction  bands, 
respectively,  of  tram-polyhydrosilane,  MS1H2),,  computed  with  double-zeta  (Dz)  and  polarized  atomic 

basis  sets,  respectively  (in  eV) 


DZ 

DZ  (d.p) 

DZ  (2dl,p) 

HF 

QP 

HF 

QP 

HF 

QP 

mm 

-13.652 

-12.588 

-13.816 

-12.246 

-13.829 

max 

-8.473 

-7.612 

-8.588 

-7.386 

-8.677 

-7.138 

min 

1.276 

0.942 

1.224 

-1.471 

0919 

-2.012 

max 

9  501 

8  902 

9.408 

6.912 

9.308 

5.942 

9.749 

8.554 

9.812 

5.915 

9.516 

5.126 

Energy  Bands  in  Hartree-Fock  and  Quasiparticle  Theory 

Tables  V  and  VI  contain  the  limits  of  the  valence  and  conduction  bands,  and 
the  width  of  the  forbidden  gap  computed  for  the  iram  and  gauche  conformations 
of  (SiH2);t  using  selected  basis  sets  at  the  HE  and  electron  polaron  levels,  respectively. 
The  HE  bands  seem  to  be  quite  stabilized  at  the  dz  basis  set  level  including  the 
band  energy  shifts  due  to  the  backbone  torsion.  The  valence  band  of  the  gauche 
modification  will  be  stabilized  by  about  0.8  eV  compared  with  the  trans  one.  and 
it  becorr>es  narrower  by  1  to  1 .2  eV.  A  similar  shift  to  lower  energies  occurs  for  the 
HE  conduction  band  in  going  from  irons  to  gauche  but  its  width  will  be  reduced 
by  2.5  eV.  The  formation  of  correlated  quasiparticle  states  according  to  Eqs.  (24, 
25)  shifts  the  valence  band  for  both  systems  upwards  by  an  amount  depending  on 
the  magnitude  of  the  correlation  energy  associated  with  the  basis  set  in  question 
(ranging  from  0.8-2,2  eV  for  different  basis  sets).  The  lower  edges  are  shifted 
somewhat  stronger  (by  0.^  -0.8  eV)  resulting  in  the  previously  mentioned  Franck- 
Condon  type  band  narrowing.  Similar  observations  can  be  made,  in  the  opposite 
energetic  direction,  for  the  conduction  bands.  As  a  net  effect,  the  semiconducting 
band  gap  will  be  substantially  reduced  due  to  correlation;  its  trans  he  value  of  9.5 
eV  diminishes  to  5.12  eV  and  xht  gauche  value  from  1 1.8  eV  to  5.56  eV. 


Table  V;  hf  and  quasiparticle  (qp)  energy  levels  al  the  limits  of  the  valence  and  conduction  bands, 
respectively,  ofgai«//c-polyhydrosilane.  g-lSiHj),.  computed  with  double-zeta  (DZ)  and  polarized  atomic- 

basis  sets,  respectively  (in  cV). 


DZ  DZ  (r/.p)  DZ  (Idf.p) 


HF 

QP 

HF 

QP 

HF 

QP 

mm 

-14.602 

-13.208 

-14.588 

-12.711 

-14.617 

-12.208 

max 

-9.216 

-9  002 

-10  590 

-8.422 

mm 

1.5.30 

0.948 

1.406 

-2.2(X) 

1.192 

-2  862 

max 

7.105 

6  287 

6.997 

3.067 

6.872 

2.433 

•i/'jap 

12.016 

10.164 

11.918 

6.802 

11.782 

5.560 
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Table  VII  hf  and  quasiparticle  (qp)  energy  levels  at  the  limits  of  the  valence  and  conduction  bands, 
respectively,  of  alternating  Pu/is-pohsilene.  MSiH),,  computed  with  double-zeta  (DZ)  and  polanzed 
atomic  basis  sets,  respectively  (in  eV). 


D7 

nz  td.p) 

DZ  {.2df,p) 

HF 

QP 

HF 

QP 

HF 

QP 

^1.  mm 

-9.428 

-8.108 

-9.506 

-8.426 

-9.544 

-8.166 

^"1.  max 

-5.924 

-5.088 

-5.987 

-4  909 

-6.062 

-4.720 

mm 

-1.802 

-2.617 

-1.969 

-3.188 

-1.980 

-3.392 

max 

2.716 

1.420 

2  646 

1.289 

2.579 

0.822 

4.122 

2.471 

4.018 

1.721 

4.082 

1.328 

Table  VII  summarizes  the  analogous  band  indices  for  the  energetically  more 
stable  bond-alternating  model  of  tram  polysilene.  Besides  similar  observations  as 
above  in  the  case  of  polysilane,  it  is  important  to  emphasize  here  that,  though 
correlation  again  substantially  reduces  the  band  gap.  with  Afgap  =  1 .3  eV  polysilene 
still  remains  a  semiconductor.  The  energy  band  structures  obtained  here  for  a  num¬ 
ber  of  different  silicon-based  polymers  form  the  starting  point  for  the  computation 
and  structure-dependent  analysis  of  their  optical  spectra  making  use  of  the  inter¬ 
mediate  exciton  model  [76,77].  The  corresponding  results  will  be  published  else¬ 
where  [78]. 
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Spin-Polarized  LCGTO-ff  Band  Structure  Technique 
Application  to  3d  Transition  Metal  Monolayers 
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Abstract 

The  linear  combinations  of  Gaussian-lype  orbitals-fitting  function  (  lcgto-ff  )  technique,  as  embodied 
in  the  program  package  FILMS,  has  been  extended  to  include  spin-polarization  effects.  The  spin-polarized 
LCCiTO-rF  method  is  descnbed  here.  Test  results  arc  pre.sented  for  square  monolayers  of  seven  3d- 
transition  metals  (Ti,  V,  Cr,  Mn,  Fe,  Co,  and  Ni)  at  the  lattice  constant  of  AgfOOl ).  The  LCGTO-ff 
results  compare  favorabi)  with  previous  results  obtained  with  the  flapw  technique  ©  1993  John  Wilcy 
&  Sons.  Inc 


Introduction 

During  the  past  decade,  the  linear  combinations  of  Gaussian  type  orbitals-fitting 
function  (lcgto-ff)  technique  [1]  has  been  used  to  calculate  the  properties  of  a 
number  of  ultra-thtn  films  (UTF);  see  Ref,  [2]  forareview.  Most  of  those  calculations 
have  been  restricted  to  UTFs  composed  of  relatively  small  atoms  (Z  s  20);  although 
there  have  been  some  exceptions  [  3  ] .  This  focus  on  small  systems  has  been  due, 
in  part,  to  two  basic  limitations  of  the  program  package  (FILMS)  used  for  those 
calculations.  ( 1 )  neglect  of  relativistic  effects,  and  ( 2 )  neglect  of  spin-polarization. 
Here,  the  program  FILMS  has  been  modified  to  incorporate  spin-polanzation 
effects  into  the  lcgto-ff  technique.  This  extension  of  the  method  is  straightforward 
but  has  potential  pitfalls  due  to  the  particular  formalism  used  by  FILMS  to  fit  the 
local  spin-density  approximation  (lsda)  exchange-correlation  (xc)  kernels 
and  r„ ).  For  this  reason,  the  new  spin-polarized  version  of  FILMS  has  been  tested 
carefully  by  calculating  the  spin  magnetic  moments  for  square  lattice  monolayers 
of  seven  3d  transition  metals  (Ti.  V,  Cr,  Mn,  Fe,  Co,  and  Ni),  all  at  the  theoretical 
lattice  constant  of  Ag(001 );  a  =  5.51  a.u.  [4],  This  particular  test  allows  direct 
comparison  with  previously  published  results  obtained  with  the  FLAPW  technique 
[5]  using  the  same  LSDA  model  as  is  used  in  FILMS  (that  proposed  in  Ref.  [4]), 
The  basic  lcgto-ff  method  and  its  extension  to  include  spin-polarization  effects 
arc  described  in  the  next  section.  Results  will  then  be  presented  for  the  cohesive 
energies,  spin  magnetic  moments,  and  work  functions  of  the  3d  transition  metals. 
Brief  concluding  remarks  are  given  in  the  final  section. 
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Methodology 

The  LCGTO-FF  Technique 

The  LCGTO-FF  technique  is  an  all-electron,  full-potential,  electronic  structure 
method  which  is  characterized  by  its  use  of  three  independent  Gaussian  basis  sets 
to  expand  the  one-electron  orbitals,  the  charge  density,  and  the  LSDA  xc  energy 
density  and  potential.  The  current  implementation  of  the  lcgto-ff  technique  in 
FILMS  is  derived  from  a  fitting  function  UTF  electronic  structure  code  due  to 
Mintmire,  Sabin,  and  Trickey  [6],  which  in  turn  was  related  to  a  fitting  function 
based  cluster  code  [  7  ] .  Although  FILMS  is  directly  related  to  these  other  GTO  fitting 
function  based  methods,  the  present  implementation  of  the  lcgto-ff  technique 
has  several  distinctive  features  (see  below). 

The  purpose  of  fitting  the  charge  density  with  GTOs  is  to  reduce  the  number  of 
Coulomb  integrals  required  in  the  calculations  by  ensuring  that  only  three-center 
integrals  appear  in  the  total  energy  expression  and  the  one-electron  equations,  instead 
of  the  usual  four-center  integrals.  Thus,  the  total  number  of  Coulomb  integrals 
required  for  a  system  will  scale  as  NIN^  (where  N„  =  the  number  of  orbital  basis 
functions  and  A,  =  the  number  of  charge  fit  functions),  instead  of  the  usual  Nl. 
Since  the  charge  density  is  a  nodeless  function,  unlike  the  orbitals,  is  generally 
less  than  N<,;  thereby  providing  an  additional  reduction  in  the  total  number  of 
Coulomb  integrals.  In  all  of  the  extant  GTO  fitting  function  codes,  the  expansion 
coefficients  of  the  charge  density  fitting  functions  are  determined  by  variationally 
minimizing  the  error  in  the  Coulomb  energy  due  to  the  fit  [6,7].  Once  all  the 
necessary  Coulomb  integrals  have  been  evaluated  analytically,  the  variational  charge 
fit  can  be  accomplished  by  solving  a  simple  linear  matrix  equation  [6,7]. 

The  calculation  of  the  three-center  Coulomb  integrals  involves  lengthy  lattice 
summations  over  a  pair  of  orbitals  and  a  charge  fitting  function  plus  a  neutralizing 
array  of  nuclei  [6] .  To  improve  convergence,  the  integral  sums  for  the  very  diffuse 
orbitals  are  calculated  with  mixed  real  and  reciprocal  space  techniques  [  6  ] .  The 
remaining  integral  sums  are  accelerated  via  multipole  expansion  techniques  [6]. 
In  FILMS,  the  pure  real-space  integral  summations  for  the  5-type  fitting  functions 
plus  their  neutralizing  point  charge  arrays  are  further  accelerated  by  splitting  the 
sums  into  a  short-range  part  consisting  of  the  fitting  function  and  a  single  neutralizing 
positive  point  charge  and  a  long-range  part  formed  from  the  nuclear  array  and  a 
single  negative  point  charge  [  8  ] .  This  procedure  is  both  faster  and  more  stable  than 
the  original  method  used  in  Ref.  [6]. 

The  xc  fitting  function  procedure  f’an  be  understood  best  as  a  rather  sophisticated 
numerical  quadrature  scheme  which  allows  a  highly  accurate  evaluation  of  the  xc 
integrals  on  a  relatively  sparse  integration  mesh  by  fitting  of  the  integral  kernels 
with  GTOS.  The  expansion  coefficients  of  the  xc  fitting  functions  are  obtained  from 
a  weighted  least  squares  fit  to  the  energy  density  or  potential  on  the  numerical 
integration  mesh.  This  least  squares  fit  can  be  expressed  as  a  linear  matrix  equation 
whose  elements  are  obtained  via  numerical  integration.  The  fitted  energy  density 
and  potential  then  can  be  used  to  evaluate  the  xc  matrix  elements  of  the  secular 
equation  and  total  energy  analytically  to  a  much  higher  level  of  precision  (and  in 
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less  time)  than  would  be  possible  with  direct  numerical  integration.  The  accuracy 
of  the  xc  matrix  elements  is  also  improved  by  constraining  the  fit  procedure  to 
ensure  that  the  numerical  integral  of  the  charge  denr'  iy  times  the  difference  between 
the  fitted  and  exact  energy  densities  (or  potentials)  \anishes  [9], 

In  FILMS,  the  numerical  xc  integrals  arc  carried  out  with  an  integration  technique 
specifically  designed  to  exploit  the  utf  geometry  [8].  The  UTF  unit  cell  (which 
has  infinite  extent  in  the  z-direction)  is  divided  into  three  distinct  regions:  ( 1 )  a 
collection  of  touching  spheres  centered  on  the  atomic  sites  in  the  cell,  (2)  a  vacuum 
region  which  lies  outside  of  two  horizontal  planes  which  just  touch  the  outermost 
atomic  spheres,  and  ( 3 )  an  interstitial  region.  In  the  spheres,  the  numerical  integrals 
are  factored  into  a  radial  integral  on  an  exponential  mesh  and  an  angular  integral 
using  Gaussian  quadrature  including  all  spherical  harmonics  through  1  =  12.  In 
the  vacuum  region,  the  integral  is  divided  into  a  z-integral  on  an  exponential  mesh 
and  an  x,>'  integral  on  a  uniform  mesh  exhibiting  the  symmetry  of  the  2d  lattice. 
In  the  interstitial  region,  the  integral  is  calculated  with  a  simple  histogram  technique 
on  a  very  dense  uniform  mesh  whose  x,y  coordinates  exhibit  the  symmetry  of  the 
2d  lattice.  This  numerical  integration  scheme  is  substantially  more  accurate  (and 
requires  less  time)  than  the  numerical  integration  scheme  used  in  Ref.  [6]. 

The  most  important  difference  between  the  present  implementation  of  the 
LCGTO-FF  technique  and  other  GTO  fitting  function  methods  is  in  the  xc  fitting. 
In  FILMS,  the  XC  fitting  procedure  is  referenced  to  the  fitted  charge  density 
[p(  r)]  in  place  of  the  exact  density  [p(  r)]  [1];  i.e.,  once  p(  r)  has  been  found,  it  is 
used  in  place  of  p(  7 )  in  all  xc  expressions.  This  approximation  produces  a  significant 
speedup  in  the  calculations.  Forming  p(7)  at  a  given  point  on  the  numerical  in¬ 
tegration  mesh  only  requires  summing  over  Nc  fitting  functions,  whereas  forming 
p(7)  requires  summing  over  Nl  products  of  orbital  basis  functions  for  inequi¬ 
valent  Bloch  vectors  (X)  in  the  Brillouin  zone.  Thus  using  p(  7)  in  place  of  p(7) 
reduces  the  time  required  for  the  xc  fit  by  a  factor  of  (.NINi,/Nc)',  typically  more 
than  two  orders  of  magnitude.  As  an  added  benefit,  the  use  of  p(7)  also  produces 
a  more  stable  xc  energy  calculation  by  replacing  the  usual  rather  slowly  converging 
numerical  Brillouin  zone  integrations  with  analytical  real  space  integrals. 

Unfortunately,  the  use  of  p(7)  in  the  xc  fit  introduces  a  source  of  potential 
difficulties  in  the  lcgto-ff  calculations.  The  variational  charge  fitting  procedure 
ensures  that  the  error  in  the  Coulomb  energy  will  be  minimized.  However,  there 
is  no  proven  way  to  constrain  the  fitted  density  to  be  positive  everywhere.  Thus, 
in  some  cases,  the  charge  fit  procedure  will  yield  a  fitted  density  which  is  negative 
in  the  far  vacuum  region  (in  essence  sacrificing  the  far  region  to  get  a  better  de¬ 
scription  closer  to  the  nucleus).  Although  this  negative  density  has  negligible  impact 
on  the  Coulomb  energy,  it  can  alter  the  work  function  #  of  a  film  significantly  by 
spuriously  reducing  the  dipole  barrier.  In  addition,  there  is  no  meaningful  way  to 
calculate  the  xc  kernels  for  a  negative  p(7).  For  these  reasons,  it  has  been  found 
that  the  best  solution  to  this  problem  is  to  be  careful  to  select  a  charge  fit  basis 
which  does  not  produce  any  negative  density.  If  a  charge  basis  does  produce  a 
negative  density,  it  can  usually  be  removed  by  using  a  slightly  larger  exponent  for 
the  most  diffuse  5-type  Gaussian.  It  has  been  found  generally  that  the  work  function 


1 50 

and  cohesive  energy  /'or  a  given  system  arc  relatively  insensitive  to  the  choice  of 
tile  charge  (it  basis  provided  there  is  no  negative  density. 

I'he  generalization  of  the  i.ca  ro-rr  technique  just  described  to  include  spin- 
polarization  cll'ects  is  straightforward.  Instead  of  fitting  only  the  total  charge  density, 
the  spin-up  and  spin-down  densities  mu.st  be  fitted  independently.  Then  spin-up 
and  spin-down  parts  of  the  xc  energy  density  and  potential  can  be  formed  and 
fitted  on  the  numerical  integration  mesh.  The  only  significant  concern  in  this  pro¬ 
cedure  lies  in  the  fact  that  the  minority  spin  charge  density  will  generally  be  more 
localizeil  than  the  majority  spin  or  total  charge  density.  Thus  the  fitted  minority 
spin  charge  density  is  more  apt  to  have  regions  of  negative  density,  Although  such 
a  result  would  not  alfect  the  Coulomb  energy  or  the  work  function,  provided  the 
total  charge  density  is  positive  everywhere,  it  could  adversely  alfect  the  xc  energy. 
This  point  will  be  discussed  further  below. 


/y^/v/.v  .V/W 

It  has  been  found  that  for  atoms  in  the  outer  layer  of  a  iri  i .  the  .v-type  and  /1,- 
type  orbital  biisis  sets  should  be  atomic-like  (dilfuse).  while  the  />,-typc  and  />y-typc 
basis  sets  should  be  cry.stallinc-like  (well  localized)  (10),  Similar  guidelines  for  r/- 
states  can  be  inferred.  For  the  .seven  .Id  transition  metals  considered  here  (Ti,  V. 
(V.  Mn.  l  e.  Co.  and  Ni),  the  orbital  ba.sis  sets  u.sed  are  derived  from  Huzinaga’s 
1 1 1 1  ( 14.d>/;5fy)  atomic  basis  sets. 

For  each  system,  the  .v-type  orbital  basis  is  formed  by  contracting  the  live  most 
local  (rros  in  the  atomic  basis  using  the  expansion  coefficients  of  the  Is  atomic 
orbital  as  contraction  coefficient.s.  For  the  p-type  orbital  ba.ses.  the  four  most  local 
atomic  (t  ios  have  been  contracted  using  the  2p  orbital  cocificicnts,  Bccau.se  the 
highest  occupied  /i-statc  in  the  atom  (3p)  is  quite  local,  the  />-lypc  bases  for  the 
monolayers  have  been  augmented  with  a  evro  whose  exponent  is  0. 1 4  for  all  atoms, 
except  Ni  for  which  an  exponent  of  0, 1 5  was  preferred.  The  />.  manifold  was  further 
augmented  with  a  err  c)  of  exponent  0,06.  For  the  </-typc  orbital  basi.s,  the  four  most 
local  atomic  c;  ros  were  contracted  using  3d  orbital  cocnjcicnt.s.  All  of  the  </-type 
ba.ses  used  were  augmented  with  one  additional  CTO  whose  exponent  depended  on 
the  atom  under  consideration  (0,16 — Ti.  V,  Cr;  0.18 — Mn;  0,20 — Fc,  Co,  Ni), 
To  avoid  linear  dependencies,  the  exponent  of  the  most  diffuse  r/-typc  atomic 
c;  K)  for  Ti  and  V  was  replaced  with  0,32;  twice  the  value  of  the  exponent  used  to 
augment  the  basis,  ("or  all  of  the  monolayers,  the  final  orbital  basi.s  .set  u.sed  was  a 
( I4vl0/;6r/ 1- 1/^,)  primitive  basis  contracted  to  a  ( IO.v7/;3</+ 1  /;.-)  basi.s. 

In  keeping  with  previous  experience,  the  fitting  function  ba.sis  sets  for  the  charge 
fit  and  X(  fit  have  been  chosen  to  be  identical  (the  fit  basis)  (12).  'fhe  symmetries 
of  the  monolayer  dictate  that  only  two  types  of  CiTOs  (with  I  rc  2)  can  appear  in 
the  lit  basis;  ,v  and  i /2f  . ,  J,.  A  .single  1 1  c 70  .v-type  ba.si.s  has  been  u.sed  for  all 
of  the  monolayers  considered  here.  Three  distinct  3  c  ro  r/-typc  ba.sis  .sets  were  u.sed 
depending  on  the  atom  under  consideration.  All  of  the  fit  function  exponents  used 
here  arc  given  in  Table  I.  Note  that  for  the  fighter  atom.s,  the  r7-typc  exponents 
used  arc  larger.  Localization  of  the  d  basis  .set  was  ncccs.sary  for  those  sy.stcms  to 
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Table  I.  Fitting  function  exponents  for  the  seven  transition 
metal  monolayers.  One  s-type  basis  was  used  for  all  seven  systems. 
Three  distinct  d-type  bases  were  used  depending  on  the  specific 
atom. 


r-type 

4-type 

All  monolayers 

Ti.  V,  Cr,  Mn 

Fe 

Co,  Ni 

3000000 

3  80 

3.00 

3.00 

4500.00 

1.80 

1.20 

0.90 

750.00 

0.80 

0.50 

0.30 

240.00 

80.00 

27.00 

9.00 

3.00 

0.90 

0.27 

0.08 


avoid  negative  values  of  the  minority  spin  density,  as  discussed  above.  Test  cal¬ 
culations  on  Mn  using  the  most  diffuse  and  the  most  local  of  the  three  tf-type  fit 
bases  produced  roughly  a  10  mRy  shift  in  the  total  energy  (an  acceptable  variation 
for  these  systems)  and  roughly  a  20  mRy  shift  in  The  shift  in  (!>  is  larger  than 
would  generally  be  considered  acceptable.  However,  this  shift  is  not  indicative  of 
the  imprecision  of  the  final  value  used  for  $  because  the  more  diffuse  basis  produces 
negative  electron  density  in  the  far  vacuum  region  which,  as  was  noted  above,  has 
disastrous  consequences  for  4>. 

Additional  Computational  Details 

The  real-space  numerical  integration  mesh  used  in  these  calculations  was  of  the 
general  form  described  earlier.  The  atomic  sphere  region  included  9 1  exponentially 
spaced  radial  points  and  28  nonequivalent  angular  points.  In  each  vacuum  region, 
1 5  exponentially  spaced  z-points  and  36  nonequivalent  symmetrically  distributed 
points  were  used.  A  total  of  3673  uniformly  distributed  points  were  used  for 
the  interstitial  integration.  Thus,  the  real-space  numerical  integration  mesh  included 
a  total  of  7274  points  in  the  UTF  unit  cell. 

In  all  of  these  calculations,  the  irreducible  triangle  of  the  Brillouin  zone  ( BZ )  was 
sampled  at  36  uniformly  distributed  Bloch  vectors.  The  necessary  BZ  integrations 
were  carried  out  using  the  linear  triangle  technique.  The  SCF  cycle  was  iterated 
until  the  total  energy  was  converged  to  less  than  0.0 1  mRy.  In  all  cases,  the  spin 
magnetic  moment  p.  was  converged  to  0.001  mb- 

Results 

The  most  fundamental  quantity  yielded  by  any  LSDA  calculation  is  the  total 
energy.  For  this  reason,  it  is  important  to  obtain  the  cohesive  energy  (Er.i)  of  the 
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Table  II.  Atomic  energies  (Ea),  monolayer  cohesive  energies 
(Et,\),  and  bulk  cohesive  energies  (Ecj^  from  Ref.  (4];  ail  in  eV. 


System 

Ea 

Ec, 

Ec.b 

Ti 

23055.438 

4.04 

6.12 

V 

25625.982 

3.27 

6.12 

Cr 

28358.633 

2.27 

4.82 

Mn 

31254.251 

2.66 

4.95 

Fe 

34317.464 

3.35 

6.26 

Co 

37553.985 

3.30 

6.48 

Ni 

40968  036 

in 

5.70 

monolayers  even  though  they  have  not  been  geometry  optimized.  Table  II  lists 
total  LSDA  energies  for  isolated  atoms  of  Ti,  V,  Cr,  Mn,  Fe,  Co,  and  Ni  calculated 
using  the  same  orbital  basis  sets  as  were  used  for  the  monolayers;  except  that  the 

and  /?,  basis  sets  are  augmented  to  make  them  identical  to  the  p,  basis.  The 
atomic  configurations  were  assumed  to  be  identical  to  those  given  in  Ref.  [4] .  The 
largest  sources  of  error  in  Ec.\  are  two  inconsistencies  between  the  atomic  and 
monolayer  calculations:  ( 1 )  the  atom  calculation  did  not  use  fitting  functions  and 
(2)  the  atom  calculation  did  not  include  off-site  orbital  basis  functions.  These  in¬ 
consistencies  may  introduce  an  error  in  on  the  order  of  a  few  tenths  of  an  eV. 
In  Table  11,  the  present  values  of  £t,i  are  compared  with  bulk  cohesive  energies 
(£./,)  reported  by  Ref.  [4] .  The  ratios  Ec,\IE^^h  for  these  systems  all  lie  in  the  range 
0.47  to  0.66;  smaller  than  has  been  found  in  other  monolayer  systems  [3].  Given 
the  fact  that  the  monolayers  are  calculated  at  expanded  nearest  neighbor  distances, 
these  values  for  £■<.,!  are  quite  sensible. 

In  Table  III,  the  total  spin  magnetic  moments  obtained  here  for  the  3d  transition 
metal  monolayers  are  compared  with  those  reported  by  Ref.  [  5  ]  at  the  same  lattice 


Table  III.  The  total  spin  magnetic  moments  and  muffin- 
tin-sphere  spin  magnetic  moments  found  here  for  the 
transition  metal  monolayers  using  the  lcgto-ff  method  are 
compared  to  the  values  of  ftmo  found  in  Ref  [5]  with  the  FLAPW 
method  and  the  bulk  values  of  n,  from  Ref.  [4);  all  in 


System 

LCGTO-FF 

FLAPW 

Bulk 

fil 

Mmit 

Ti 

2.05 

1.62 

1.72 

V 

3.10 

2.70 

2.87 

— 

Cr 

5.04 

4.52 

4.50 

— 

Mn 

4.57 

4.33 

4.32 

— 

Fe 

3.32 

3.27 

3  29 

2.15 

Co 

2.13 

2.15 

2  20 

1.56 

Ni 

0.96 

0.98 

1.02 

0.59 
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Table  IV.  The  theoretical  work  functions  (4>i)  for  the  transition 
metal  monolayers  are  compared  with  experimental  surface  work 
functions  (4>,)  given  in  Ref.  i  1 3).  Theoretical  values  of  4>,  obtained 
in  Ref.  [14]  with  the  surface  Green's  function  method  are  also 
given.  (All  values  are  in  eV). 


System 

4>, 

'!>,  exp. 

41,  theo. 

Ti 

5.01 

4.33 

V 

5.30 

4.3 

— 

Cr 

5.08 

4.5 

Mn 

4.11 

4.1 

— 

Fe 

4.73 

4.67 

4.50 

Co 

5.24 

5.0 

5.52 

Ni 

5.41 

5.22 

5.75 

constant  using  the  same  LSDA.  Since  the  values  given  in  Ref.  [5]  are  calculated 
inside  of  muffin-tin  spheres,  equivalent  muffin-tin  values  are  given  for  the  present 
calculations.  In  all  cases  there  is  good  agreement  between  the  FLAPW  and  lcgto- 
FF  muffin-tin  sphere  values,  demonstrating  the  high  quality  of  the  present  results. 
It  is  also  worth  noting  that  the  muffin-tin  sphere  values  of  ix  seriously  underestimate 
the  total  values  for  Ti,  V,  Cr,  and  Mn.  This  result  is  due  to  the  rather  diffuse 
character  of  the  majority  spin  d-states  for  the  lighter  transition  metals.  Table  III, 
also  includes  theoretical  bulk  values  of  fi  reported  by  Ref  [4]  for  Fe,  Co,  and  Ni; 
using  the  same  lsda  as  was  used  here.  Comparison  of  the  monolayer  and  bulk 
values  reveals  the  now  standard  enhancement  of  the  spin  magnetic  moment  of  the 
monolayer  relative  to  the  bulk. 

One  additional  calculated  quantity  which  bears  a  careful  examination  is  the  work 
function  ('I>).  In  Table  IV,  values  of  #  are  reported  for  all  of  the  systems  studied 
here,  together  with  experimental  surface  values  [13].  Table  IV  also  includes  theo¬ 
retical  values  of  $  for  Fe,  Co,  and  Ni  surfaces  (with  square  symmetries)  obtained 
with  a  surface  Green’s  function  technique  [14],  Given  the  differences  in  the  types 
of  systems  considered  (expanded  monolayer  versus  surface),  the  present  results  are 
in  reasonable  agreement  with  the  experimental  values,  except  for  the  lighter  Z 
systems.  The  larger  difference  for  Ti,  V,  and  Cr  may  be  due  to  the  fact  that  the 
monolayer  is  spin-polarized  while  the  surface  is  not.  Overall,  there  is  no  evidence 
of  difficulties  with  the  new  spin-polarized  version  of  FILMS. 


Conclusions 

The  calculations  presented  here  demonstrate  that  the  new  spin-polarized  version 
of  FILMS  is  capable  of  producing  reliable  results  for  transition  metals.  This  method 
could  be  further  improved  by  allowing  different  fitting  function  basis  sets  to  be  used 
for  the  majority  spin  and  minority  spin  charge  and  xc  functions.  A  further  useful 
development  would  be  inclusion  of  relativistic  effects. 
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The  present  calculations  have  also  demonstrated  the  need  to  include  the  interstitial 
contributions  to  the  spin  magnetic  moment.  For  hirl  y  polarized  materials,  using 
only  the  muffin-tin  sphere  portion  of  the  magnetic  momerit  may  result  in  as  much 
as  a  20%  underestimate  of  fi. 
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The  clcvclopmcnl  of  new  inorganic  materials  at  Corning.  Inc,  has  a  long  and 
distinguished  history.  A  short  and  necessarily  incomplete  list  of  important  materials 
discoveries  at  Corning  includes  low  expansion  borosilicaic  glasses  such  as  Pyrex,'^ 
ultra-low  expansion  titanium-silicate  glasses  made  by  flame  hydrolysis,  silicones, 
fiberglass®,  the  complex  field  of  glass-ceramics  which  ranges  from  ultra-high  strength 
materials  to  machinable  mica-based  forms,  photochromic  glasses,  x-ray  absorbing 
television  glasses,  extrudable  oriented  ceramics,  and  the  ultra-pure  materials  for 
optical  fibers.  While  extremely  successful,  the  development  of  new  materials  at 
Corning  has  been  a  diflicult  process.  From  the  start  of  a  research  project  until  the 
manufacture  of  a  commercial  product  can  ike  from  5  to  10  years,  and  the  vast 
majority  of  research  projects  fail.  One  important  factor  at  the  heart  of  the  long 
development  times  and  the  high  failure  rate  is  a  lack  of  fundamental  understanding 
concerning  reactions  and  structure  in  amorphous  materials.  New  materials  arc 
developed  by  a  mixture  of  intuition  and  empiricism. 

From  the  time  of  Zachariason  {!],  we  have  had  reasonable  ideas  about  glass 
structure.  In  summary,  first  neighbor  environments  look  almost  exactly  the  same 
as  they  do  in  crystals,  but  these  building  blocks  of  first  neighbor  configurations  arc 
put  together  in  a  continuous  random  network.  Electro-neutrality  is  preserved  in 
the  smallest  possible  volume  consistent  with  ion  sizes.  Unfortunately,  the  bonding 
diflcrcnccs  between  a  glass  such  as  fu.scd  silica  and  the  crystal  structure  of  quartz 
are  reasonably  subtle.  We  know  the  structure  of  quartz  througli  the  analysis  of  x- 
ray  diff  raction  experiments.  The  repeating  nature  of  the  quartz  crystal  gives  many 
diffraction  peaks  which  allow  the  structure  to  be  unambiguously  extracted.  Fused 
silica  has  no  repeating  units  other  than  the  Si04  tetrahedra  and  the  strong  diffraction 
peaks  arc  related  to  these.  The  bonding  differences  between  crystalline  and  amor¬ 
phous  silica  occur  primarily  at  third  neighbors  and  beyond,  and  it  is  precisely  this 
information  which  is  washed  out  by  diffraction  from  the  amorphous  structure. 
Most  other  probes  again  give  strong  information  about  first  neighbors,  weak  results 
about  second,  and  arc  washed  out  by  third  neighbors.  Thus  we  arc  not  certain 
about  the  structure  of  the  most  prototypical  of  all  glasses,  fused  silica,  which  serves 
as  a  backbone  for  most  of  the  silicate  glasses. 

Doris  Evans,  an  x-ray  crystallographcr  at  Corning,  built  a  model  of  fused  silica 
in  the  mid  1960s  [2]  to  see  if  a  model  could  be  built  with  the  constraint  that  the 
only  construction  variable  was  the  bonding  angle  between  Si04  tetrahedra.  She 
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built  a  model  of  over  1200  tetrahedra  and  was  able  to  convince  herself  that  the 
procedure  could  be  continued  indefinitely.  A  careful  analysis  of  the  model  showed 
that  it  could  nearly  reproduce  most  of  the  known  properties  of  fused  silica.  The 
most  important  discrepancy  was  m  the  density.  A  careful  study  of  variations  in  the 
density  showed  that  it  could  be  varied  over  10%  without  rearranging  the  bonding 
or  violating  the  constraint  of  perfect  tetrahedra  [  3  ] .  There  had  to  be  something 
other  than  a  geometric  constraint  which  determined  the  final  values  of  the  physical 
properties.  Clearly  structure^  arrange  themselves  to  minimize  the  free  energy  at 
elevated  temperatures.  At  low  temperatures,  the  product  of  temperature  and  entropy 
becomes  negligible  and  the  free  energy  converges  to  the  potential  energy.  Fused 
silica  is  a  child  of  the  elevated  temperatures  at  which  its  bonding  arrangements  are 
formed  and  the  relaxation  in  geometry  but  not  bonding  ai  lower  temperatures. 
Consequently  it  was  felt  that  tne  only  way  to  attack  the  problem  of  molecular 
structure  of  glasses  was  through  molecular  dynamical  modeling  at  high  temperatures 
which  could  properly  account  for  the  dynamic  effects  of  entropy  coupled  with  a 
slow  decrease  in  temperature  to  relax  the  geometry. 

Born-Meyer-Huggins  pair  potentials  were  created  for  silicon  ions  and  oxygen 
ions,  and  1  performed  molecular  modeling  for  fused  silica  for  over  1 .000  ions.  This 
led  to  a  model  which  resembled  the  model  of  Evans  but  was  different  in  many  of 
the  details.  She  then  asked  me  if  my  potentials  could  reproduce  the  structure  of 
the  crystalline  polymorphs  of  silica,  i.e.,  quartz,  cristobalite.  cohesite.  tridymite, 
and  stishovite.  Since  1  was  claiming  to  be  able  to  model  amorphous  Si02, 1  should 
be  able  to  reproduce  the  periodic  structures  that  silica  found  to  be  the  most  natural 
ways  to  arrange  itself  1  found  to  my  dismay  that  I  could  not  reproduce  such  struc¬ 
tures.  This  led  to  Evans  Law  of  Glass  Structure:  “Accept  no  model  of  glass  structure 
which  cannot  leprod'  ce  as  stable  structures  the  underlying  crystalline  polymorphs 
of  similar  compositio  i.” 

In  the  early  1980s,  the  need  for  a  first  principles  treatment  of  glass  chemistry 
was  becoming  more  urgent,  and  I  undertook  a  new  survey  of  the  field  to  find 
changes  and  opportunities.  At  mit,  talks  with  Professor  John  Joannopoulos  con¬ 
vinced  me  that  the  state  of  the  art  of  density-functional  quantum  mechanics  de¬ 
veloped  by  Kohn  and  Sham  [4]  had  reached  the  point  where  solids  could  be  re¬ 
alistically  modeled  without  parameterization. 

The  typical  condensed  matter  calculation  modeled  a  semiconductor  with  a  few 
atoms  in  a  periodic  cell.  Planewaves  and  pseudopotentials  were  used,  and  to  model 
silicon  required  approximately  50  to  100  planewaves  per  atom.  The  diagonalization 
of  matrices  a  few  hundred  square  was  not  a  demanding  task,  but  since  the  com¬ 
putational  effort  to  carry  out  diagonalization  grew  as  the  number  of  planewaves 
cubed,  much  bigger  problems  were  computationally  intractible.  Because  of  the 
deep  nonlocal  pseudopotentials  required  to  accurately  model  oxygen,  oxygen  re¬ 
quires  ten  times  as  many  planewaves  per  atom  as  silicon.  To  simulate  quartz  would 
require  the  diagonalization  of  10.000  by  10.000  matrices. 

Douglas  Allan  and  1  at  Corning  had  the  opportunity  to  develop  a  new  ceJe  from 
scratch.  The  first  analysis  we  performed  was  with  respect  to  basis  sets.  Table  1  shows 
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Table  I.  Piopertics  of  vanous  bases. 


Planewaves 

Orbitals 

Muffin 

tins 

Finite 

elements 

Orthonormal  cuir.pictc 

Yes 

No 

No 

No 

Fast  transform 

Yes 

No 

No 

Yes 

Hi  resolution  with 
small  number 

No 

Yes 

Yes 

Yes 

Easy  solution  to 

Poisson’s  cqn 

Yes 

No 

No 

No 

Exact  density 

Yes 

No 

No 

Yes 

No  Pulay  eorrections 

Yes 

No 

No 

No 

All-electron  for  all  Z 

No 

Yes 

Yes 

Yes 

Easy  periodic 
implementation 

Yes 

No 

Yes 

No 

Fast  Iterative  solution 
methods 

Yes 

No 

No 

No 

Natural  X  —  C 
integration  grid 

Yes 

No 

No 

Yes 

Easy  to  code 

Yes 

No 

No 

No 

the  various  basis  sets  in  common  use  at  the  time  and  their  relative  advantages  and 
disadvantages.  The  ideal  basis  set  would  clearly  have  no  defects. 

We  generated  simple  plane-wave,  gaussian,  and  finite-element  codes  and  began 
comparing  performances  on  model  problems.  In  1985,  Car  and  Parrinello  [5]  wrote 
an  article  on  quantum-mechanical  molecular  dynamics  which  emphasized  the  it¬ 
erative  approach  tc  matrix  diagonalization  and  showed  how  to  derive  an  eigenvector 
by  a  method  which  scaled  nearly  linearly  with  the  number  of  planewaves.  The  new 
method  required  local  pseudopotentials  which  would  not  work  for  oxygen  and  was 
still  too  slow,  but  we  began  a  systematic  development  of  a  planewave  program 
because  the  possibility  of  major  performance  improvement  was  significant.  The 
nonlocal  pseudopotentials  of  Hamann,  Schluter  and  Chiang  [  6  ]  were  generating 
physical  properties  whose  errors  were  at  the  few  percent  level  and  lookeo  extremely 
promising.  We  began  a  program  to  see  if  we  could  utilize  these  nonlocal  pseudo- 


Tablf  II.  <(-Quartz  structure  (tngonal  1 ) 


Parameter 

Experiment 

Theory 

A(%) 

a  (nm) 

0.491.14 

0  4880 

-0.68 

(•(nm) 

0  54052 

0.5.’’70 

-0  65 

Si(x) 

0.46987  (9) 

0.46.18 

-1.3 

0(.\) 

0.4141  (2) 

0.4081 

-1.4 

O  ( r) 

0.2681  (2) 

0.2758 

+2  9 

0(;) 

0.2855  (1) 

0  2782 

-2.6 
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Table  III.  «-Cnstobalite  structure  (tetragonal  F4,2,2). 


Parameter 

Experiment 

Theory 

A(%) 

a  (nm) 

049570(1) 

0.495 

0.04 

;•  (nm) 

0.68903  (2) 

0.6906 

0.23 

Si  {X  =  y) 

0.3047  (2) 

0.3030 

-0.56 

0(x) 

0  2381  (2) 

0.2380 

-0.04 

0(.v) 

0  1109  (2) 

0.1112 

0.27 

0(.-) 

0.1826(1) 

0.1825 

-0.05 

potentials  in  a  Car-Parrinello  approach.  We  were  aided  by  an  article  by  Kleinman 
and  Bylander  who  showed  how  to  generate  nonlocal  pseudopotentials  which  were 
separable  in  momentum  space  and  which  scaled  linearly  in  the  number  of  plane 
waves.  Unfortunately  their  article  came  out  before  the  advent  of  fast  iterative  solution 
methods,  so  the  initial  advantages  of  using  their  methods  were  minimal  since  so¬ 
lution  time  dominated  matrix  setup  time.  The  real  importance  of  their  approach 
was  developed  later. 

Using  the  new  separable  pseudopotentials,  Allen  and  Teter  [  7  ]  showed  how  oxides 
could  be  modeled  successfully,  and  Teter,  Payne,  and  Allen  [8]  showed  how  ei¬ 
genvectors  could  be  extracted  most  efficiently.  The  details  of  the  resulting  iterative 
diagonalization  along  with  a  review  of  the  planewave  pseudopotential  method  may 
be  found  in  the  reviev/  article  by  Payne  et  al.  [9].  For  the  first  time,  this  made  the 
pianewave  pseudopotential  treatment  of  oxides  viable.  We  then  began  the  systematic 
treatment  of  silicates  using  ab  initio  methods  and  have  fulfilled  Doris  Evans’  chal¬ 
lenge  that  all  crystalline  polymorphs  of  silica  be  faithfully  modeled  by  this  method. 
The  tables  below  show  the  calculated  versus  measured  structural  parameters  for 
several  silica  and  titania  crystals  [10].  These  are  reproduced  in  Tables  II  through  V. 

After  developing  a  general  response  functional  capability  within  the  planewave 
code,  the  optical  phonons  in  quartz  were  simulated  [11].  The  results  were  with  a 
few  percent  of  experiment,  but  more  importantly,  the  interaction  between  ions 
could  be  quantitatively  mea':ured.  A  study  of  the  Bom  Effective-Charge  Tensor  for 
oxygen  shows  that  no  model  which  treats  oxygen  ions  as  charged  balls  can  effectively 
model  the  dynamics  in  oxides.  The  polarization  of  the  ions  is  at  least  as  important 
as  their  charge  in  determining  the  correct  interactions,  and  this  is  fundamentally  a 
quantum-mechanical  effect. 


Table  IV.  Stishovitc  structure  (tetragonal  /Mj/mnm). 


Parameter 

Experiment 

Theory 

A(%) 

a  (nm) 

0.41773(1) 

0.4255 

1.9 

f  (nm) 

0  26655(1) 

0.2604 

-2.3 

0  '.X  =  .1) 

0.30614(3) 

0..3082 

0  6'’ 
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Table  V.  Rutile  (TiOiI  structure  (tctiagonal  PA^Imnm). 


Parameter 

Expenmeni 

Theory 

A(%) 

a  (nm) 

0.4594 

0.4584 

-0.22 

t(nm) 

0  2958 

0.2961 

0.10 

0  (A-  =  y) 

0.305 

0.3044 

-0.20 

Having  generated  an  ab  initio  code  for  studying  oxides  we  began  the  study  of 
defects  in  crystalline  structures  such  as  oxygen  vacancies  and  interstitials.  We  found 
interesting  dynamic  reactions  involving  these  defects  whose  barriers  were  c  f  the 
order  of  a  few  tenths  of  an  eV.  The  question  naturally  arose  as  to  whether  eitiier 
LDA  or  the  pseudopotentials  were  really  this  accurate.  Since  the  solutions  methods 
were  now  really  fast  for  hundreds  of  thousands  of  grid  points  we  began  a  systematic 
investigation  of  the  all-electron  behavior  of  carbon.  The  finding  was  that  under 
compression,  the  norm-conserving  pseudopotentials  had  deviations  in  the  total 
energy  versus  lattice  constant  curves  when  compared  with  the  all  electron  results 
of  the  order  of  1.5  ev.  These  results  are  shown  in  Table  VI.  This  was  clearly  too 
much  error  in  the  pseudopotential  approximation  to  trust  the  earlier  results.  As  a 
consequence,  a  great  deal  of  effort  was  expended  to  determine  the  major  source  of 
error  remaining  in  the  pseudopotential  approximation.  This  was  determined  to  be 


Table  VI.  Energies  for  diamond  under  iMstatic  compression  for  norm-conserving  pseudopotential, 
all-elearoii  and  extended  norm  and  hardness<onserving  pseudopotential.  A  constant  has  been  subtracted 
from  both  pseudopotential  calr-ulations  to  make  their  average  energies  the  same  as  the  all-electron  case. 
Also  included  are  the  standard  deviations  of  the  differences  from  the  all-electron  case,  the  equilibrium 
lattice  constant  a<,,  the  bulk  modulus  B,  and  the  pressure  derivative  of  the  bulk  modulus. 


Lattice  constant 
(bohr) 

Norm<ons. 

-63.112325 

(hartree) 

All-electron 

(hartree) 

ENHC 

-62.361247 

(hartree) 

5.0 

-73.559096 

-73.679091 

-73.677824 

5.8 

-74.206141 

-74.242149 

-74.240666 

60 

-74.281435 

-74.301995 

-74.300981 

62 

-74.334449 

-74.342446 

-74.340841 

6.4 

-74.360408 

-74.364967 

-74.364244 

6.6 

-74.389790 

-74.374818 

-74.374361 

6.8 

-74.398456 

-74.373822 

-74.373871 

7.0 

-74.397801 

-74.364842 

-74.365143 

7.2 

-74.389796 

-74.348265 

-74.350026 

8.0 

-74.313485 

-74.247460 

-74.251898 

a  (hartree) 

0.0492 

0.0018 

aO  (bohr) 

6.882 

6.678 

6.689 

B  (hartree/bohr’) 

0.01362 

0.01628 

0.01571 

dB/dP 

3.728 

3.599 

3.648 
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Table  VII.  Structure  of  a-quartz  for  norm-conserving,  hardness 
and  extended  nonn<onserving  pseudopotentials  and  experiment. 


Parameter 

Norm-cons. 

HENC 

Experiment 

a  (bohr) 

9  363 

9.285 

9.290 

c{bohr) 

10.304 

10.209 

10.215 

Si  A 

.4638 

.4688 

.4697 

O.v 

.4081 

.4140 

.4135 

0.r 

.2758 

.2699 

.2669 

Or 

.2782 

.2839 

.2858 

a  difference  between  the  responses  of  the  atom  and  pseudoatom  with  regard  to 
changes  in  occupancies  of  the  valence  states.  A  prescription  for  forcing  the  pseu¬ 
dopotential  to  mimic  the  correct  response  as  well  as  fulfill  the  earlier  constraints 
led  to  more  than  an  order  of  magnitude  reduction  in  the  pseudopotential  error, 
and  typical  differences  between  the  all-electron  results  and  the  pseudopotential 
results  were  .05  e\'  [12].  Because  these  pseudopotentials  give  response  correct  to 
second  order  in  arbitrary  occupancy  changes  in  the  valence  states,  they  are  called 
Hardness-Conserving  pseudopotentials.  The  structure  of  quartz  was  relaxed  using 
the  new  pseudopotentials  with  the  basis  set  taken  to  energy  convergence  well  below 
the  millihartree  level.  The  results  are  in  Table  VII.  You  will  notice  that  the  exper¬ 
imental  numbers  are  different  from  those  in  Table  II.  For  the  earlier  work,  a  room 
temperature  measurement  for  the  silica  structure  was  used.  Since  we  had  every 
reason  to  believe  that  we  had  significantly  reduced  the  error  in  the  structure,  we 
found  a  zero  degree  kelvin  measurement.  The  error  when  compared  with  experiment 
has  been  reduced  by  over  an  order  of  magnitude.  The  same  pseudopotentials  were 
then  used  to  regenerate  the  optical  phonon  frequencies  and  it  was  found  that  the 
errors  dropped  from  around  5%  to  below  1%.  For  the  first  time  we  were  able  to  see 
systematic  deviations  in  the  frequency  errors.  The  lower  the  optical  frequency  mode, 
the  more  the  harmonic  frequency  fell  below  the  experimental  result.  It  is  currently 
conjectured  that  this  difference  is  due  to  anharmonic  corrections  which  are  expected 
to  be  greater  for  lower  frequency  modes,  and  this  is  being  tested.  The  summary  of 
these  results  seems  to  be  that  for  structural  properties  in  solids,  the  LDa  is  better 
than  previously  believed,  and  most  of  the  error  seems  to  be  pseudopotential  error. 
I  also  expect  some  improvement  for  isolated  molecules,  versus  earlier  pseudopo¬ 
tential  calculations,  but  in  these  cases,  the  self-interaction  in  the  tails  of  the  wav- 
efunction  probably  dominates  the  error  more  than  in  a  solid  where  the  density 
never  really  goes  to  zero. 

Having  generated  a  reference  plane-wave  code  and  a  new  genei  ition  of  pseu¬ 
dopotentials,  we  are  now  ready  to  attack  realistic  glass-chemistry  problems.  Un¬ 
fortunately,  this  tool  cannot  be  used  directly.  Glass  is  unique  in  that  the  major 
unknown  in  glass  structure  is  the  bonding  topology.  The  differences  between  bonding 
topologies  in  fused  silica,  and  the  normal  silica  crystals  begin  at  about  third  neigh¬ 
bors.  Clearly  the  correct  bonding  arrangement  is  that  which  minimizes  the  free 
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energy  at  the  glass  formation  temperature.  If  one  wants  to  simulate  infinite  amor¬ 
phous  structures  by  using  very  large  periodic  cells,  then  the  minimum  cell  size 
which  allows  freedom  for  third  neighbors  to  bond  without  interference  with  periodic 
images  is  about  400  atoms.  More  realistically  1,000  to  10,000  atoms  should  be 
modeled.  The  upper  limit  on  the  plane-wave  code  is  about  100  to  200  atoms,  for 
a  few  dozen  dynamic  steps.  We  need  millions  of  dynamic  steps  to  achieve  a  rea¬ 
sonably  annealed  structure.  The  reference  code  must  then  be  used  to  generate  models 
which  can  go  much  faster.  We  have  a  few  rudimentary  ones  under  development 
with  a  few  simple  calculations,  but  it  is  clear  that  a  new  generation  of  molecular 
modeling  which  allows  dynamic  bond  forming  and  breaking  is  needed  to  realistically 
model  glass  chemistry. 

In  the  spirit  of  the  Sanibel  conference,  1  would  like  to  close  with  a  few  words 
concerning  my  view  of  the  future  of  density-functional-based  molecular  modeling; 

1 )  First,  with  regard  to  basis  sets,  I  believe  that  plane-waves  and  pseudopotentials 
will  probably  vanish  like  dinosaurs  in  the  coming  meteoric  collision  of  com¬ 
putational  methods  with  wavelet  representations.  Professor  Joannopoulos’s 
group  at  MIT  has  recently  shown  that  it  is  possible  to  represent  accurately  the 
1 S  state  of  uranium  as  well  as  its  adjustment  under  any  reasonable  perturbation 
with  a  few  hundred  three-dimensional  wavelets  [  1 3  ] .  It  takes  o\  er  1 ,000  plane- 
waves  to  accurately  represent  the  3D  state  of  titanium,  and  over  a  billion  to 
represent  the  IS  state  of  uranium  in  a  box  large  enough  to  simulate  a  significant 
molecule.  Wavelets  seem  to  represent  that  ideal  representation  with  no  defects, 
at  least  within  the  list  shown  in  Table  1.  At  the  moment  both  our  optimism 
and  ignorance  are  unbounded,  and  to  quote  the  immortal  Murphy,  “The 
best  thing  about  a  new  idea  is  that  we  don’t  know  what’s  wrong  with  it.’’ 

2 )  We  in  solid  state  physics  are  seeing  the  development  o*"  new  ab  initio  solution 
methods  which  scale  linearly  in  the  number  of  atoms.  These  are,  at  the  mo¬ 
ment,  ponderous  and  represent  no  threat  to  current  methods.  As  a  prognos¬ 
ticator,  I  feel  that  these  methods  will  dominate  ab  initio  calculations  in  large 
systems  within  5  years. 

3 )  New  density  functionals  hold  the  promise  of  reducing  the  lda  errors  by  an 
order  of  magnitude,  without  a  corresponding  increase  in  computational  effort. 
Some  of  these  are  based  on  gradient  corrections.  Careful  comparison  between 
gradient-corrected  lda  calculations  and  exact  results  from  diffusion  Monte 
Carlo  calculations  show  that  the  gradient  corrections  still  have  serious  diffi¬ 
culties.  I  personally  believe  that  the  most  promising  new  functionals  are  based 
on  an  improved  weighted  density  approach  utilizing  a  nonlocal  integral  for¬ 
mulation  of  the  exchange-correlation  hole. 

4)  Lastly,  1  believe  that  a  new  generation  of  accurate  molecular  modeling  is 
coming  which  will  include  electron  degrees  of  freedom  and  allow  not  only 
reactions,  but  be  fast  enough  to  handle  systems  of  10,000  atoms  for  millions 
of  time  steps.  These  models  will  be  parameterized  directly  from  the  density- 
functional  codes.  While  we  have  one  of  these  under  development  for  glass 
chemistry,  it  is  too  early  to  speak  of  results  other  than  qualitatively  it  seems 
to  work. 
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Abstract 

We  have  used  the  ab  initio  technique  of  UHF/LANLlMB  to  study  the  interactions  between  absorbed 
oxygen  atoms  and  molecules  with  their  host  nickel  lattice.  The  host  lattice  is  approximated  by  a  small 
cluster  of  nickel  atoms  that  are  arranged  as  face-centered-cubic,  except  for  a  tubular  void  descnbed  by 
the  absence  of  three  or  four  lattice  elements.  The  oxygen  absorbates  are  placed,  one  at  time,  within  this 
cavity.  We  conclude  that,  for  tubular  cavities,  the  interaction  between  the  nickel  lattice  and  the  oxygen 
absorbates  is  stronger  than  the  interactions  between  pairs  of  oxygen  atoms,  suggesting  that  the  absorbates 
are  mostly  atomic  (as  apposed  to  diatomic)  in  character,  while  the  (energetically)  most  preferred  number 
of  oxygen  atoms  to  reside  inside  an  n-site,  tubular  hollow  of  nickel  is  simply  (n  -  1 )  6  1993  John  Wiley 
&  Sons.  Inc 


Introduction 

The  stabilization  of  microcavity  voids  in  face-centered  cubic  (fee)  solids  is  believed 
to  result  in  an  increase  in  the  homogeneity  of  the  material  as  well  as  an  increase 
in  the  microstructure  stability  [  1 ,2].  In  turn,  these  changes  are  believed  to  result  in 
an  improvement  of  bulk  properties  such  as  tensile  strength,  fatigue  strength,  and 
creep  resistance  [3],  We  have  undertaken  this  study  in  order  to  gain  fundamental 
understanding  of  void  stabilization  and  its  relation  to  the  microstructure  of  a  ma¬ 
terial.  The  lattice  of  nickel  was  selected,  with  oxygen  (atomic  and  molecular)  im¬ 
purities,  as  a  specific  system  to  study  because  of  a  collaboration  we  have  with 
experimentalists  from  this  lab  who  are  analyzing  the  properties  of  this  system.  The 
term  “void  stabilization”  taken  by  itself  is  ambiguous  in  that  it  is  not  inherently 
clear  what  is  taken  as  the  reference  point.  Ultimately,  one  would  like  to  compare 
the  energies  of  void  defects,  with  and  without  absorbates,  to  the  energy  of  the  pure 
metal  plus  molecular  O2  and  to  the  energies  of  other  kinds  of  defects  (stacking  fault 
or  dis'oeation  loop)  with  and  without  absorbates.  We  feel  the  best  way  to  progress 
toward  this  comparison  is  to  make  a  tendency  toward  their  standard  heats  of  for¬ 
mation  in  the  following  manner: 

Ni„(s)  +  (m/2)02(g)  -►  Ni„Om(s)  (single  defect) ,  AH  =  A°*^Hf, 
rxn  1 :  Ni„(s)  -*•  Ni„(s)  (single  defect) ,  AHi  =  void  formation  energy  , 
rxn  2:  Ni„(s)  (single  defect)  -I-  (m/2)02(g)  -►  Ni„Om(s)  (single  defect) , 

AH2  =  void  stabilization  energy  . 
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The  body  of  this  report  is  concerned  only  with  reaction  2,  the  heat  of  which  we 
define  as  the  void  stabilization  energy.  An  alternative  view  is  presented  as  an  ap¬ 
pendix.  Namely,  a  definition  of  void  stabilization  is  given  by  the  following  equation: 

Ni„0„,(s)  (single  defect)  +  0(g)  Ni„0„+i(s)  (single  defect) 

Method 

The  host  lattice  is  approximated  by  a  small  cluster  of  nickel  atoms  that  are 
arranged  as  face-centered  cubic,  except  for  a  tubular  void  described  by  the  absence 
of  three  or  four  lattice  elements.  The  oxygen  absorbates  are  placed,  one  at  time, 
within  this  cavity.  Elucidation  of  the  electronic  structure  was  accomplished  at  the 
spin-unrestricted  Hartree-Fock  (UHF)  level  with  the  LANLlMB  basis.  The  full  basis 
set  description  has  been  published  elsewhere  [4]  but  will  be  reviewed  here  briefly 
for  completeness.  Each  electron’s  orbital  around  oxygen  is  described  by  three  gauss- 
ian  primitives,  contracted  into  each  of  the  Is  and  2sp  shells.  The  Is  through  3p 
electrons  on  Ni,  taken  to  be  core  electrons,  are  described  by  a  single  pseudopotential, 
while  the  remaining  4s  and  3d  valence  electrons  go  into  orbitals  that  are  described 
by  single  sets  (three  primitives  per  set)  of  s-,  (five)  p-,  and  d-type  functions. 

In  the  interest  of  easing  computational  demands,  while  maintaining  computa¬ 
tional  integrity,  the  optimized  nuclear  configurations  which  are  shown  in  Figure  1 
were  determined  under  the  following  constraints  based  upon  the  symmetry  of  the 
cluster  or  the  bulk.  For  the  reference  structure  all  Ni — Ni  bonds  were  constrained 
to  be  of  the  same  length  (R)  and  all  Ni — Ni — Ni  bond  angles  were  fixed  at  1 80’ 
or  1 20°  as  shown  in  Figure  1 .  The  so-optimized  structure  was  then  taken  as  the 
host  lattice,  which  remained  rigid  for  all  subsequent  calculations.  In  structure  Al, 
the  oxygen  atom  resides  at  a  distance  (R2)  of  4. 126  A  from  the  edge  of  the  cavity. 
A  full  description  of  the  potential  as  a  function  of  the  c-coordinate  of  oxygen  is 
shown  in  Figure  2.  For  structure  B 1 ,  the  O2  bond  distance  (Rq)  was  reoptimized  to 
1 .268  A  from  its  gas  phase,  uhf/lanlI  MB  value  of  j  .2 1 7  A  [5].  Concurrent  to  this 
optimization,  the  distance  from  the  center  of  the  O2  bond  to  the  edge  of  the  cavity 
{Rj)  was  determined.  For  structure  A2  a  £>2/,  symmetry  constraint  was  enforced 
throughout  the  optimization  of  R2,  the  distance  between  atomic  oxygen  and  the 
edge  of  the  cavity.  However,  once  determined,  the  energy  of  this  structure  was 
compared  to  the  energy  of  an  alternative  structure  that  has  one  oxygen  atom  in 
the  center  of  the  cavity  and  the  other  a  distance  of  R2  from  the  edge  of  the  cavity, 
as  well  as  several  others. 

The  structure  A2  shown  in  Figure  1  is  the  lowest  energy  configuration.  The 
values  of  R2  and  R3  were  optimized  in  a  similar  manner  for  structures  B2,  B3,  and 
A3.  Structure  A4  is  simply  an  fee  layer  with  four  nickel  atoms  displaced  by  four 
oxygen  atoms.  Finally,  a  single  point  energy  calculation  was  completed  foi  structure 
B4  as  a  check  on  our  presumption  that  the  AHj  of  these  clusters  will  increase 
quickly  as  the  number  of  atoms  inside  the  void  exceeds  the  number  of  vacancies 
defining  the  cavity. 
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Figure  1.  (a)  The  layout  of  this  figure  is  something  like  a  table  in  that  the  structures  are 
delineated  by  rows  and  columns.  Structure  B2.  for  example,  is  in  column  B  and  row  2. 
(b)  The  opien  circles  represent  nickel  The  plain  shaded  circles  represent  atomic  oxygen. 
The  wavy  shaded  circles  represent  molecular  oxygen,  (c)  All  structures  shown  belong  to 
the  Du  point  group,  except  for  structure  A1  which  belongs  to  the  C2,.  point  group,  (d)  R 
is  the  distance  between  adjacent  nickels.  R„  is  the  distance  between  oxygens,  and  Rj  is  the 
r-position  of  oxygen,  relative  to  the  edge  of  the  box  R3  is  the  c-position  of  the  midpoint 
of  Oj,  relative  to  the  edge  of  the  box 


Results 

One  Oxygen  Atom 

The  behavior  of  a  single  oxygen  atom  trapped  inside  a  tubular  void  of  an  fee 
solid  is  interesting  not  only  beeause  it  lays  the  foundation  for  understanding  void 
stabilization,  but  also  beeause  it  displays  at  once  several  fundamental  aspects  of  all 
physical  systems.  Its  behavior  is  influenced  by  symmetry,  by  a  periodicity  of  the 
field  generated  by  the  host  metal,  and  by  edge  effects  which  are  akin  to  the  particle- 
in-a-box  model.  It  was  shown  many  years  ago  [6]  that  any  molecular  system,  char¬ 
acterized  by  a  nondegenerate  electronic  state  which  is  at  a  stationary  point  within 
point  group  G,  will  also  be  at  a  stationary  point  within  any  subgroup  of  G.  The 
highest-order  point  group  to  which  this  Ni/0  cluster  (oxygen  trapped  within  a  foui- 
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Figure  2.  The  solid  line  represents  a  fit  to  the  ab  initio  data;  represented  by  open  circles. 


site  tubular  cavity  in  nickel)  conceivably  could  belong  is  D2h-  This  conformation 
has  oxygen  at  the  center  of  the  void,  and  it  must  correspond  to  a  local  stationary 
point.  In  the  bulk  system,  that  we  are  simulating,  the  potential  along  the  (a)  direction 
is  exactly  the  same  as  the  potential  along  the  (j’')  direction,  where  the  y'  direction 
is  perpendicular  to  the  z-axis  while  it  makes  an  angle  of  60°  with  the  x-axis,  and 
the  z-axis  runs  along  the  length  of  the  void.  In  our  model,  the  width  of  the  void  is 
4.887  A,  just  10%  more  than  twice  the  uhf/lanlImb  bond  length  (2.208  A)  of 
NiO  gas.  Because  of  these  two  points,  it  is  unlikely  that  the  energy  of  system  would 
decrease  as  oxygen  moves,  within  the  A'>-plane,  away  from  the  geometric  center. 


Table  I.  Absolute  energies  (hartrees). 

Species 

UHF  energy 

5(5+  l)A^ 

R2,  or  bond  (A) 

0 

-73.8042 

2.000 

Ni 

-38.9803 

2.000 

NiO 

-112.7492 

2.854 

2.208 

O2 

-147.6342 

2.000 

:.2I7 

Nij 

-77.9527 

2.004 

2.742 

Ni,o 

-389.6730 

10.129 

2.822 

N110O2 

-537.4486 

10.762 

4.18 

Ni,2 

-467.7216 

10.700 

2.822 

Ni,20 

-541.5954 

II  295 

4.1.3 

Ni,202 

-615  4290 

12.206 

4.22 

Ni,203 

-689.2588 

1.3.494 

4.01 

N|,204 

-763.0271 

— 

— 

Ni, 202(02) 

-763.0382 

13  ■’10 

3.92 

Ni, 2(02)2 

-763.0665 

12.653 

4.42 

Ni, 20(02)2 

-836.3584 

13.394 

3.59 

Ni.2(02b 

-910.6624 

— 

3.00 
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Table  II.  Off-center  energies  of  Nii20, 


Distance  off  center  (A) 

A£-  =  (eV) 

y  =  0.05 

0.07 

y  -  0.00 

0.00 

y  =  -O.OS 

0.07 

The  results  shown  in  Table  II  validate  this  presumption  for  the  one  oxygen  case, 
and  we  presume  that  such  a  condition  also  holds  for  multiple  absorbates. 

Now  that  we  have  established  that  oxygen  should  lie  somewhere  on  the  z-axis, 
we  may  defend  our  o,  cision  to  include  a  single  plane  for  our  model  by  simply 
noting  that  the  xz  and  yz  planes  are  exactly  equivalent  while  a  third  plane  {y''z)  is 
shown  in  Figure  3.  Deletion  of  the  (y'z)  and  {y''z)  planes  is  expected  to  effect  no 
influence  on  the  qualitative  predictions  of  our  model,  as  the  kinds  of  interactions 
that  are  prevalent  in  the  (>’"z)  plane  are  outweighed  by  the  closer,  shifted  interactions 
prevalent  in  each  of  the  (y'z)  and  (xz)  planes.  The  quantitative  predictions  are, 
however,  likely  to  be  compromised,  but  this  kind  of  result  is  already  compromised 
by  computational  level  and  basis  set  size.  Therefore,  we  feel  nothing  significant  is 
lost  by  considering  a  single  plane. 

For  displacement  of  oxygen  along  the  z-axis,  we  reiterate  that  the  Dj/,  structure 
corresponds  to  a  minimum  or  a  maximum  by  symmetry.  In  this  direction,  however, 
there  exists  4.85  A  (as  apposed  to  0.23  A  in  the  (x)  or  (.v)  directions)  of  “fi\,e”  range 
before  the  boundary  of  the  box  is  first  impeded  upon.  Any  potential  energy  curve 
covering  mis  much  distance,  through  chemically  significant  regions,  is  likely  to 
encompass  several  extrema.  Certainly  the  periodic  Ni/O  system  we  are  simulating 
will  exhibit  three  or  four  local  minima  (with  two  unique  well  depths)  in  which  the 
oxygen  atom  could  reside.  Our  model  gives  three  minima  (see  Fig.  2).  The  Dih 
structure  is  slightly  higher  in  energy  than  shown  in  Figure  1  (Al),  as  well  as  (of 
course)  is  mirror  image  reflection  through  the  (xy)  plane. 


Figure  .1.  A  four-site  cavity  in  nickel:  the  (F'X)  plane. 
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The  potential  energy  curve  (Fig.  2)  can  be  divided  conceptually  into  three  distinct 
partitions.  Empirical  fits  to  each  of  these  partitions  are  shown  in  Figure  4.  The 
periodic  part  [Fig.  4(a)]  is  due  to  interactions  of  oxygen  with  the  Ni  cores.  The  edge 
effect  [Fig.  4(b)]  is  due  to  the  physical  boundary  of  the  box.  The  delocalized  partition 
[Fig.  4(c)]  is  due  to  a  propagated  and  progressive  difference  in  the  valence  region 
of  the  Ni  atoms  from  the  edge  to  the  center.  Such  ?  decomposition  may  be  useful 
for  making  projections  of  the  behavior  of  one  oxygen  atom  in  progressively  larger 
tubular  voids.  We,  however,  have  employed  it  as  a  guide  in  our  characterization 
and  analyses  of  multiple  oxygen  atoms  within  the  same,  four-site  hollow. 


z 


z 


z 

Figure  4.  (a)  E  =  Ao  sin^(A.-).  Ao  ^  0  29835  eV.  6  =  1,1 134  A"';  (b)  E  =  kJ^hZf,  = 
0.00190  eV  A-*:  (c)  E  =  Eg  +  A’ +  ki{bz)\  k,  =  -0.04641  eV  A'',  Eg  -  -0  00073  cV. 
A'  =  c'((6z)  sin(^Z)F,  f '  =  -0.00383. 
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Two  Oxygen  Atoms 

For  two  oxygen  atoms  inside  the  void,  there  exists  four  physically  distinct  nuclear 
configurations  that  may  correspond  to  local  minima  on  the  ground  electronic  sur¬ 
face.  There  may  be  one  oxygen  atom  in  each  of  the  wells  near  ±3  A  [Figs.  2  and 
4(a)],  there  maybe  one  oxygen  atom  at  either  of  these  locations  while  the  second 
oxygen  is  near  the  geometric  center,  or  there  may  be  two  oxygen  atoms  in  any  one 
of  the  wells  shown  in  Figure  2.  Each  of  these  configurations  was  explored  in  some 
detail. 

We  have  found  no  local  minimum  corresponding  to  a  pair  of  oxygen  atoms  (i.e., 
an  oxygen  molecule)  inside  a  single  welb  This  indicates  that  oxygen-oxygen  at¬ 
tractions  are  overwhelmed  by  oxygen-lattice  interactions,  at  the  UHF/lanlImb 
level.  Each  of  the  remaining  two  configurations  do  correspond  to  local  minima. 
The  minimum  at  (ri  =  2.83,  Z2  =  -2.83)  is  0.29  eV  lower  in  energy  than  the  local 
minimum  at  (ci  =  2.63,  Z2  -  0.01),  which  is  about  the  same  as  the  difference  in 
energy  between  the  two  distinguishable  structures  for  Nii20.  The  position  of  the 
first  oxygen  in  the  higher  energy  structure  (of  Nii202)  is,  however,  shifted  by  0.2  A 
relative  to  the  position  of  oxygen  in  the  lowest  energy  structure  of  Nii20  and  relative 
to  the  first  oxygen  in  the  global  minimum  for  Nij202.  Of  course,  the  second  oxygen 
atom,  in  the  higher  energy  structure,  is  also  shifted  somewhat  from  the  geometric 
center,  as  expected  within  the  laws  governed  by  C21  symmetry  rules.  These  shifts 
indicate  that  oxygen-oxygen  attractions  are  exerting  some  influence  on  the  relative 
positions  of  each  oxygen  within  the  void.  Taken  with  our  earlier  said  conclusion, 
this  influence  is  propagated  through  a  barrier  rather  than  within  a  single  well. 

Three  Oxygen  Atoms 

Using  our  results  for  the  two  oxygen  case  as  a  guide,  we  doubt  that  more  than 
one  (locally)  minimum  energy  configuration  exists  for  the  three  oxygen  case.  The 
arrangement  of  the  single,  expected  minimum  energy  structure  is  £>2/1,  as  shown  in 
Figure  1,  structure  A3.  The  direct  (and  indirect)  force  exerted  by  the  center  oxygen 
on  the  two  “edge”  oxygens  is  divided  equally  between  them,  thereby  diminishing 
its  magnitude  relative  to  the  higher  energy  structure  of  the  two  oxygen  case.  At  the 
liHF/LANLlMB  level  this  direct  force  is  insufficient  (relative  to  oxygen-lattice  in- 
temctions)  to  effect  an  appreciable  influence  on  the  relative  positions  of  each  oxygen 
within  the  void.  On  the  contrary,  the  electron  withdrawing  character  of  the  oxygen 
atoms  appears  to  affect  the  host  lattice  in  such  a  way  as  to  increase  the  magnitude 
[the  curvature  of  curve  4(c),  if  you  will]  of  the  propagated  difference  in  the  valence 
region  of  the  Ni  atoms,  resulting  in  a  net  increase  in  the  skew  of  the  edge  oxygen 
atoms  toward  the  edge  of  the  cavity. 

The  resultant  void  stabilization  of  this  structure  is  the  greatest  that  we  compiled 
during  the  four-site  hollow  portion  of  this  study  (the  compilation  is  given  in  Table 
111).  It  is  interesting  to  note  that  most  of  the  volume  of  the  Ni  cavity  is  occupied 
by  the  three  oxygen  atoms,  distributed  throughout  the  void  so  that  the  largest  un¬ 
occupied  hollow  is  just  4.01  A  in  length  (and  4.89  A  in  diameter).  The  results  in 
Table  Ill  also  suggest  that  the  change  in  stabilization  energy  decreases  as  more 
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Table  III.  Void  stabilization  energies. 


Figure 

Stabilization  energy 
(no.  of  impurities  in  cavity) 

Pattern  A 

Pattern  B 

1 

1.54  (1.1)^'’ 

2  08  (2.4) 

2 

1.99  (2.2) 

1.31  (3.4) 

3 

2.34  (3.3) 

1.39  (3,5) 

4 

1.01  (4.4) 

1.1  (3.6) 

*  Energies  are  in  units  of  eV. 

*’  The  first  number  in  parentheses  is  the  total 
number  of  (atomic  and/or  molecular)  impurities, 
while  the  second  number  is  the  total  number  of 
atoms. 


oxygen  atoms  are  stuffed  into  the  void;  Ni,202  is  more  stable  than  Nii20,  while 
Nii204  is  0.26  eV  less  stable  than  Nii203,  etc. 

Four  Oxygen  Atoms 

The  four  oxygen  case  is  interesting  because  it  has  more  oxygen  atoms  confined 
within  the  cavity  than  the  cavity  has  wells  into  which  the  atoms  may  reside.  It 
follows  then  that  at  least  one  well  will  have  to  accommodate  two  oxygen  atoms  (an 
oxygen  molecule),  provided  that  the  analysis  of  the  pes  shown  in  Figure  2  remains 
qualitatively  correct.  We  have  explored  those  regions  of  the  pes  near  the  equilibrium 
geometries  of  each  of  the  structures  shown  in  Figure  1,  A4,  Bl,  and  B2,  as  well  as 
an  additional  structure  which  is  described  later  in  this  section. 

Structure  A4  is  expected  to  correspond  to  saddle  point  configuration  because 
each  of  the  four  absorbed  oxygen  atoms  reside  at  a  maximum  on  the  periodic 
portion  of  the  PES  [see  Fig.  4(a)].  Further,  we  observe  that  there  exist  four  ways 
that  each  of  the  two  interior  oxygen  atoms  may  fall  off  these  maxima.  These  are 
represented  by  structures  Bl  and  B2  and  by  a  doubly  degenerate  structure  with  a 
molecule  in  one  of  the  outer  wells  and  a  single  atom  each  of  the  remaining  two 
wells.  Note  that  the  pair  of  outer  oxygen  atoms  is  presumed  to  fall  inward  due  to 
the  edge  effect  that  becomes  strongly  repulsive  as  "z"  surpasses  4.2  A  (as  R2  drops 
below  2.8  A). 

Each  of  these  nuclear  configurations  were  found  to  be  lower  energy  structures 
than  that  of  A4.  as  expected.  Hence  we  observe  four  directions  that  lead  to  lower 
energy  than  that  of  structure  A4.  Therefore,  the  energy  of  structure  A4  represents 
an  upper  bound  of  the  barrier  to  conversion  between  structure  Bl  and  B2  (or  any 
couple  of  the  four  minima  described  above)  [7].  At  the  lane  1  mb  level,  structure 
Bl  is  more  stable  than  structure  B2  by  0.77  eV,  while  the  barrier  to  conversion  of 
B2  to  Bl  is  less  than  0.30  eV.  The  doubly  degenerate  €2^  structure,  not  shown,  has 
an  energy  only  slightly  lower  than  that  of  structure  B2.  This  is  surprising  as  we 
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expected  (based  upon  the  arguments  presented  in  the  next  section)  the  energy  of 
this  structure  to  be  about  half  that  of  structure  B2,  taking  the  energy  of  structure 
B 1  as  zero  (possibly,  we  missed  the  minimum  by  a  significant  amount  as  our  search 
was  somewhat  limited). 

Apparently  the  effect  shown  in  Figure  4(c)  continues  to  increase  in  magnitude 
as  the  number  of  oxygen  absorbates  continue  to  increase.  We  say  this  for  two 
reasons:  The  first  is  that  the  difference  in  energy  between  the  global  minimum  and 
the  next  lowest  minimum  increases  steadily  from  the  one  oxygen  case  to  the  two 
oxygen  case  to  the  four  oxygen  case.  The  other  reason  is  that  degree  of  skew  of  the 
outer  oxygen  atoms  in  the  three  oxygen  case  toward  the  edge  of  the  box  points  to 
an  increase  in  this  effect,  as  discussed  in  the  section  devoted  to  the  three  oxygen 
case.  In  the  four  oxygen  case  this  skew  loses  its  meaning  as  it  is  impractical  to 
distinguish  between  those  effects  which  result  from  oxygen-oxygen  interactions 
and  those  which  result  from  delocalized  lattice-oxygen  interactions. 

Finally,  let  us  speak  to  the  probability  of  finding  two  oxygen  molecules  inside  a 
four-site,  nickel  lattice  hollow.  Table  III  shows  that  the  greatest  stabilization  is 
effected  by  the  three  atomic  oxygen  absorbates.  If  such  cavities  at  first  form  quickly 
around  impurities,  rather  than  by  diffusion  of  impurities  into  preformed  cavities, 
then  three  atomic  oxygen  absorbates  is  an  unlikely  number,  as  is  any  odd  number 
of  atomic  oxygen  absorbates  in  any  cavity.  The  exception  to  this  rule  is  the  event 
which  includes  the  collapse  of  a  large  cavity  during  a  latter  stage  of  solidification, 
after  the  molecular  oxygen  absorbates  have  dissociated  and  dispersed  throughout 
the  void.  The  question  we  have  answered  is  whether  a  four-site  hollow  is  more 
likely  to  form  initially  around  one  oxygen  molecule  or  two  oxygen  molecules.  The 
results  in  Table  III  indicate  that  the  net  stabilization  energy,  defined  as  the  energy 
of  the  naked  nickel  cluster  (Niji)  plus  n  times  the  energy  of  gaseous  O2  minus  the 
minimum  energy  of  NinO^n,  is  nearly  the  same  for  each  case.  On  this  basis,  one 
cannot  determine  which  of  the  above  scenarios  is  more  probable.  However,  the 
rapid  solidification  of  nickel  is  a  process  that  occurs  at  high  temperature;  thus 
entropy  is  expected  to  be  an  important  factor  in  the  determination  of  most  probable 
void  sizes  and  nucleation  sites.  Usually  this  factor  will  favor  the  channel  with  more 
translational  degrees  of  freedom,  and  there  is  no  reason  to  expect  anything  different 
here.  Hence,  the  scenario  which  describes  nucleation  of  a  void  around  a  single 
molecular  impurity,  followed  by  dissociation  into  a  pair  of  trapped  oxygen  atoms, 
is  expected  to  be  the  most  probable. 

Five  and  Six  Oxygen  Atoms 

The  supersaturated  case  studies  were  performed  primarily  to  determine  the  max¬ 
imum  number  of  oxygen  atoms  that  could  be  placed  inside  a  four-site  Ni  hollow 
without  effecting  a  net  positive  SHi-  This  number  is  six,  or  a  pair  of  atoms  per 
natural  well  of  the  system.  We  propose  the  maximum  number  of  oxygen  absorbates 
in  any  tubular  void  is  governed  by  the  same  physical  circumstance.  Hence,  (In  - 
2)  is  the  maximum  number  of  oxygen  absorbates  that  can  be  packed  into  any  n- 
site  tubular  Ni  hollow. 
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Three-Site  Hollow 

Those  results  in  Table  III  that  point  to  “three”  as  the  optimal  number  of  trapped 
oxygen  absorbates  inside  the  four-site  Ni  hollow  already  suggest  that  it  is  energetically 
favorable  to  have  each  natural  well  of  the  void  be  occupied  by  exactly  one  atomic 
absorbate.  The  results  for  two  oxygen  atoms  inside  a  three-site  hollow  strongly 
reinforce  this  suggestion.  The  stability  of  Niio02  is  1.86  eV  greater  than  that  of 
NijiO:.  Taken  alone,  this  suggests  that  number  of  vacancies  defining  a  tubular  void 
in  nickel  is  not  persuaded  by  energy  to  exceed  (In  +  I ),  where  n  is  the  number  of 
O2  molecules  around  which  Ni  is  solidifying.  There  are,  of  course,  many  other 
components  that  need  to  be  evaluated  (such  as  the  energy  required  to  create  an  n- 
site  hollow  from  a  perfect  fee  lattice,  for  example)  before  one  can  say  definitively 
that  the  creation  of  one  cavity  size,  in  the  presence  of  oxygen,  is  preferred  over 
another.  We  feel  the  comparison  above  is  one  very  important  component  toward 
this  end. 


Conclusion 

We  have  shown  that  «-site  tabular  hollows  in  nickel  exhibit  (n  —  1 )  natural  wells 
into  which  oxygen  might  reside.  The  interaction  energy  between  the  nickel  lattice 
and  these  oxygen  absorbates  is  greater  than  the  interaction  energy  between  pairs  of 
oxygen  atoms.  This  implies  that  the  absorbates  exist  in  a  form  which  is  predomi¬ 
nantly  atomic,  as  opposed  to  diatomic.  The  most  preferred  number  of  these  atomic 
absorbates  is  {«  -  1 ),  exactly  matching  the  number  of  natural  wells  resulting  from 
the  structure  of  the  host  lattice.  The  maximum  number  of  oxygen  atoms  within  a 
single  «-site  hollow  is  2(n  -  1 ),  or  one  molecule  per  natural  well.  Finally  let  us 
remind  the  reader  that  each  of  the  stated  conclusions  apply,  at  the  uhf/lanlImb 
level,  only  to  tubular  (i.e.,  linear)  cavities.  Substantially  different  conclusions  may 
result  from  studies  of  2D  and  3D  cavities,  especially  for  larger  cavities  that  begin 
to  share  more  characteristics  with  adsorption,  than  with  typical  absorption. 

Appendix 

Let  us  alternatively  define  void  stabilization  as  the  binding  energy  of  an  oxygen 
atom  inside  a  nirkel/void  complex  that  is  already  occupied  by  m  oxygen  atoms, 
ni  =  0-5  (see  the  first  section).  Using  this  definition,  we  find  that  void  stabilization 
decreases  steadily  from  1 .89  eV  at  m  =  0  to  -0.33  eV  at  m  =  4  before  increasing 
slightly  again  at  m  =  5.  From  this  approach  we  find  that  w  =  4  is  the  maximum 
number  of  oxygen  atoms  that  can  be  inserted  into  the  cavity  before  the  void  sta¬ 
bilization  energy  (vsE)  changes  sign.  This  contrasts  with  the  previous  answer  of  (at 
least)  six  absorbates  before  the  sign  of  A//2  changes.  Clearly  this  difference  is  the 
result  of  differing  definitions  of  void  stabilization.  The  binding  energy  (BE)  equal 
to  zero  marks  the  point  at  which  it  is  no  longer  energetically  favorable  to  bind  the 
nickel/void/oxygen  complex  with  another  oxygen  atom.  Thus,  the  BE  definition  of 
VSE  would  also  suggest  that  ?n  =  4  is  the  most  favorable  number  of  oxygen  absorbates 
to  insert  into  the  nickel/void  complex.  However,  w  =  4  is  only  slightly  favored 
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over  wi  =  3  so  that  we  cannot  determine  definitively  which  number  is  most  preferred. 
This  is  again  in  sharp  contrast  to  the  A//2  definition  of  VSE  which  leads  to  m  =  3 
as  a  clear  favorite,  with  m  =  2  and  m  =  4  being  too  close  to  call.  These  differences 
prove  that  great  attention  must  be  paid  to  which  definition  of  void  stabilization 
has  been  invoked,  a  number  of  which  have  been  seen  in  the  literature  [8-10]. 
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Abstract 

An  ab  imiui  perturbed  ion  study  using  X-ray  diflraetion  data  has  been  carried  out  for  ZrSiO.  (zircon). 
ZrOj  (monoclinic  zirconia.  baddelevite).  and  S1O2  (a-cnstobalite)  crystal  lattice  structures.  The  different 
substitutions  of  V'*  for  Zr^*  and  Si^’  occurring  in  these  host  lattices  have  been  analyzed.  Geometry 
optimizations  have  been  performed  with  the  aim  of  determining  the  relative  stability,  cell  parameters, 
and  force  constants  of  radial  displacement  associated  with  the  local  relaxation  for  pure  and  doped  structures. 
Numerical  results  are  confronted  against  expenmental  data  and  compared  with  previous  results  The 
geometncal  cell  parameters  of  different  structures  obtained  by  computer  simulation  and  the  results  of 
the  X-ray  diffraction  studies  agree  with  previous  expenmental  data.  For  the  zircon  lattice,  the  substitution 
of  V**  for  Zr**  at  an  eightfold-coordinated  site  is  energetically  favorable  while  the  substitution  of 
for  Sr*  at  a  fourfold-coordinated  site  is  unstable.  For  ZrOj.  the  substitution  of  V'*  for  Zr^*  is  energetically 
favorable  while  the  substitution  of  V**  for  Si‘*  in  S1O2  is  energetically  unfavorable.  There  is  less  sensitive 
influence  of  the  crystal  lattice  parameters  for  substitutions  occumng  at  the  eightfold-coordinated  ion 
site  in  ZrSiO.  and  S1O2  structures.  The  doping  process  produces  a  decrease  of  force  constant  (.k)  values 
associated  with  the  breathing  fundamental  vibrational  mode  for  all  structures.  The  k  associated  with  the 
radial  displacement  in  dodecahedral  substitution  in  the  ZrSiO.  structure  is  especially  high  The  force 
constants  for  this  movement  in  tetrahedral  substitution  in  the  ZrSiO..  ZrO:.  and  S1O2  structures  have  a 
lower  value.  The  differences  between  ionic  radii  reported  by  Shannon  and  Prewitt  of  the  species  concerned 
in  the  doping  process  are  not  capable  of  explaining  the  relaxation  of  crystal  lattice  parameters  in  the 
Zr02  and  S1O2  structures,  fc  1993  John  Wiley  &  Sons.  Inc 


Introduction 

It  is  well  known  that  there  is  considerable  interest  within  the  cenmics  industry 
in  the  zircon  structure,  ZrSiOa,  due  to  its  natural  capacity  to  accept  replacements 
of  its  cations  by  foreign  cations  in  a  substitutional  solid  solution  [!].  In  particular, 
the  discovery  of  the  vanadium-zircon  blue  pigment  with  high  stability  and  com¬ 
paratively  low  cost  was  a  fundamental  advance  in  the  ceramic  pigments 
field  [2-4]. 

The  principal  structural  unit  of  zircon  is  a  chain  of  alternating  edge-sharing  SiOz 
tetrahedra  and  ZiOr  tnangular  dodecahedra  extending  parallel  to  the  c-axis.  The 
chains  are  joined  laterally  by  edge-sharing  dodecahedra.  In  zircon,  octahedral  sites 
are  present  but  contain  no  cations  [5].  Therefore,  the  foreign  cations  could  occupy 
the  Si'*^  position  and/or  the  Zr"*  in  the  zircon  structure. 

From  an  experimental  point  of  view,  it  is  difficult  to  assess  spectroscopically  the 
exact  substitutional  site  because  both  positions  have  the  same,  Dvh,  symmetry.  This 
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fact  has  produced  an  interesting  controversy  in  the  literature:  Demiray  et  al.  [6] 
have  concluded  that  the  V'’^  ion  only  occupies  the  dodecahedral  sites  in  the  lattice, 
replacing  the  Zr'*^  ion.  Di  Gregorio  and  Greenblatt  [7]  conclude  that  the  ion 
would  preferentiallv  occupy  the  tetrahedral  environments,  replacing  the  Si''^.  Lastly. 
Hong  et  al.  [8].  using  the  previous  experimental  results  and  calculating  the  distri¬ 
bution  of  energ>  levels,  have  concluded  that  V*^  replaces  both  Si'’^  and  Zr"*^  in  the 
zircon  lattice.  Recently,  Chandley  and  Clark  [9]  suggest  that  vanadium  occupies 
either  the  Si'*^  alone  or  both  the  Zr""  and  Si"*^  sites  but  predominantly  the  Si'*^  site 
in  ZrSi04. 

Our  interest  in  the  zircon  structure  has  existed  for  a  considerable  time,  and  we 
initiated  our  experimental  research  by  examining  the  synthesis  of  different  kinds 
of  solid  solutions  by  a  sol-gel  process  and  their  characterization  [10].  From  a  theo¬ 
retical  point  of  view,  two  studies  have  been  carried  out  by  us  [  1 1  ]  and  Chandley  et 
al.  [12]  in  order  to  determine  the  site  at  which  vanadium  prefers  to  be  doped  in 
the  zircon  structure.  From  this  aspect,  the  present  paper  can  be  considered  as  a 
prolongation  of  these  works. 

There  are  several  ways  to  address  the  problem  of  substitution  of  V'**  into  the 
ZrSi04  lattice,  depending  upon  which  mode  of  synthesis  is  being  analyzed.  We  can 
consider  two  procedures: 

(i)  In  a  flux-grown  crystal,  the  most  simple  analysis  involves  two  possible  sub¬ 
stitutions.  as  defect  reactions  show: 

V"^(g)  +  .v(ZrSi04)(s)  -  Zr,-  ,VSi,04^(s)  +  Zr^’^fg) .  (I) 

V^(g)  +  .v(ZrSi04)(s)  -  Zr,Si,.  ,V04,(s)  +  Si"^(g) .  (2) 

(ii)  ZrSi04  can  be  formed  by  the  reaction  of  with  ZrO;  (monoclinic  zirconia, 
baddeleyite)  and  SiO;  («-cristobalitc)  that  act  as  precursors  [9,10],  then  the  incor¬ 
poration  of  V**  into  ZrSi04  is  therefore  dependent  on  the  defect  reactions: 

V^(g)  +  .vfZrO.Xs)  —  Zr,_,V02>(s)  +  Zi'^fg) ...  (3) 

V^(g)  +  .v(SiO.)(s)  Si,.  .VO.Js)  +  Si"^(g) .  (4) 

In  conclusion,  depending  on  the  chemical  environment  present  during  the  syn¬ 
thesis  ofZrSi04.  V"*"  can  be  readily  substituted  either  for  Zr‘’\  via  direct  substitution 
[Eq.  (1)]  or  via  ZrO:  doping  [Eq.  (3)].  o'"  for  Si'**,  via  direct  substitution  [Eq.  (2)] 
or  via  SiO:  doping  [Eq  (4)]. 

The  ah  initio  perturbed  ion  (aiPl)  method  has  been  successfully  applied  to  the 
calculation  of  the  electronic  structure  of  different  structures  like  halides  [  1 3],  hydrides 
[14],  binary  oxides  [15],  different  oxo.  and  fluoroperovskites  [16].  This  method 
gives  detailed  information  on  the  ionic  wavefunctions  and  the  microscopic  inter¬ 
actions  determining  the  relativ  e  stability  of  pure  or/and  doped  structures. 

Following  recent  achievements  in  applving  this  methodology  in  order  to  acquire 
an  understanding  of  the  physical  and  chemical  properties  relevant  to  condensed 
matter,  the  present  work  has  been  launched  with  the  aim  of  investigating  theoretically 
the  following  aspects: 
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(i)  Calculation  of  different  properties  like  lattice  energy,  equilibrium  cell  param¬ 
eters  of  ZrSi04,  ZrO^.  Si02.  Zr»_i  VSi»04,,  Zr^Si^  i  V04v.Zrx_i  VO^i.  and  Si^-i 
VO2,  structures. 

(ii)  Calculation  of  force  constants  associated  with  breathing  or  radial  vibrational 
modes  of  pure  and  doped  structures. 

(in)  Numerical  results  are  analyzed  and  compared  against  other  theoretical  results 
and  experimental  data. 

The  method  and  model  are  summarized  in  the  next  section.  The  results  and 
discussion  arc  presented  in  the  third  section.  A  short  section  of  conclusions  closes 
this  paper. 


Method  and  iviodcl 


aiPl  method 

The  aiPi  method  [17]  provides  an  adequate  quantum-mechanical  treatment  of 
the  atom-in-the-lattice  structure.  The  particular  features  of  this  method  have  been 
described  by  Luana  and  Florez  [18],  and  it  is  suitable  for  studying  crystal  defects 
in  terms  of  clusters  w  hose  size  and  characteristics  can  properly  simulate  the  effects 
induced  by  the  presence  of  impurity  ions. 

Effective  energies  and  optimized  structures  are  reported  for  clusters  of  the  pure 
crystals.  ZrSi04,  ZrO;.  and  SiO;.  and  of  the  vanadium-doped  structures.  Zr,_i 
VSi(04».  Zr^Si^-i  V04v,  Zr». ,  VO;,,  and  Si,-i  VO;,.  The  impurity  ions  occupying 
cationic  vacancies  in  the  crystal  are  simulated  assuming  that  the  substitution  occurs 
in  a  single  center,  which  is  the  reference  origin  in  the  construction  of  the  clusters. 
The  clusters  have  been  built  successively  adding  shells  of  ions  that  are  symmetrically 
distributed  around  the  origin. 

Model 

To  avoid  undesirable  effects  related  to  the  cluster-lattice  interface  as  have  been 
prev  lously  noted  in  other  theoretical  studies  by  Luana  and  Florez  [18].  the  different 
structures  and  the  impurity  center  (V"*^)  are  simulated  by  a  large  cluster.  In  this 
work,  we  have  self-consistently  computed  the  electronic  structure  of  eight  different 
clusters  that  includes  a  total  number  of  atoms  from  5.^  to  1 17  (20-46  shells  of 
neighbors).  Zr2^Sii40}io.  Zr22VSii40}<().  Si24Zri407(,.  and  Si2;VZri4076  (around  4^ 
molar  of  doping)  clusters  embedded  m  a  zircon  lattice  have  been  used.  ZrnOj,,  and 
ZrihVO.ih  (around  b'l  of  doping)  clusters  embedded  in  a  monoclinic  zirconia  lattice 
while  Si290s)t  and  Si2sV05)i  (around  4^^  of  doping)  clusters  embedded  in  a  «-cris- 
tobalite  lattice  have  been  employed. 

L^trge  STO  basis  sets  arc  used  on  every  ion:  (5s5p)  for  0‘  .  (5s4p)  for  Si'*^ ,  (8s6p5d) 
for  V*'.  and  (10s9p5d)  for  Zr'*^  [19J.  The  wavcfunclions  and  effective  potentials, 
used  to  represent  the  lattice  environment  on  each  cluster,  have  been  obtained  from 
a  previous  aiPl  calculation  on  the  pure,  infinite  clu.ster.  The  Madelung  potential, 
responsible  for  the  largest  part  of  the  interaction  energies,  has  been  analytically 
integrated.  The  quantum  mechanical  contributions  to  the  interaction  energies  have 
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been  counted  for  a  large  number  of  neighboring  shells  until  a  convergence  of  10'^ 
hartrees  is  achieved  in  the  effective  energy  of  the  clusters.  The  effective  energy 
includes  corrt.ation  estimated  by  means  of  the  unrelaxed  Coulomb-Hartree-Fock 
formalism  [20].  The  term  unrelaxed  is  used  because  the  aiPi  wavefunctions  are  not 
affected  by  this  correction. 

The  geometry  of  the  different  crystal  structures  was  optimized,  varying  the  struc¬ 
ture  and  calculating  the  effective  energy.  The  optimized  distances  were  calculated 
in  the  following  way;  The  geometry  of  the  first  shell  was  varied  until  a  minimum 
effective  energy  was  obtained;  with  the  first  shell  fixed  at  the  minimum,  we  proceed 
to  optimize  to  the  second  si.ell,  and  so  on.  A  schematic  representation  of  ZrSi04, 
ZrOi,  and  SiO:  •  p;esented  in  Figures  1,  2.  and  3,  respectively. 


Results  and  Discussion 


Struclures 

The  calculated  crystal  lattice  parameters  for  ZrSi04.  Zr,-i  VSi^04,,  Zr^  Si,_i 
VG4v.  ZrO:,  and  SiOi  obtained  from  t  X-ray  diffraction  study  [10]  using  the  Rietveld 
method  [21.22]  are  ''sted  in  Table  I.  In  Tables  lla  and  Ilb  are  given  the  calculated 
bo  ■  d  len.  ths  for  pure  a  d  vanadium-doped  zircon  at  Zr"*^  and  Si'*''  sites,  respectively. 
The  1.  .pe  nental  listances  are  o'  tained  from  the  values  presented  in  Table  I, 
coiibidcrin  he  i"*ragonal  14|/amd  space  group. 


I'lgure  I .  Schemalic  rcprcscnuiion  of  zircon  structure.  The  chains  of  alternating  edge- 
sharing  SiOj  tetrahedra  and  ZrO*  triangular  dodccahcdra  extending  parallel  to  (  and  joined 
laterally  by  edge-sharing  dodccahcdra. 
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(a) 


(b) 


Figure  2.  Schematic  representation  of  monoclinic  zireonia  (baddele>ite)  (a)  7-coOidination 
of  Zr**  (idealized);  (b)  projection  of  atomic  dispositions 


00 

.  Si 


Figure  3.  Schematic  representation  of  a-cristobahte  structure. 
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Table  1.  Lattice  calculation  data:  crystal  lattice  parameters. 


Lattice 

a  (bohr)* 

b  (bohr)* 

c  (bohr)* 

L'(bohr’)* 

b  (deg) 

Origin 

ZrSi04 

12.485 

12.485 

11.304 

1762.19 

90 

Ref.  [23] 

12.4609 

12.4609 

1 1.2854 

1752.32 

90 

This  work 

(12.4741) 

(12.4741) 

(11.2968) 

(1757.81) 

90 

This  work 

ZrOz 

9.7234 

9.8408 

10.0358 

947.87 

99.23 

Ref  [23] 

9.7330 

9.8485 

10.0482 

950.71 

99.23 

This  work 

S1O2 

9.407 

9.407 

13.130 

1163.41 

90 

Ref  [23] 

9.4089 

9.4089 

13.1506 

1 164.20 

90 

This  work 

Doped  structure  values  in  parentheses. 


In  Tables  lie  and  Ild  the  corresponding  bond  distances  for  pure  ZxOi,  Si02  and 
doped  Zr^-i  VO^x,  and  Si^-i  VO2X  structures  are  summarized,  and  the  experimental 
distances  are  obtained  from  the  values  presented  in  Table  I.  considering  the  mono¬ 
clinic  P2|/c  and  tetragonal  P4|2i2|  space  groups,  respectively. 

For  the  zircon  structure,  a  comparison  of  the  optimized  and  experimental  values 
reported  by  us  and  Hyde  and  Andersson  [23]  (Tables  Ila  and  lib)  shows  a  good 
agreement,  around  2%  and  5%  difference  for  geometrical  parameters  related  to  Zr'** 
and  sites,  respectively.  The  effect  of  doping  vanadium  into  the  zirconium  site 
is  to  shorten  the  four  equatorial  metal  oxygen  bonds  and  to  lengthen  the  four  axial 
metal  oxygen  bonds.  The  presence  of  the  impurity  center  in  the  lattice  produces  a 
change  of  equatorial  oxygen  ions  that  are  moved  0.0944  a.u.  closer  to  the  metal 
ion,  altering  the  bond  distance  from  4.2087  (Zr'*'^ — )  to  4. 1 143  a.u.  '). 

The  effect  of  doping  vanadium  into  the  silicon  site  is  that  the  four  oxygen  ions 
bonded  to  the  vanadium  ion  move  away  from  their  original  positions,  altering  the 
lattice  ion  to  oxygen  bond  lengths  from  3.1881  (Si'*'^ — O^”)  to  3.4332  a.u.  (V"*^ — 
0^“),  a  difference  of  0.245 1  a.u.  This  is  compensated  for  by  moving  the  zirconium 
and  silicon  ions  bonded  to  those  oxygen  ions  further  away  from  the  dopant  ion, 
and  by  small  changes  in  Zr'*^ — O^"  and  Si^^ — O^”  bond  lengths.  Thus  the  greatest 


Table  Ila.  Calculated  bond  lengths  (a.u.)  for  pure  and  vanadium-doped  zircon  at  Zt**  site. 


Metal  ion 

Ion 

Optimized 

distance 

Experimental 

distance 

From  the  literature 

Pure  structure 

Zr** 

4.x  0^"  (axial) 

4  0351 

4.0351 

4.025  [23] 

4a'  0^”  (equatorial) 

4.2087 

4.2997 

4.288  [23] 

Vanadium-doped 

structure 

V** 

4.V  0^"  (axial) 

4.0394 

4.0394 

3.855  [12] 

4x  0^'  (equatorial) 

4.1143 

4.3041 

4.301  [12] 
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Table  lib.  Calculated  bond  lengths  (a.u.)  for  pure  and  v?.nadium-doped  zircon  at  Si**  site 


Optimized  Experimental 


Metal  ion 

Ion 

distance 

distance 

From  the  literature 

Pure  structure 

Si** 

4.\0^-(l) 

.4.1881 

3.0315 

3.065  [23] 

2vZr** 

5.6427 

5.6427 

4.\  0=-  (2) 

6  2310 

6.2310 

Vanadium-doped 

structure 

V** 

4.vO='(l) 

3.4332 

3  0346 

3.322  [12] 

2.vZr** 

5.8743 

5  6484 

4.\  O^-  (2) 

6.3251 

6  2374 

Table  lie. 

Calculated  bond  lengths  (a.u.)  for  pure  and  vanadium-doped  ZrOj. 

Metal  ion 

Anion 

Optimized 

distance 

Experimental 

distance  Ref.  [24] 

Pure  structure 

Zr** 

o=-(i) 

3  6837 

3.8776 

O'- (2) 

3.7037 

3.8986 

O'-  (3) 

3.8648 

4.0682 

Average 

3  75 

3.95  3.91 

O'-  (4) 

3.8719 

4  0757 

O'-  (5) 

3.9073 

4.1129 

O'-  (6) 

4.0246 

4.2364 

O'-  (7) 

4.0693 

4.2835 

Average 

3.96 

4.18  4.18 

Vanadium-doped 

structure 

V** 

O'-(l) 

3  2960 

O'-  (2) 

3.3138 

O'-  (3) 

3.4580 

Average 

3.36 

O'-  (4) 

3.4643 

O'-  (5) 

3.4960 

O'-  (6) 

3.6009 

O'-  (7) 

3.6410 

Average 

3.55 
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Tablk  Ild  Calculated  bond  lengths  (a  u  )  for  pure  and  vanadium-dop>ed  SiO;. 


Optimized 

Evperimental 

Mclul  ion 

Ion 

distance 

distance 

Ref.  (23) 

Pure  structure 

Si‘* 

ZvO’-(l) 

2.9485 

3.0392 

2.VO--  (2) 

3.0566 

3.0400 

Aserage 

3.003 

3.040 

3.04 

4\  Sd’ 

5.8160 

5  8160 

2a  0=-  (3) 

6.6234 

6.6157 

Vanadium-doped 

Structure 

2aO’-(I) 

3.1007 

2a  0-- (2) 

3.1015 

Average 

3.101 

4a-  Sd* 

5.8160 

2a  0=-  (3) 

6.62.34 

distortion  of  the  lattice  on  doping  occurs  with  vanadium  doping  into  the  Si'*'^  site. 
These  results  agree  with  the  data  reported  by  Chandley  et  al.  [  1 2] 

For  the  ZrOj-  structure,  a  comparison  of  optimized  and  experimental  distances 
shows  a  5^r  difference  (Table  11c).  In  this  lattice  the  Zr'*^  is  7-coordinated  (Fig.  2), 
and  three  oxygen  ions  have  a  practically  coplanar  arrangement  while  the  remaining 
four  oxygen  ions  present  a  tetrahedral  coordination  with  a  larger  mean  distance 
from  the  metal  ion.  The  effect  of  doping  vanadium  into  the  zirconium  site  is  to 
shorten  the  two  groups  of  metal  oxygen  bonds.  The  three  coplanar  oxygen  ions  are 
moved  in  an  average  of0..19a.u.  closer  to  the  metal  ion  while  the  tetrahedral  oxygen 
ions  are  displaced  in  an  average  of  0.41  a.u.  closer  to  the  metal  ion.  The  doping 
process  produces  a  noticeable  compression. 

For  the  SiO:.  a  comparison  of  optimized  and  experimental  distances  shows  a  3% 
difference  (Table  lid).  The  effect  of  doping  vanadium  into  the  silicon  site  is  to 
lengthen  the  four  metal  oxygen  bonds  while  the  remaining  bond  distances  are  un¬ 
affected. 


Energetics 

The  defect  reaction  (\E)  energies  to  substitute  Zr'”  and  Si"*^  in  the  host  materials 
ZrSiOj  for  V’”;  Vz^iZrSiOj  (Eq.  (1)]  and  Vs,:ZrSi04  [Eq.  (2)].  respectively,  are 
listed  in  Table  III,  together  with  \E  to  substitute  vanadium  for  Zr"*^  in  ZrOj. 
ZrO:  (Eq.  (3)]  as  well  as  vanadium  for  Si"*^  in  SiO:,  Vs,:SiO:  (Eq.  (4)).  In  this  table. 
SE  (experimental  structure)  is  the  calculated  defect  reaction  energy  associated  with 
experimental  geometry  without  optimization  procedure  while  A/:  (optimized  struc- 
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TABLt  III.  Defect  reaction  energies  (a.ii.). 


ImpurilN  center 

A/,  (experimental  structure) 

A/;  (optimized  structure) 

Vz,ZrSi04 

-0.2021 

-0.2212 

Vs,:ZrSi04 

I..t564 

0  2509 

Vz,:Zr02  (m) 

-0.2513 

-0.5723 

Vs,;SiOz  («) 

0.7587 

0.7505 

lure)  corresponds  to  the  value  obtained  optimizing  the  geometry  as  described  in 
the  previous  section. 

The  SE  values  show  that  ion  would  substitute  more  easily  for  Zr"*^  than  for 
Si"^.  The  substitution  of  on  the  Zr'"^  site  is  favored  throughout  by  -0.22  and 
-0.57  a.u.  in  ZrSi04  and  ZrO^  structures,  respectively.  Considering  that  ZrSiOa  is 
not  formed  by  the  reaction  of  a  -bearing  melt  with  ZrSi04,  [9.10]  but  rather  by 
the  reaction  of  doped  ZrOi  and  Si02,  Zr"*^  can  be  readily  substituted  for  V"**  via 
ZrOi  doping. 

It  is  important  to  note  that  the  geometry  optimization  process  for  the  Vs,:ZrSi04 
structure  significantly  modifies  the  value  of  AE  from  1 .36  to  0.25  a.u.  For  the  Vz^: 
ZrO:  structure,  this  change  is  minor,  from  -0.25  to  -0.57  a.u.  This  results  can  be 
explained  due  to  large  changes  of  crystal  lattice  parameters  when  the  optimization 
of  geometry  is  carried  out  (see  Tables  lib  and  lie). 

Force  Constants 

The  aim  of  this  section  is  to  simulate  lattice  relaxation  around  a  point  defect 
and  to  use  this  to  evaluate  the  energy  and  force  constant  tk)  associated  with  the 
symmetric  breathing  (radial)  fundamental  vibrational  mode.  The  k  value  was  studied 
by  means  of  the  calculation  of  the  effective  energy  as  function  of  the  crystallographic 
displacement  which  was  changing  in  a  stepwise  fashion;  a  fitting  method  to  a  fifth 
potential  polynomial  is  used. 

The  k  values  for  pure  and  doped  ZrSi04,  ZrO:,  and  SiOi  structures  are  listed  in 
Table  IV.  These  numerical  values  can  be  summarized  as  follows:  (i)  The  doping 
process  produces  a  decrease  of  k  values  for  all  structures;  (ii)  the  k  value  associated 
with  the  dodecahedral  site  in  the  zircon  structure  is  especially  high;  (iii)  the  k  values 
for  the  tetrahedral  site  in  the  zircon  structure,  ZrOz  and  SiO:  cry  -al  lattices  render 
similar  values;  (iv)  the  doping  in  tetrahedral  substitution  for  zircon  structure  pro¬ 
duces  a  considerable  decrease  of  k  value.  This  fact  agrees  with  the  appreciable 
geometrical  distortion  after  this  substitution  occurs  at  the  tetrahedral  site  in  the 
zircon  structure  and  with  the  experimental  values  on  compressibility  studies  reported 
by  Hazen  and  Finger  [25]. 

If  we  try  to  rationalize  our  results  on  the  basis  of  a  comparison  of  the  ionic  radii 
of  the  species  concerned  in  the  doping  process,  taking  ionic  radii  from  Shannon 
and  Prewitt  [26]  the  differences  in  size  between  the  substituent  and  the  ion  in 
the  host  lattices  are:  R(V")  -  R{Si'‘*)  =  1.380  -  0.756  =  0.624  bohr  and  RfV*^) 
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Table  IV.  Force  constants  (a.u.). 


Tetrahedral  site 

Dodecahedral  site 

ZrSiOi 

Pure  structure 

1.86 

82.62 

(V-<Joped) 

SiOj  (a) 

(0.64) 

(68.80) 

Pure  structure 

2.51 

(V-doped) 

ZrO:  (m) 

(2..t8) 

Pure  structure 

1.82 

(V-doped) 

(1.64) 

-  R{Zr‘’^)  =  1 .380  -  1 .625  =  -0.245  bohr.  For  the  zircon  structure,  the  geometry 
relaxation  following  substitution  on  the  dodecahedral  site  is  expected  to  be  less 
sensitive  if  we  consider  the  similar  ionic  radii  of  Zr^"^  and  V''^  as  our  data  show. 
However,  for  the  doping  process  in  Z1O2  and  SiO^  structures,  this  argument  does 
not  work.  The  use  of  the  relative  size  of  ions  and  volume  of  the  crystal  cells  (see 
Table  1)  do  not  reproduce  theoretical  and  experimental  evidence  for  these  crystal 
lattices. 


Conclusions 

Cluster-in-the-lattice  simulations  of  pure  and  doped  ZrSi04,  Zr02,  and  SiOz 
structures  and  X-ray  diffraction  studies  appear  to  be  an  appropriate  procedure  to 
systematically  characterize  the  energy,  the  changes  or  distortions  of  cell  parameters, 
and  force  constant  for  these  crystal  lattices  associated  with  the  doping  process. 

The  results  of  the  present  study  can  be  summarized  as  follows: 

(i)  The  calculated  crystal  lattice  parameters  obtained  from  a  X-ray  diffraction 
study  using  the  Rietveld  method  for  these  structures  are  similar  to  these  reported 
by  Hyde  and  Andersson. 

(ii)  Less  sensitive  influence  of  the  crystal  lattice  parameters  for  substitutions 
occurring  at  the  eightfold-coordinated  ion  site  in  ZrSi04  and  SiO;  structures. 

(iii)  Substitutions  of  for  Zr^^  at  the  dodecahedral  site  in  zircon  structure 
and  for  Zr^"^  in  ZrOj  structure  are  energetically  favorable. 

(iv)  Our  stability  results  disagree  with  Qtandley’s  et  al.  [  1 2]  conclusions:  however, 
the  structural  data  and  distortion  of  the  lattice  which  take  place  on  substitution  are 
similar. 

(v)  An  analysis  of  the  defect  energy  and  the  breathing  fundamental  vibrational 
mode  for  substitutions  of  V*^  for  Si^*  at  the  tetrahedral  site  in  the  ZrSi04  and  SiOi 
structures  does  not  reveal  any  indication  of  energy  stabilization  for  this  doped 
structure.  This  behavior  confirms  previous  findings  [6]  while  being  at  variance  with 
others  (7.9|. 
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(vi)  The  doping  process  produces  a  decrease  of  force  constant  values  associated 
with  the  breathing  fundamental  vibrational  mode  for  all  structures. 

(vii)  The  k  associated  with  the  breathing  displacement  in  dodecahedral  substi¬ 
tution  for  the  ZrSi04  structure  is  especially  high.  The  force  constants  for  this  move¬ 
ment  in  tetrahedral  substitution  for  the  ZrSi04.  the  Zr02.  and  Si02  structures  have 
a  lower  value. 

(viii)  The  differences  between  ionic  radii  of  the  species  concerned  in  the  doping 
process  do  not  explain  the  relaxation  of  crystal  lattice  parameters  for  the  Zr02  and 
Si02  structures. 
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Abstract 

Due  to  suggestions  that  Self  Inter  iction  Corrections  (si' ),  gradient  corrections,  and  short-range  electron- 
electron  interactions  in  the  Local  (Spin)  Density  Approximation  (l(s)da)  scheme  may  significantly 
influence  the  computed  electronic  structure  for  the  Mott-insulating(  Ml)  transition  metal  oxides  (tmos), 
a  comparative  study  has  been  made  of  Hart'ce  Fock  (HF)  and  l(S)DA  computations  for  NiO  Since  HF 
lacks  electronic  correlation,  it  overestimates  fx'nd  width  (m  conductors)  and/or  band  gaps  (in  insulators). 
It  gives  the  band  gap  for  NiO  two  times  larger  than  that  in  expenment.  while  lsda  gives  the  gap  one 
order  of  magnitude  smaller  than  the  expenmental  value  We  demonstrate  that  the  HF  results  are  consistent 
with  some  previously  believed  to  be  well-understood  expenments.  while  the  l(S)DA  results  are  not  It 
IS  sugge..ed  that  hf  may  offer  a  better  reference  state  for  the  development  of  a  usda  scheme  ©  I99.t 
John  Wiley  &  Sons,  Inc 


Introduction 

The  discovery  of  the  high-T,  superconductors  (  htscs  )  [  1  ]  based  on  copper  oxide 
layer  structures  has  re-opened  interest  in  the  study  of  the  (Ml)  TMOs,  a  class  of 
materials  to  which  the  HTSCs  also  belong,  and  for  which  the  theoretical  computation 
of  their  electronic  structure  has  been  always  a  challenge.  In  order  to  understand  all 
important  properties  of  these  materials  including  the  high-Tr  superconductivity, 
detailed  knowledge  of  their  electronic  structures  is  essential. 

Due  to  the  relatively  poor  computers  available  at  the  lime  and  the  imperfections 
of  the  computational  methods,  early  electronic  structure  studies  on  the  (Ml)  TMOs 
[2]  obtained  electronic  structures  which  compared  poorly  with  experiments  in  some 
very  critical  features,  such  as  magnetic  properties,  band  gaps  and  widths,  relative 
positions  of  bands,  etc. 

Recently,  there  have  been  many  attempts  to  improve  the  understanding  of  these 
problems.  Singh  and  Pickett  [  3 )  added  gradient  corrections  to  the  exchange-cor¬ 
relation  potential  within  the  L(S)da  [4].  Yu  and  Freeman  [5]  included  a  short- 
range  electron-electron  interaction  in  addition  to  the  normal  l(S)da  potential  in 
order  to  describe  the  electronic  correlations  of  the  localized  electrons  more  properly. 
Svane  and  Gunnarsson  [6]  added  to  the  l(s)da  scheme  the  sic  which  result  from 
the  approximate  treatment  of  the  exchange.  On  the  other  hand,  Shen  et  al.  [7] 
reported  recently  experimental  band  structures  for  NiO.  In  the  T  -  X  direction. 
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there  are  bands  at  about  -11  eV  below  the  Fermi  energy  (c/.)  which  cannot 
be  identified  as  any  of  the  l(S)da  bands,  no  matter  which  corrections  have 
been  made. 

In  order  to  find  out  how  these  deviations  come  about,  the  electronic  structure 
has  been  calculated  in  the  present  study  within  both  HF  and  lda  approximations 
for  NiO,  which  is  a  typical  (MI)  TMO.  Comparison  has  been  made  between  results 
from  both  methods,  and  with  typical  previous  lsda  results  f  8].  It  should  be  men¬ 
tioned  that  Wilson  ( 1970)  [9]  and  Kunz  ( 1981 )  [10]  studied  the  (Ml)  TMOs  with 
approximate  HF  methods  and  obtained  valuable  physical  insight  but  their  studies 
were  ignored  in  the  literature. 


Methods 

The  presumably  most  advanced  method  for  electronic  structure  calculations  of 
solids  is  the  Density  Functional  (DF)  theory  with  the  use  of  the  Kohn  and  Sham 
[11]  one-electron  Hamiltonian.  It  includes,  along  with  the  kinetic  and  Hartree 
terms,  an  effective  exchange  and  correlation  potential.  lf',r(p),  which  is  in  turn  a 
functional  of  the  electronic  density  p(r).  The  exact  form  of  this  functional  for 
almost  all  realistic  systems  is  unknown.  The  l(s)da  has  to  be  made  by  evaluating 
H\,(p)  at  the  local  density,  i.e.,  W\r(p)l,  ^  W',.(p(r)).  In  practice,  the  explicit 
form  of  the  functional  W\,(p(r))  is  usually  considered  not  to  be  critical,  except 
that  it  should  be  exact  in  the  limit  of  the  homogeneous  electron  gas  [12]. 

The  HF  method  [13],  on  the  other  hand,  does  not  suffer  from  the  local  approx¬ 
imations  on  the  exchange-correlation  potential.  The  corresponding  single  deter- 
minantal  wavefunction  is  a  clearly  defined  quantity  which  takes  account  of  the 
different  nodal  structures  of  the  different  orbitals  via  its  non-local,  exact  exchange 
term  In  principle,  it  is  known  what  is  missing  from  the  HF  state  and  how  it  can  be 
improved  by  introducing  correlation  either  through  perturbational  or  variational 
procedures  [14].  Asa  result,  hf  calculations  can  be  used  as  a  good  reference  for 
other  computations.  In  fact.  Kohn  and  Sham  in  their  original  article  [1 1]  suggested 
an  alternative  lda  scheme  by  using  HF  as  the  reference  rather  than  the  usual  non¬ 
interacting  system. 

With  the  HF  method,  it  is  known  that  because  the  non-local  exchange  term  is 
not  compensated  due  to  the  lack  of  electronic  correlation,  calculated  conducting 
band  structures,  especially  for  metals  where  screening  is  important,  are  about  twice 
as  wide  as  those  from  experiments,  as  are  band  gaps  for  insulators  and  semicon¬ 
ductors.  However,  hf  calculations  may  be  considered  as  the  lower  limit  of  ap¬ 
proximate  screening  in  insulators  while  the  L(S)DA  is  the  upper  limit.  Generally, 
the  hf  method  overestimates  band  widths  and  band  gaps  while  l(s)da  underes¬ 
timates. 

Another  important  fact  is  that  in  the  presence  of  strong  correlations,  as  in  the 
TMOS  with  their  quite  localized  d  electron.s,  both  methods  may  lead  to  large  devia¬ 
tions  because  in  the  HF  approximation,  by  definition,  correlation  is  not  included 
at  all  while  in  the  L(S)DA,  the  very  inhomogeneous  charge  distribution,  which  is 
far  from  that  of  the  electron  gas.  invalidates  the  approximation.  Although  L(S)DA 
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considers  electronic  correlation  to  some  extent,  it  is  really  not  known  which  effect 
is  larger.  In  this  study  for  NiO,  both  methods  were  employed  and  it  is  hoped  that 
comparison  between  the  results  from  these  two  methods  and  with  that  of  experiments 
may  reveal  which  effect  is  more  important  for  the  calculated  electronic  structures 
of  these  materials.  In  the  present  study,  the  restricted  hf  calculation  was  done  with 
CRYSTAL  [13]  because  there  is  still  no  facility  in  the  present  laboratory  to  do  a 
spin  polarized  hf  calculation,  and  a  corresponding  non-magnetic  lda  calculation 
was  carried  out  with  WIEN  which  is  a  Full  Potential  lapw  lda  program  [15]. 

The  NiO  crystal  has  the  NaCI  structure  of  space  group  OliFmSm)  [16],  a  = 
4.17  A,  and  atomic  positions  Ni(0.0, 0.0, 0.0)  and  0(0.5, 0.0, 0.0).  For  the  present 
HF  study,  a  He-core  pseudopotential  for  O  [  1 7  ] ,  an  Ar-core  pseudopotential  for  Ni 
[18]  are  used.  For  Ni-3d  valence  orbitals,  TZotos  are  used  with  4s  basis  [19].  For 
O,  a  DZ  GTO  basis  for  the  2p  valence  orbitals  are  employed  [17]. 

Results  and  Discussions 

Electronic  structure  studies  for  Mi  TMOs,  as  well  as  for  the  parent  materials  of 
HTSCs  [20],  show  that  the  uppermost  band,  the  O-p  and  metal-rf  <r-antibonding 
band,  is  half-filled.  Because  of  the  strong  on-site  Coulomb  repulsion,  a  gap  opens 
between  the  lower-  and  the  upper-half  bands  and  the  system  undergoes  a  metal- 
insulator  transition.  Therefore,  these  materials  exhibit  antiferromagnetism  (af) 
below  the  Neel  temperature  and  poor  conductivity.  As  discussed  in  Ref.  [6],  lsda 
electronic  structure  calculations  for  NiO  and  MnO  with  artificially  dimerized  unit 
cells  result  in  af  insulating  ground  states,  as  they  should,  but  fail  to  give  such  a 
ground  state  for  CuO,  CoO,  and  FeO.  lsda  gave  the  other  non-bonding  metal  t2g 
rf-bands  slightly  (ca.  1  eV)  below  the  bonding  metal  Cg  i/-band(s)  and  above  the 
O  p-bands.  However,  the  recent  lsda  calculations  by  Svane  and  Gunnarsson  [6] 
with  SIC  included  successfully  found  af  insulating  ground  states  for  CuO,  FeO,  and 
CoO  as  well,  in  contrast  to  usual  lsda  scheme  [6  ] .  Unfortunately,  bands  and  DOS’s 
were  not  published  from  these  calculations.  For  comparison,  the  band  gap  for  NiO, 
which  i'  one  of  the  most  critical  criteria,  as  listed  in  Ref.  16],  is  4.3  eV  as  from 
experiment,  0.2  eV  given  by  traditional  lsda,  2.5  eV  give  by  Svane  and  Gun¬ 
narsson,  but  14.4  given  by  Unrestricted  HF  [21]. 

In  Figure  1,  the  non-magnetic  hf  band  structures  of  NiO  are  shown,  in  com¬ 
parison  with  the  corresponding  lda  results,  as  can  be  seen  for  the  T  -  L  portion 
also  in  Ref.  [22].  The  corresponding  hf  densities  of  states  (doss)  are  shown  in 
Figure  2;  a  previous  lsda  DOS  by  Terakura  et  al.  [8]  is  given  as  an  insert.  It  can 
be  seen  from  Figure  1  that  the  lda  and  hf  results  are  very  consistent  with  each 
other  for  bands  around  c/r.  The  crossing  points  of  the  partially  filled  p  -  d  anti¬ 
bonding  band  (marked  with  “A  -  B”)  with  e/.  from  both  methods  are  in  good 
agreement.  The  splitting  and  dispersion  of  the  partially  filled  band  fron  both  cal¬ 
culations  are  very  similar.  However,  it  should  be  emphasized  again  that  for  a  non¬ 
magnetic  study,  all  band  structure  methods  fail  to  give  a  gap  in  the  partially  filled 
band.  The  comparison  given  shows  that  both  methods  yield  similar  features  in  the 
considered  energy  regions  under  similar  conditions. 
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Figure  1.  Non-magnetic  band  structures  of  NiO  calculated  with  both  hf(CRYSTAL) 
and  lda(WIEN)  methods.  The  Fermi  energies  are  set  at  0.0.  The  antibonding  Ni^ —  O, 
band  is  marked  with  “A-B”. 


It  is  not  surprising  that  the  HF  band  dispersion  is  about  two  times  larger  than 
the  LDA  because  the  two  methods  differ  by  the  treatment  of  exchange  and  correlation. 
The  most  significant  difference  is,  however,  that  HF  predicts  the  metal  to  lie 
below  all  other  valence  bands.  Yet  the  typical  l(s)da  result  leaves  them  only  1  eV 
or  less  under  the  uppermost  occupied  band  (referring  to  the  lda  parts  of  Fig.  1 ). 
This  difference  can  also  be  seen  from  the  DOSs  in  Figure  2,  where  lsda  predicts  all 
metal  r/-states  to  be  concentrated  to  give  one  single  peak  lying  within  3  eV  below 
the  Cf ,  while  HF  gives  three  peaks,  one  around  Cf,  one  far  below  cp  and  all  other 
valence  bands  (d,^^  non-bonding  bands),  and  another  in  between  (which  is  from 
the  pure  Op  bands  and  the  metal- r/  -  0-p  bonding  bands). 

Traditionally  [2],  the  -  d,^  splitting  was  considered  to  be  a. pure  crystal  field 
effect.  The  two  sets  of  orbitals  were  believed  to  differ  by  roughly  an  energy  of  the 
order  of  magnitude  of  the  overlap  integral  between  df^  and  the  ligands,  which  is 
about  ca.  1  eV..  The  d,^^  is  considered  to  be  unchanged.  But  in  this  understanding, 
an  important  effect,  relaxation,  was  missed.  In  reality,  however,  once  the  electrons 
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NiO  (HF,  non- magnetic) 


Figure  2.  HF  doss  of  NiO  corresponding  to  Figure  1 .  The  shaded  part  in  the  HF  Ni(d ) 
DOS  is  the  contributions  of  Ni — +  Ni — (/,j  states  which  take  part  in  the  Ni^ —  O, 
ff-bonding.  It  is  clear  that  the  DOS  around  the  Fermi  energy  is  predoniinantly  of  Ni — d 
character  The  spin  polarized  lda  total  DOS  by  Terakura  et  al.  [8]  is  shown  as  the  insert. 


form  bonds  with  ligand  electrons  and  become  delocalized,  they  will  leave  space  for 
the  electrons,  and  the  latter  see  a  less  screened  core  and  drop  to  an  even  lower 
energy.  This  process  results  in  a  splitting  much  larger  than  the  overlap  integral  with 
the  leading  term  due  to  the  difference  in  electron-core  interactions.  This  feature 
can  hardly  be  reproduced  by  the  standard  lda  scheme  because  in  such  a  scheme 
there  is  usually  a  physical  boundary  between  core  and  valence  regions,  into  which 
the  valence  electrons  can  never  penetrate.  But  this  is  not  true  for  3d  electrons  as 
they  can  be  very  close  to  th  core  in  reality.  It  was  shown  by  an  ab  initio  correlated 
electronic  structure  calculation  for  Cu — O  clusters  [23]  that  the  HF  results  are 
very  close  to  experimentally  determined  metal  d  and  O  p  occupations,  which  are 
critical  to  explain  the  magnetic  momentum  and  on-site  Coloumb  interaction.  Re¬ 
cently,  Anisimov  et  al.  showed  that  in  an  improved  lsda  model  band  structure 
calculation,  the  metal  non-bonding  d  band  lies  below  all  other  valence  bands  [24]. 

Recently,  angle-resolved  photoemission  experiments  by  Shen  et  al.  [7]  provided 
an  experimental  band  structure  for  NiO.  The  uppermost  antibonding  band  and 
the  O-p  bands  confirm  both  the  l(s)da  and  hf  results.  But  in  the  T  -  X  direction 
( referring  to  Fig.  25  in  Ref.  [  7  ] ) ,  there  are  two  very  weakly  dispersed  experimental 
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bands  below  the  0-p  bands,  which  cannot  be  assigned  to  any  of  the  LDA  and/or 
LSDA  bands.  However,  these  two  bands  fit  fairly  well  with  the  metal  i/,j^-bands  from 
the  hf!  In  fact,  if  we  look  back  at  some  old  experiments,  believed  to  be  well  un¬ 
derstood,  we  can  see  that  they  can  be  interpreted  very  well  by  HF  calculations  but 
not  by  LDA. 

The  satellites  in  the  photoemission  spectrum  of  TMOs  and  their  assignment  first 
proposed  by  Wertheim  and  Hiifner  [25]  is  the  first  of  such  widely  discussed  topics. 
From  L(s)DA  computations,  all  metal  d states  are  concentrated  (see  the  insert  in 
Fig.  2,  from  Terakura  et  al.  [  8  ] )  within  3  eV  below  .  It  is  very  difficult  to  interpret 
the  satellite  structure  within  l(s)DA  without  invoking  complicated  many-particle 
processes.  But,  these  satellites  are  located  just  at  the  positions  of  the  non-bonding 
metal  states  predicted  by  hf,  and  it  is  straightforward  to  assign  the  satellites  to 
these  states.  Comparison  of  the  HF  DOS  of  NiO  in  Figure  2  with  Figure  1  (NiO)  of 
Ref.  [25]  shows  good  agreement. 

In  the  early  70s,  Powell  reported  the  photoabsorption  spectrum  for  a  NiO  crystal 
(Fig.  3  in  Ref.  [26]).  This  spectrum  can  be  understood  also  very  well  by  referring 
to  the  HF  DOS  of  NiO  in  Figure  2.  In  Figure  3,  the  theoretical  photoabsorption 
spectra  are  shown  in  companson  with  the  experiment.  They  were  calculated  by 


O 
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Energy  (  eV  ) 


o 

Ph 

Figure  i  The  theoretical  HF  and  LDA  photoabsorption  spectra  are  shown  in  comparison 
with  the  cxperime.lt  by  Powell  [  26  J .  A  shift  was  made  to  position  the  edge  of  the  platform 
at  about  4.?  eV  in  order  to  compare  with  the  experimental  result. 


ELECTRONIC  STRUCTURE  OF  Ml  TMOs 


193 


integrating  the  HF  and  the  LSDA  lX)S’s  weighted  with  the  X-ray  absorption  cross 
sections  from  cf  to  the  bottom  of  the  metal  d  and  O  p  bands.  The  hf  spectrum 
reproduces  all  the  basic  structure  of  the  experimental  spectrum!  If  one  keeps  in 
mind  that  the  inclusion  of  electronic  correlation  will  open  a  gap  in  the  half-filled 
band  across  the  cp,  which  will  result  in  a  “steeper”  stepping-up  for  the  spectrum 
at  the  beginning,  and  a  higher  platform  between  4.3  to  9. 1  eV.  The  narrowing  of 
the  bands  due  to  correlation  will  open  up  gaps  and  thus  leave  shoulders  at  13.1 
and  14.3  eV.  On  the  other  hand,  it  is  difficult  to  see  any  similarity  of  the  lsda 
spectrum  with  the  experiment. 


Conclusions 

from  the  calculations  discussed  above,  it  is  clear  that  the  computational  tech¬ 
niques  for  the  electronic  structures  of  (MI)  TMOs  still  need  essential  improvements. 
In  LSDA,  because  of  the  shortcomings  of  its  approximations,  the  computed  band 
gaps  of  the  Ml  tmos  and  the  magnetic  moments  are  not  quantitatively,  in  some 
cases  even  not  qualitatively,  consistent  with  experiments.  The  magnetic  correlation 
effect  above  the  Neel  temperature  is  not  necessarily  commensurate  [23],  so  that 
the  current  treatment  of  magnetism  by  simply  dimerizing  the  unit  cell  cannot  hope 
to  fit  experiments  in  every  detail.  HF,  on  the  other  hand,  does  not  include 
correlation  at  all  by  definition,  and  thus  cannot  hope  to  give  a  correlated  tXtcixonic 
structure.  It  should  be  emphasized  though  that  a  UHF  computation  can  give  a  band 
gap  and  an  af  insulating  ground  state  for  NiO  with  artificially  dimenzed  unit  cell 
[21].  The  results  shown  above  imply  that  the  HF  state  may  offer  a  better  reference 
state  for  further  improvements  as  Kohn  and  Sham  suggested  in  their  first  ar¬ 
ticle  [11]. 
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Abstract 

The  structure  of  small  Rh  clusters  compatible  in  size  with  the  particles  that  are  present  in  highly 
dispersed  supported  catalysts  are  studied  at  the  SCF  level  by  means  of  intermediate  neglect  of  differential 
orbital  (iNDO)  calculations.  In  agreement  with  expenmental  determinations  and  calculations  on  other 
metals,  we  have  found  nonperiodic  (precrystalline)  structures  of  Cj,  and  Da  symmetry  the  most  stable 
ones.  An  analysis  of  these  geometries  suggest  that  they  result  from  first-  and  second-order  Jahn-Teller 
distortions  of  icosahedra.  Although  higher  multiplicities  characterize  higher  symmetry  structures,  we 
find  that  the  lower  symmetry  structures  are  characterized  by  low  multiplicities.  The  electronic  characteristics 
of  the  surface  sites  are  analyzed  at  the  ci  level.  In  contrast  to  extended  periodic  surfaces,  which  are 
charactenzed  by  negative  charge  density  on  the  metal  atoms,  the  surfaces  of  the  small  clusters  appear  as 
electron  deficient  structures,  with  negative  chaige  localized  on  the  internal  atoms.  This  may  be  a  key 
factor  in  understanding  the  different  catalytic  activity  of  metal  surfaces  and  supported  metal  catalysts. 
©  1993  John  Wiley  &  Sons.  Inc. 


Introduction 

The  field  of  metal  cluster  chemistry  has  reached  a  particularly  exciting  point  in 
its  development  because  the  size  of  the  cluster  being  characterized  may  be  large 
enough  for  meaningful  comparison  of  their  physical  and  chemical  properties  with 
bulk  metal  and  metal  surfaces,  and,  on  the  other  hand,  sufficiently  small  as  to  be 
able  to  examine  by  quantum  chemical  techniques  [1]. 

A  metal  cluster  compound  has  been  defined  as  a  group  of  two  or  more  metal 
atoms  where  direct  and  substantial  metal  bonding  is  present  [2].  The  size  of  the 
arrangement  can,  thence,  vary  from  two  to  several  hundred  atoms,  and  most  of 
the  research  in  this  field  has  focused  on  the  way  several  properties,  such  as  chemical 
hardness  [3],  ionization  potentials  [4],  electron  affinities  [4],  binding  energies 
(BEs)  [5],  and  polarizabilities  [6],  change  with  size,  allowing  an  examination  of 
the  transition  from  molecular  to  metallic  behavior  [7],  The  geometry  of  a  metal 
cluster  also  changes  from  a  polyhedron  (precrystalline)  structure  to  a  reticular, 
lattice-based  one,  defining  a  transition  for  a  certain  number  of  atoms  that  depends 
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on  the  element  under  consideration  [4-8].  In  this  framework,  the  structure  of  a 
cluster  is  one  of  the  most  controversial  subjects,  because  it  is  both  difficult  to  probe 
experimentally  and  hard  to  calculate.  This  is  especially  the  case  with  transition 
metal  atoms,  and  their  agglomerates,  many  of  which  are  of  great  industrial 
importance. 

Cluster  sizes  of  particular  stability  are  often  revealed  in  a  mass  spectnim  by 
striking  maxima  in  the  envelope  of  peak  intensities.  The  number  of  atoms  associated 
with  the  maxima  which  defines  the  most  stable  structures  (magic  numbers)  [9] 
depends  on  the  bonding  character  of  the  atoms  forming  the  cluster  [lOj.  The  van 
der  Waais  bonded  rare  gas  clusters  have  magic  numbers  of  a  complete  icosahedra 
[6,9,10],  whereas  the  smallest  magic  numbers  of  alkali  and  noble  metals  correspond 
to  the  shell  structure  of  the  valence  electrons  [6,9, 10] .  Magic  numbers  corresponding 
to  icosahedral  packing  of  Na,  K,  Ca,  Ba,  Sr,  Mg,  Ni,  Co,  and  Cu  atoms  have  been 
also  found  [8-10].  However,  the  icosahedral  clusters  have  the  same  magic  numbers 
as  the  cubooctahedral  structures,  and  the  question  then  naturally  arises  as  to  which 
one  is  formed  more  easily. 

Transition  metal  clusters  are  important  in  catalytic  research,  where  highly  dis¬ 
persed  metal  catalysts,  of  particle  size  small  enough  to  be  considered  precrystalline 
structures,  are  associated  with  the  higher  activity  due  to  the  occurrence  of  a  larger 
concentration  of  active  sites.  In  order  to  understand  their  particular  reactivity,  the 
geometric  and  electronic  characteristics  of  the  different  sites  must  be  accurately 
determined.  Modern  quantum  chemical  technologies  that  include  gradient-driven 
geometry  optimizing  algorithms  are  particularly  suited  to  examine  the  individual 
bonding  behavior  of  the  elements  that  make  up  precrystalline  structures,  and,  in 
turn,  can  be  used  for  the  calculation  of  the  electronic  descriptors  on  the  different 
sites  cf  a  given  optimized  structure. 

When  dealing  with  systems  of  catalytic  importance,  mainly  associated  with  ele¬ 
ments  of  the  second  and  third  transition  series,  only  small  clusters  ( three  or  four 
atoms)  have  been  treated  in  configuration  interaction  calculations  [11-15],  Mo¬ 
lecular  dynamics  [10],  as  well  as  Monte  Carlo  [11]  simulations,  however,  have 
been  used  in  an  interesting  fashion  to  analyze  the  precrystalline  to  reticular  transition 
in  extended  systems  as  a  function  of  the  number  of  atoms  required  for  the  transition 
structure. 

Because  of  our  interest  in  reaction  mechanisms  on  supported  catalysts,  we  are 
looking  for  an  accurate  and  quantitative  description  of  the  geometric  and  electronic 
characteristics  of  the  different  reaction  sites  in  the  smallest  structures  compatible 
with  those  that  can  be  found  in  the  real  support. 

Rhodium  clusters  have  industrial  uses  as  catalysts  in  a  variety  of  hydrocarbon 
oxidation  reactions,  perhaps  the  best  known  use  that  in  automotive  exhaust  anti- 
polution  systems  [16-18].  Considerable  progress  has  been  made  in  the  knowledge 
of  the  influence  of  the  particle  size,  surface  structure,  and  surface  composition  on 
the  reactivity  of  Rh  clusters.  Particle  sizes  as  small  as  1  nm  have  been  characterized 
in  supported  catalysts  [19],  a  size  that  is  associated  with  precrystalline  structures. 
Surface  atoms  represent  the  majority  of  the  atoms  in  these  particles,  and  sites  of 
different  topologies  (kinks,  edges,  and  comers)  are  defined.  Theoretical  investigation 
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is,  however,  a  challenging  task,  primary  due  to  the  large  number  of  incomplete  d 
shells  of  the  metal  atoms,  which  lead  to  a  great  number  of  low  lying  states  and 
associated  structures.  The  presence  of  so  many  states  causes  serious  convergency 
problems  when  self-consistent  field  (SCF)  techniques  are  applied,  and  most  often 
necessitates  a  treatment  that  includes  electron  correlation.  These  concerns  make 
examination  of  open-shell  clusters  using  SCF-based  techniques  that  can  address 
questions  of  structure  and  multiplicity  difficult  to  apply. 

In  this  work  we  apply  a  version  of  the  intermediate  neglect  of  differential  overlap 
(INDO)  model  [20],  at  both  the  SCF  and  the  configuration  interaction  (ci)  level, 
using  the  ZINDO  program  [  2 1  ]  to  determine  the  structure  as  well  as  the  electronic 
characteristics  of  the  different  sites  in  the  small  Rh  particles  that  may  be  present 
in  highly  dispersed  supported  catalysts.  We  stress  may  here,  for  in  this  preliminary 
study  we  have  not  considered  the  interesting  possibility  that  the  support  itself  might 
influence  structure  as  well  as  electronic  properties. 

Computational  Details 

Most  often  each  calculation  starts  as  configuration-averaged  Hartree-Fock 
(CAHF),  with  an  average  multiplicity  for  the  number  of  electrons  considered  [22a]. 
A  small  CI  using  these  orbitals  on  several  interesting  multiplicities  then  indicates 
which  states  should,  in  general,  be  pursued  through  the  restricted  open-shell  Hartree- 
Fock  ( ROHF )  procedure  [  22b  ] .  For  each  structure  and  possible  multiplicity,  a  sep¬ 
arate  geometry  search  is  made  using  the  rohf  procedure  as  described  below. 

The  geometry  optimization  step  of  the  calculation  is  particularly  important  in 
obtaining  a  proper  description  of  the  geometric  and  electronic  parameters  that  are 
responsible  for  and,  at  the  same  time  may  justify,  the  different  catalytic  activity  of 
the  surface  sites  defined  on  a  given  structure.  The  calculated  geometries  are  the 
result  of  a  full  optimization  of  the  coordinates  (interatomic  distances  and  angles) 
without  any  constrain  on  their  variation.  The  Head  and  Zerner  technique  [23,24] 
has  been  used,  with  the  Broyden-Fletcher-Goldfarb-Shanno  (BFGS)  algorithm  to 
update  the  inverse  Hessian  matrix.  For  a  complete  search  of  the  structure  of  min¬ 
imum  energy  on  the  potential  energy  surface,  different  initial  geometries,  associated 
with  different  symmetries  of  the  cluster,  have  been  defined  as  initial  guesses  for  the 
geometry  optimization  routines,  including  both  polygonal  structures  and  pieces  of 
the  bulk.  In  this  way.  the  relative  stability  of  different  stable  geometries  that  may 
be  present  in  the  precrystalline  structures  are  also  compared.  The  number  of  atoms 
in  the  cluster  has  to  be,  then,  compatible  with  the  definition  of  octahedral,  icosa- 
hedral,  fee,  and  hep  structures  free  of  undesirable  effects  of  nonregular  environments 
on  the  atoms  of  the  borders.  Rhu  and  Rh,y  match  these  requirements;  the  latter 
are  close  in  size  to  the  1  nm  particles  characteristic  of  highly  dispersed  catalysts. 
Although  pieces  of  bulk  (fee  and  hep)  structures  may  be  a  priori  ruled  out  for 
precrystalline  structures,  because  the  precrystalline  to  reticular  transition  would 
occur  for  a  cluster  size  larger  than  I  nm.  it  is  nevertheless  interesting  to  compare 
the  energy  of  the  crystalline  structures  with  those  of  the  precrystalline  ones  for  the 
different  cluster  sizes. 


Figure  I.  Starting  Rh|t  structures  for  the  geometry  opiimizaiion  procedure,  (a)  icosahedial 
symmetry.  A;  (b)  "d^"  symmetry.  (c)  octahedral  symmetry.  O*;  (d)  close-packed 
Djji,  (c)  close-packed  fee  structure  (ci,  r- arc  different  views  of  the  same  structure),  Djj. 
Starting  geometnes  represented  by  Figures  (a),  (b).  and  (c)  evolve  into  the  same  final 
structure  depicted  in  Figure  2. 
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Table  1.  Total  energies  (a  u.)  assoaated  with  the  most  stable  multiplicities  of  the  different  Rhu  geometries 

after  a  full  optimization  procedure  * 


M 

hep 

fee 

Oc 

D, 

Ic 

2 

-294.81 

-294.87 

-294.69 

-294.87 

-294.87 

4 

-294.76 

-294.86 

-294.67 

-294.86 

-294.85 

6 

-294.72 

-294.81 

-294.62 

-294.84 

-294.83 

•  Only  the  hexagonal  and  distorted  octahedral  does  not  change  to  a  D3  symmetry. 


The  open-shell  structure  of  the  Rh  atom  (d®s')  defines  a  certain  number  of 
unpaired  electrons  for  the  Rhi3  and  Rhi9  systems.  The  stabilities  of  the  different 
multiplicities  (A/)  have  been  compared  at  the  rohf  level  for  the  different  starting 
geometries.  This  procedure  is  complex,  for  we  find  that  different  multiplicities  form 
different  structures,  or,  perhaps,  vice  versa. 

Two  parameterizations  of  the  INDO  theory  are  used  in  these  studies:  one  for 
geometry,  which  utilizes  two-center  two-electron  integrals  that  are  calculated 
ab  initio,  and  one  to  calculate  the  electronic  descriptors  at  fixed  geometries, 
which  obtains  these  integrals  empirically  from  atomic  spectroscopy  { 20,25-27  ] . 
The  calculation  of  the  electronic  descriptors  as  well  as  the  analysis  of  the  mo¬ 
lecular  orbital  interactions  has  been  done,  in  the  framework  of  this  parameter¬ 
ization,  at  the  Cl  level.  The  resonance  integrals  “/S”  are  chosen  according  to  a 
formula  that  takes  into  account  different  electron  negativities  [ 28  ] .  This  param¬ 
eterization  reproduces  the  available  experimental  geometries  and  excited  state 
manifold  of  Rhj  and  Rh4,  but  does  incorporate  a  correction  for  overbinding 
proportional  to  the  calculated  bond  index  [  28  ] .  Other  aspects  of  the  method 
are  given  elsewhere  [20,25]. 

Results  and  Discussion 

Rhu  Structures 

We  first  compare  the  stability  of  regular  but  nonperiodic  structures  with  that  of 
pieces  of  bulk,  represented  by  means  of  the  same  number  of  atoms,  whose  repetition 


Table  II.  Total  energies  (a.u.)  associated  with  the  different  multiplicities  for  the  different  calculated 

Rtu  geometries.* 


M 

Square  planar 

Tetrahedral 

Tetrahedral  (2.69  A) 

1 

-90.311 

-90.3’’ 1 

-90.038 

3 

-90.308 

-90.337 

-90.019 

5 

-90.248 

-90.282 

-90.030 

7 

-90.161 

-90.168 

-90.045 

*  First  and  second  columns  refers  to  the  full  optimized  square  planar  and  tetrahedral  geometries.  The 
third  column  gives  the  total  energy  of  the  tetrahedral  structure  for  the  bulk  Rh-Rh  interatomic  distances. 
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Figure  2.  Most  stable  Rh  i  j  structure  as  the  result  of  a  full  geometry  optimization  procedure, 
Cjf  Geometric  parameters  are  indicated.  Elearonic  parameters  =  local  density  charges 
on  the  atomic  centers  [q  (a.u.)]  are  as  follows:  ?i  =  ft  =  =  94  =  95  =  -0.128,  ft  =  ft 

=  ft  =  ft  =  fto  =  +0.197,  fti  =  -0.919,  q,i  =  +0.366,  ftj  =  +0.201. 


in  three  dimensions  would  develop  the  three-dimensional  periodic  structures  char¬ 
acteristic  of  the  bulk  metal. 

Among  the  regular  structures,  icosahedral  and  cubooctahedral  structures  can  be 
constructed  by  means  of  either  12  or  13  rhodium  atoms  which  occupy  only  the 
vertices  of  the  icosahedral  or  both  the  vertices  and  the  center  of  the  structure, 
respectively.  However,  1 2  atoms  is  not  an  appropriate  number  to  construct  pieces 
of  fee  or  hep  bulk  structures.  In  addition,  a  comparison  of  the  BE  per  metal  atom 
for  the  Rh|2  and  Rh^  structures  gives  a  higher  stability  to  the  latter  (0.5  eV  for  the 
icosahedral  and  0.25  eV  for  the  octahedral  geometry),  making  Rhiy  a  much  more 
attractive  structure  to  examine. 

For  these  reasons,  we  have  chosen  a  Rh^  cluster  size  to  build  up  structures  of 
icosahedral  of  Ds  and  cubooctahedral  symmetry,  as  well  as  fee  and  hep  pieces  of 
bulk  [Figs.  1  (a)-(e)]  as  starting  structures  for  a  full  geometry  optimization  pro¬ 
cedure.  The  fee  and  hep  structures  can  also  be  described  as  centered  cubooctahedral 
and  anticubooctahedral  symmetries  [1].  Because  they  are  characteristic  of  close- 
packed  metallic  lattices,  we  have  chosen  this  related  nomenclature.  The  octahedral 
structures  depicted  in  Figures  1(c)  and  1(d)  are  almost  equivalent,  as  can  easily 
be  seen  from  a  comparison  of  these  figures.  However,  the  structure  referred  to  as 
cubooctahedral  ( Ic)  has  a  distortion  along  the  r-axis  that  increases  the  distance 
from  the  apex  atoms  to  the  center  by  40%  of  its  regular  value.  Different  Rh-Rh 
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Figure  3.  Most  stable  Rhfj  •‘icosahedral”  structure  after  a  full  geometry  optimization. 
The  Rh-Rh  interatomic  distances  R  ( 4-5  )./?(3-6).R(7-8).R(l-IO).R(2,l2)./?(9,l3) 
are  3%  longer  than  the  others  (see  text). 


interatomic  distancer,  between  2.0  and  2.7  A,  have  been  used  to  build  up  the  initial 
structures.  The  lowest  energies  are  always  achieved  for  closest  interatomic  distances 
of  about  2.5  A,  which  appears  to  represent  a  compromise  between  the  interatomic 
distance  in  the  diatomic  (2.20  A  [28])  and  that  in  the  bulk  (2.69  A  [29]). 

Rh|3  clusters  are  open-shell  structures  characterized  by  a  multiplicity  {M)  larger 
than  1.  Each  structure  has  been  examined  for  different  M  values.  Results  in  Table 
I  demonstrate  that  the  occupation  of  low-lying  energy  levels  stabilizes  the  lowest 
multiplicity.  We  stress  that  each  structure,  associated  with  a  given  Af,  has  been 
derived  from  a  full  optimization  of  ail  distances,  without  imposing  any  geometry 
or  symmetry  constraints.  When  symmetry  results,  it  results  as  a  consequence  of 
the  optimization. 

The  same  result,  related  to  the  higher  stability  of  the  lower  M  values,  has  been 
found  for  Rh4  clusters,  which  show  an  interesting  behavior  relative  to  the  importance 
of  the  geometry  optimization  in  the  definition  of  the  multiplicity  and  vice  versa. 
Starting  from  either  square  planar  or  tetrahedral  structures,  the  optimization  of 
both  multiplicities  and  bond  lengths  gives  a  regular  close-shell  structure,  either 
square-planar,  with  Rh-Rh  bond  distances  of  2.1056  A,  or  tetrahedral,  with  bond 
length  2.2728  K.  These  smaller  structures  yield  interatomic  distances  that  are  close 
to  that  in  the  diatomic.  On  the  other  hand,  if  the  distance  is  kept  frozen  to  that 
observed  in  the  bulk  (2.69  A),  the  most  stable  structure  is  associated  with  6  unpaired 
electrons  (Table  II ).  This  result  is  in  agreement  with  density  functional  calculations 
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Figure  4.  The  frontier  molecular  orbitals  obtain  for  Rh^  in  /*  symmetry  from  a  Cahf 
calculation  with  four  electrons  in  eight  orbitals  Note  that  the  closeness  in  energy  of  these 
orbitals  leads  to  very  many  nearly  degenerate  states  Energies  arc  in  hartrees  The  dashed 
lines  indicate  the  doubly  occupied  space,  the  "open"  space,  and  the  virtual  space. 


[30]  that  model  the  Rh(  111)  surface  structures  by  means  of  regular  tetrahedron 
constructed  with  four  Rh  atoms  separated  by  the  bulk  interatomic  distance.  Six 
unpaired  electrons  give  the  lowest  energy  under  these  conditions.  However,  when 
the  bond  length  is  allowed  to  relax  to  the  equilibrium  interatomic  distances,  the 
electronic  configuration  changes  from  (^^2)  la?lt2le‘*2t23t22e‘'2aTltflt^  to  lowest 
energy  ( '/1| ) Iailt2le^2t23t22e''2at It*.  The  decrease  in  multiplicity  is  clearly  as¬ 
sociated  with  the  relaxation  of  the  interatomic  distances,  leading  to  larger  spin¬ 
pairing  due  to  the  more  efficient  overlap  of  the  atomic  orbitals,  and  the  resulting 
stronger  interactions.  The  comparison  of  the  Miilliken  populations  on  the  Rh  atoms 
for  both  interatomic  distances  shows  a  larger  population  of  the  d  orbitals  in  the 
septet  (5s‘’''’5p‘’*’4d’* ''  vs.  5s‘’‘‘’5p‘’'^4d’'''').  The  higher  occupation  of  the  more 
diffuse  sp  orbitals  favors  the  more  efficient  overlap  in  the  close-shell  structure.  The 
shortening  of  the  bond  distances  in  small  metal  clusters  is  not  surprising  on  the 
basis  of  the  well-known  contractions  of  the  lattice  parameters  in  small  particles 
associated  with  surface  stress  that  has  been  experimentally  determined  for  several 
transition  metal  atoms  [5,31],  and  which  are  easily  confirmed  in  these  calculations. 

Table  1  summarizes  our  results  for  Rhu,  and  indicates  that  the  most  stable  ge¬ 
ometry  ( Fig.  2 )  is  one  of  near  Cj,.  symmetry  and  is  achieved  after  the  optimization 
of  the  initial  Dsj  symmetry  structure.  The  same  final  geometry  is  obtained  from 
full  optimization  of  the  fee  and  the  icosahedral  starting  structures.  The  fee  structure. 
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Figure  5.  Molecular  orbital  interactions  that  involve  the  unpaired  electron  of  the  Rhu 
structure,  (a)  Bonding  interactions  between  opposite  atoms  on  the  alternate  pentagons. 
Antibonding  interactions  are  between  the  s  orbitals  on  the  pentagons  and  the  p  on  the 
central  atom,  (b)  Bonding  interactions  between  the  d  orbitals  of  the  atoms  on  one  of  the 

pentagons. 


in  particular,  goes  through  a  remarkable  modification  of  its  geometry,  requiring  a 
large  number  of  cycles  in  the  geometry  search  to  finally  reach  the  structure  of  Figure 
2.  An  energy  barrier  is  associated  with  the  transition  from  hexagonal  and  from  a 
distorted  cuboctahedral  to  the  final  structure  depicted  in  Figure  2,  and  thus  these 
structures  represent  proper  minima  on  the  potential  energy  surface.  In  these  two 
cases,  the  change  in  geometry  from  the  initial  one  assumed  to  begin  the  optimization 
procedure  involves  a  shortening  of  the  interatomic  distances  approximately  4%, 
again  favoring  bond  lengths  closer  to  that  in  the  diatomic. 

Icosahedral  structures  have  been  identified  in  several  inorganic  Rh  compounds, 
as  [Rh,2Sb(CO)27]"^  [32].  Cubooctahedral  geometries  have  also  been  found  in. 
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Figure  6.  The  frontier  molecular  orbitals  obtained  for  Rhj  j  in  /*  symmetry  from  a  cahf 
calculation  with  five  electrons  in  eight  orbitals.  Note  that  the  closeness  in  energy  of  these 
orbitals  leads  to  very  many  nearly  degenerate  states.  Energies  are  in  hartrees.  The  dashed 
lines  indicate  the  doubly  occupied  space,  the  “open”  space,  and  the  virtual  space. 


for  example,  [Rhi3(CO)24H3]"’  and  [Rh|2Pt(CO)24]~‘‘  [33].  The  presence  of  the 
carbonyl  ligand,  which  is  known  to  involve  important  charge  transfer  interactions 
with  the  metal  [1],  preclude,  unfortunately,  any  inference  from  these  structures  to 
that  of  the  naked  Rhi3  clusters.  Rather  both  icosahedral  and  cubooctahedral  are 
made  plausible. 

According  to  our  calculations,  the  structure  of  Rhi3  clusters  (Fig.  2)  can  be 
described  as  distorted  icosahedron,  defined  by  12  fivefold  axis  and  20  triangular 
faces.  Ten  of  the  12  vertices,  on  the  other  hand,  define  two  alternate  pentagons, 
while  the  others  are  aligned  with  the  center  of  the  structure,  perpendicular  to  them. 
It  is  distorted  in  the  sense  that  the  edges  are  no  longer  of  equal  length  and  th^ 
inversion  center  is  lost.  The  full  symmetry  characteristic  of  the  icosahedra  [34], 
with  120  symmetry  operations,  does  not  apply. 

In  this  particular  case  the  distortion  from  a  regulai  icosahedra  involves  mainly 
(Fig.  2): 

i.  A  change  of  the  relative  sizes  of  the  alternate  pjntagons,  from  two  equal  in  a 
regular  icosahedra  [Fig.  1  (a)],  to  a  larger  and  a  smaller  one,  of  interatomic  distances 
of  2.31  and  2.62  A,  respectively. 

ii.  An  asymmetric  lengthening  of  the  distances  along  the  z-axis,  more  important 
between  the  larger  pentagon  and  the  adjacent  apex  atom. 

iii.  A  change  in  the  distance  from  the  atom  in  the  center  to  the  atoms  in  the 
pentagons  which,  instead  of  equidistant  as  in  a  regular  icosahedra,  is  closer  to  the 
larger  pentagon. 

Origin  of  the  Distortion:  According  to  the  Jahn-Teller  (jt  )  theorem  [35,36],  any 
nonlinear  system  where  there  is  orbital  degeneracy  has  a  vibrational  mode  which 
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Figure  7.  Starting  Rhu  structures  for  the  geometry  optimization  procedure,  (a)  Dji,  sym¬ 
metry;  (b)  closed  packed  fee  structure;  (c)  close  packed  hexagonal  structure.  Both  the  Du, 
symmetry  and  the  fee  structure  evolves  to  the  same  final  structure  depicted  in  Figure  6. 


will  lower  the  symmetry  and  also  lower  the  energy  of  the  system.  A  Jahn-Teller 
distortion  would,  then,  change  the  geometry  of  the  system  in  order  to  decrease  its 
symmetry  in  such  a  fashion  that  the  electronic  state  is  no  longer  spatially  degenerate. 
Calculations  by  other  authors  [  1 1 ,37-39  ]  have  shown  that  the  most  stable  structure 
of  a  metal  cluster  is  strongly  influenced  by  the  jt  effect,  as  we  find  here.  Because 
of  the  nature  of  the  distortion  in  Rhiy  we  examine  the  structure  of  Rhfy.  The  hope 
here  is  that  the  positive  ion  has  states  that  are  further  separated  in  energy  than  the 
neutral  and  the  results  could  guide  our  thinking  on  the  neutral.  Starting  from  an 
icosohedron,  a  geometry  optimization  is  made  for  each  multiplicity.  The  lowest 
energy  of  all  these  structures  is  a  singlet  of  C21,  symmetry  (see  caption  in  Fig.  3). 
This  structure  might  be  understood  as  resulting  from  an  '//„  state  from  an  idealized 
//,  structure  obtained  from  a  cahf  calculation  in  which  four  electrons  are  averaged 
over  eight  orbitals,  the  fi„  and  hg  orbitals  of  Figure  4.  Other  schemes  we  tried  either 


Figure  8.  The  most  stable  Rhu  structure  (£>5*)  calculated  from  a  full  geometry  optimi¬ 
zation.  Geometry  parameters  are  indicated  Electronic  parameters  =  local  density  charges 
on  the  atomic  centers  [q  (a.u.)]  are  as  follows.  9i  =  ft  =  ft  =  94  =  95  =  9i4  =  9i5  =  9i6  = 

9i7  =  9i«  =  +0.360.  96  =  9r  =  9i  =  9«  =  9io  =  -0.480.  9ii  =  91,  =  -1.200.  9i!  =  9i9  = 

+0.690. 

did  not  lead  to  icosohedron,  or  led  to  orbitals  which  did  not  have  the  symmetry  of 
the  nuclear  framework.  This  '//„  state  undergoes  a  distortion  along  an  hg  mode 
and  gives  a  '  E2U  state  in  Did  which  further  undergoes  a  first  order  JT  distortion  to 
C2A  along  an  e^g  mode.  As  is  usually  found  for  highly  symmetric  structure  with  a 
large  number  of  degenerate  orbitals,  the  lowest  h  structure  is  not  a  singlet  but  a 
^Hg  state.  We  calculate  that  the  '//„  state  lies  0.03  eV  above  the  ^Hg  one  for  the  h 
geometry.  The  lowest  quintet,  however,  lies  considerably  higher  than  the  singlet  in 
the  final  Cih  structure.  Within  0. 1  eV  of  our  lowest  calculated  ^Hg  state  lie  more 
than  350  states.* 

In  the  Cy,  structure  the  and  hg  orbitals  (the  frontier  orbitals  active  in  the 
CAHF)  are  calculated  nearly  degenerate,  with  an  energy  difference  obtained  from 

'  The  idealized  /»  structure  we  start  with  corresponds  to  a  perfect  icosohedron  that  results  after  sym¬ 
metrizing  the  Cv,  structure  obtained  from  the  geometry  optimization.  Because  of  the  large  number  of 
molecular  orbitals  very  close  in  energy,  pure  states  are  very  hard  to  separate,  as  they  tend  to  mix  during 
the  SCF  cycles.  Molecular  orbitals  characteristic  of  an  icosohedron  can  only  be  kept  by  means  of  Cahf 
calculation.  In  order  to  properly  choose  a  good  set  of  orbitals  to  be  averaged,  we  started  by  the  analysis 
of  the  orbital  that  result  from  extended  HUckel  (  eht  )  calculations.  From  the  eht  results  we  first  averaged 
four  electrons  in  the  ri„  orbital.  However,  when  the  pure  spin  states  were  projected  by  means  of  multi¬ 
reference  Cl  calculations,  the  orbitals  were  always  involved.  We  therefore  extend  the  cahf  calculations 
of  four  electrons  in  eight  orbitals.  Calculations  using  the  resultant  basis  gives  a  reasonable  explanation 
of  the  JT  distortion  as  discussed 


PRECRYSTALLINE  STRUCTURES  IN  Rh  CATALYSTS 


207 


the  CAHF  procedure  of  only  0.001  eV  (Fig.  3).  The  three  Lowest  Unoccupied 
Molecular  Orbital  (LUMO)  orbitals  are  mainly  fonned  from  contributions  of  the  s 
(and  some  d)  orbitals  of  the  Rh  atoms  of  the  pentagonal  rings,  and  the  p»,  p^,  p.- 
orbitals  on  the  central  atom. 

The  occupation  of  one  of  these  eight  nearly  degenerate  orbitals  to  form  the  neutral 
molecule  brings  about  a  new  interaction,  which  is: 

—  bonding  on  the  s  orbitals  of  opposite  atoms  of  different  rings  (Fig.  5(a)] 

—  antibonding  between  the  s  orbitals  on  the  atoms  of  the  rings  and  the  p  orbitals 
on  the  central  atom  [Fig.  5(a)] 

—  bonding  on  the  d  orbitals  on  the  atoms  1,  2.  3,  4.  and  5  (Fig.  3)  on  one  of  the 
pentagons  [Fig.  5(b)]. 

The  first  interaction  shortens  the  interatomic  distances  between  adjacent  atoms 
on  different  pentagons  by  0. 1  A,  from  2.62  in  the  cation  to  2.52  A  in  the  neutral 
molecule.  The  d  interaction,  on  the  other  hand,  shortens  the  interatomic  distances 
between  the  atoms  of  one  of  the  rings  from  2.62  to  2.32  A,  leading  to  a  more 
effective  overlap  of  the  s  and  p  orbitals  on  the  central  atom.  This  interaction  moves 
the  central  atom  closer  to  the  other  pentagon,  due  to  the  antibonding  character  of 
the  sp  interaction. 

In  spite  of  considerable  work,  and  our  experience  learned  from  RhJ^j.  we  have 
not  been  able  to  track  the  reason  for  the  distortion  of  Rh^  from  //,.  Our  final  state 
is  a  doublet  with  C,  symmetry,  a  structure  which  is  very  nearly  Csf. 

CAHF  calculations  with  five  electrons  in  eight  orbitals  (li„  and  /;^)  does  yield  an 
4  structure.  After  Cl  our  lowest  state  is  a  state  which  should  be  JT  stable.  The 
lowest  octet,  is  calculated  310  cm''  higher  in  energy,  than  a  quartet.  ‘’/4,  1990 
cm''  above  The  lowest  doublet  is  calculated  some  3000  cm  '  higher  in 
energy,  but  this  is  the  parent  that  leads  to  the  lowest  energy  C,  structure.  Some  250 
states  lie  within  0.1  eV  of  the  ^Aif.  state.  A  feasible  chain  for  the  distortion  is 
-*■  -*■  '/li^fCjt).  The  first  step  is  first-order  jt  through  an  hg 

mode.  The  second  is  a  second-order  JT  distortion  along  an  au,  mode  that  mixes 
the  ‘A  ,g(  Dij)  state  with  a  ^A  i„(  state  that  we  calculate  nearly  degenerate.  Indeed, 

the  lower  of  the  two  depends  on  the  size  of  the  Cl ,  We  have  not  proven  this — a 
mathematical  problem,  but  not  a  physical  one.  Starting  from  nearly  any  structure, 
we  obtain  the  same  lowest  energy  structure  and  multiplicity. 

Returning  for  the  moment  to  the  density  of  states  of  near  lying  energy,  it  may 
be  that  many  doublets  are  sampled  at  any  reasonable  temperature,  each  with  some¬ 
what  differing  geometries  leading  to  a  somewhat  flexible  structure. 

Consequences  of  the  Distortion.  In  the  resulting  Rh,,  structure  there  are  several 
surface  sites  (Fig.  2).  Because  of  the  different  electronic  environment  and  local 
density  charges,  each  one  can  be  associated  with  a  different  catalytic  activity.  Single 
coordination  sites  may  correspond  to  atoms  on  the  pentagons  or  to  apex  atoms, 
having  a  different  local  density  due  to  the  different  environment.  In  the  same 
way.  bicoordinated  sites  may  be  defined  between  atoms  on  one  pentagon,  between 
an  apex  and  an  atom  of  a  pentagon  or  between  atoms  of  two  different  pentagons. 
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and  three  coordinated  sites  between  iwo  atoms  of  one  of  the  pentagons  and  the 
adjacent  apex  or  between  atoms  of  different  pentagons.  The  interactions  of  an 
adsorbate  will  be  different  on  each  of  the  different  sites,  and  they  have  to  be 
analyzed  separately  if  we  assume  the  structure  is  fairly  rigid,  or  that  absorption 
may  lock  in  the  different  sites. 

Although  positive  and  negative  local  charges  are  defined  on  the  surface  atoms, 
the  surface  of  the  cluster  is,  on  the  average,  electron  deficient,  bearing  a  0.91  a.u. 
positive  charge  (0.076/surface  atom).  A  charge  of  the  same  magnitude  and  opposite 
sign  is  concentrated  on  the  atom  in  the  center. 

The  positive  charge  density  on  the  surface  of  the  cluster  contrasts  with  the  local 
negative  charge  density  on  the  terrace  atoms  in  the  bulk  [28],  and  is  a  key  factor 
in  understanding  the  different  catalytic  activity  of  both  structures. 

Rhig  Structures 

Larger  structures  can  be  grown  from  the  13  atom  icosahedron  by  adding 
pentagonal  i.ngs  around  each  vertex  [40],  The  arrangement  Rh|9  is  obtained  by 
adding  one  pentagonal  ring  around  a  fivefold  axis  and  one  additional  atom  on  it 
[40]  (see  Fig.  6).  This  double  icosahedron  [Fig.  7(a)]  is  formed  by  two  interpe¬ 
netrating  icosahedra  sharing  two  central  atoms  and  one  pentagonal  ring.  This  cluster 
size  is  also  compatible  with  fee  and  hep  pieces  of  bulk  [Figs.  7(b)  and  7(c)]. 

Nineteen  Atom  Di  structures  have  been  found  in  Pt  and  Au  compounds  [1,40]. 
The  “double  icosohedron”  structure  has  also  been  characterized  for  laser-evap¬ 
orated  Ni|9  clusters  [9c].  Because  of  the  small  size  of  the  cluster,  it  seems  rea¬ 
sonable  that  the  lower  energy  be  associated  to  the  precrystalline  structure  rather 
than  the  reticular  one. 

The  structures  depicted  in  Figures  7(a)-(c),  constructed  with  Rh-Rh  interatomic 
distances  of  2.5  A,  form  starting  geometries  for  a  full  geometry  optimization,  free 
of  constrains  on  the  variation  of  any  of  the  coordinates  or  symmetry  constraints. 
The  lowest  energy  structure  we  obtain  is  given  in  Figure  8.  It  is  of  Ds*  symmetry. 

The  behavior  of  the  Rhi9  cluster  is  very  similar  to  that  of  Rhi3  in  two  main 
aspects; 

1 .  The  final  structure  results  from  the  same  kind  of  distortion  described  for  Rh|3, 
which  stabilizes  a  smaller  central  pentagon  with  Rh-Rh  distances  of  2.3  A  and  two 
larger  pentagons  with  Rh-Rh  distances  of  2.56  A.  However,  there  is  no  change  in 
the  Dsi,  symmetry  from  the  initial  (Fig.  5(a)]  to  the  final  (Fig.  6)  structures,  and 
the  complications  we  encountered  in  the  smaller  structure  are  avoided. 

2.  The  fee  starting  structure  goes  to  the  same  distorted  Ds  one  upon  geometry 
optimization,  while  the  hep  keeps  its  original  symmetry,  decreasing  the  bond 
length  to  an  average  value  of  2.3  A,  giving  a  structure  2.08  eV  less  stable  than  the 
Dsh  one. 

In  the  same  way  previously  described  for  Rho  clusters,  the  distorted  geometry 
is  associated  with  a  distribution  of  positive  and  negative  charges  on  the  surface, 
negative  on  the  atoms  of  the  smaller  pentagon  and  positive  on  the  others  (Fig.  6). 
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The  surface  charge  is  positive.  2.4  a.u.  A  negative  charge  of  the  same  magnitude  is 
uniformlv  distributed  between  the  two  central  Rh  atoms.  Different  surface  sites, 
and  in  particular  electron-deficient  ones,  are  obtained  for  these  structures,  and 
characterize  the  structural  and  electronic  properties  of  the  active  sites  in  the  small 
particles  of  highly  dispersed  catalysts. 

The  larger  stability  of  the  structures  of  fivefold  symmetry'  for  both  Rh^and  Rhi9 
clusters  lead  us  to  infer  that  these  may  be  the  preferred  symmetry  of  the  small  Rh 
particules  that  are  present  in  highly  dispersed  supported  catalysts,  and  the  sites 
defined  in  these  structures  might  be  the  surface  sites  of  catalytic  activity. 

The  energy  of  the  different  structures  for  a  particular  cluster  size  is  the  result  of 
a  balance  between  the  surface  and  the  bulk  energy.  The  structure  ,i  f  the  crystalline 
habit  is  the  most  stable  for  a  bulk  metal  (fee  for  Rh).  In  the  small  clusters,  on  the 
other  hand,  most  of  the  atoms  are  surface  atoms,  and  the  surface  energy  becomes 
the  dominant  contribution  to  the  total  energy.  Icosahedral  symmetries  are  char¬ 
acterized  by  a  large  concentration  of  triangular  faces,  w  hich  are  the  minimum  part 
of  an  fcc(  111)  structure,  associated  with  the  larger  packing  density  and  the  lower 
total  energy.  This  gives  a  plausible  explanation  for  the  larger  stability  of  the  ico- 
sahedral-like  structures  m  small  clusters  of  fee  metals. 

Previous  studies  of  the  structure  of  transition  metal  clusters  also  seem  to  favor 
icosohedral  structures.  However,  distortions  of  a  lower  symmetry  are  not  often 
discussed,  as  the  structures  obtained  are  often  the  results  of  geometry  optimization 
with  symmetry  constraints.  Ah  initio  multireference  calculations  suggest  an  />,  sym¬ 
metry  with  a  lowest  ^Af,  state  of  icosohedral  symmetry  for  both  Li  n  [41  ]  and  Cuu 
structures  [42].  Dynamical  calculations  on  the  structure  of  Ni„  clusters  («  =  4- 
13).  minimizing  the  BE  using  a  near  first  principle-corrective  effective  medium 
theory  (CEM )  yield  an  //,  symmetry  0. 1  eV  more  stable  than  the  Oi,  or  Dy,  ones  for 
Ni,3[43]. 

The  results  presented  in  this  paper  show  the  solution  of  the  structure  of  small 
transition  metal  cluster  when  the  coordinates  are  allowed  to  completely  relax.  One 
might  expect  that  only  after  this  relocation  can  a  proper  descnption  of  the  electronic 
feature  that  characterizes  the  different  surface  sites  be  determined.  We  might  add. 
however,  that  with  so  many  low  lying  states,  the  actual  sites  might  be  characterized 
by  an  average,  or  be  determined  by  a  specific  interaction  with  an  absorbate. 

In  general  agreement  with  systematic  calculations  of  the  properties  of  metal  clus¬ 
ters  as  a  function  of  size  [43].  we  find  an  increase  of  the  BE  per  atom  from  1.10 
eV  for  Rh4.  2.97  eV  for  Rhi:.  3.55  eV  for  Rhn.  and  4.48  eV  for  Rhi^. 

Conclusion 

We  have  calculated  at  the  SC  r-ci  level  the  most  stable  structures  of  the  small  Rh 
clusters.  Rh, ,  and  Rhi^,  that  may  be  present  in  highly  dispersed  supported  catalysts, 
as  well  as  the  structural  and  electronic  characteristics  that  helps  define  the  particular 
reactivity  of  the  different  surface  sites. 

In  agreement  with  previous  calculations  and  experimental  research  on  other 
transition  metal  atoms  [8-1 1].  a  lower  energy  is  found  for  the  structures  of  fivefold 
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symmetry.  As  has  been  described  for  the  Rh,,  molecule,  Jahn-Tcller  distortions 
play  an  important  role  in  the  determination  of  the  final  geometries.  In  addition, 
we  note  that  the  final  structures  generally  form  lower  multiplicities,  rather  than  the 
higher  multiplicities  that  seem  to  characterize  the  higher  symmetry  structures. 

In  the  final  structure,  diflerent  surface  sites  are  defined  that  might  show  a  different 
reactivity  towards  a  given  adsorbate.  The  surface  .is  a  whole  is  characterized,  for 
both  Kh(  t  and  Rh|.,  by  an  electron  deficiency,  which  will  help  determine  the  nature 
of  the  interactions  with  a  given  adsorbate. 

In  the  final  analysis  we  caution,  however,  that  the  support,  itself,  might  effect 
the  electronic  characteristics  of  the  surface  cites.  Preliminary  studies  suggest  sizable 
barriers  for  CO  migration  on  the  various  surface  sites,  and  that,  in  fact,  each  cite 
might  be  expected  to  have  a  somewhat  different  reactivity  [44], 

The  eharactcristics  of  the  resulting  structure  demonstrate  the  importance  of  a 
careful  geometry  optimization  procedure  in  the  analysis  of  the  reactivity  of  a  given 
metal  cluster,  capable  of  differentiating  the  surface  sites  whose  structural  and  elec¬ 
tronic  characteristics  arc  far  from  those  that  may  be  present  in  regular  structures 
of  a  higher  symmetry. 
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Abstract 

Self<onsislent  one-electron  state  calculations  are  earned  out.  for  the  first  time,  for  a  gallium  vacancy 
m  the  GaASi-;tP»  alloy.  The  cluster  model,  within  the  framework  of  the  multiple-scattering  Xa  theory, 
is  used  to  calculate  several  charge  states  of  the  vacancy.  Suitable  clusters  have  been  considered  in  order 
to  simulate  the  phosporous  concentration  x  in  the  alloy  and  how  the  nearest-neighbors  configurations 
affect  the  vacancy-related  levels.  It  is  found  that  the  defect  introduces  a  deep  energy  level  into  the  band 
gap  which  follows  the  host-valence  band  edge  as  .v  varies  The  same  trend  in  donor  and  acceptor  levels 
is  observed,  with  a  very  small  Mott-Hubbard  potential  ei.ergy.  iS  1993  John  Wiley  &  Sons.  Inc 

Introduction 

Among  the  native  defects  which  appear  in  IIl-V  semiconductor  compounds, 
vacancies  are  the  most  common  ones.  They  are  usually  mobile  and  can  thus  migrate 
in  the  material,  being  trapped  by  other  defects  or  impurities  and  forming  complexes. 
They  introduce  deep  localized  stales  into  the  fundamental  band  gap  of  the  semi¬ 
conductor  and  may  appear  as  neutral,  positively  or  negatively  charged  depending 
on  the  position  of  the  Fermi  level. 

It  has  been  shown  that  an  alternative  way  of  investigating  the  electronic  properties 
of  localized  defects  in  III-V  semiconductors  is  to  gradually  vary  the  host  material, 
for  example,  by  alloying  or  applying  hydrostatic  pressure,  and  then  to  study  the 
changes  induced  in  the  defect  properties  [1,2]. 

Experiments  where  the  host  band  structure  of  GaAs  is  continuously  altered 
through  chemical  alloying  with  P,  have  been  proposed  to  investigate  the  particular 
case  of  transition  metal  impurities  in  the  GaASl-J^P^  alloy  [3,4].  Due  to  perturbation, 
band  edges  of  the  host  band  structure  are  shifted  and  impurity  states  are  affected 
differently  depending  on  their  detailed  nature.  The  same  trend  observed  in  the 
impurities  acceptor  binding  energies  along  the  alloy  has  been  recently  confirmed 
by  us  through  calculations  performed  within  the  framework  of  the  multiple-scat¬ 
tering  X„  cluster  model  [  5  ] . 

In  the  present  work  we  apply  the  molecular  cluster  model  and  the  multiple¬ 
scattering  X„  theory  to  investigate  the  electronic  properties  of  a  Ga,  unrelaxed 
vacancy  in  GaASi_,P,.  Self-consistent  electronic  state  calculations  are  carried  out 
for  the  neutral,  positively,  and  negatively  charge  states  of  the  defect  ( V  °,  V  ^ ,  V  “ ). 
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Figure  I.  Schcmiitic  representation  of  the  17-atom  cluster  models  used  to  simulate  the 
GaAs|.  ,1’,  host  alloy  for  tu)  jr  =  0;  (b).r  =  0.25: (c)x  =  0.5;  (d)  jt  “  0.75;  (c)  jr  »•  I. 


We  analyze  the  trends  in  the  vacancy  energy  levels  and  discuss  the  stability  of 
charge  states  as  x  varies  along  the  alloy. 


Theoretical  Model 

The  results  presented  here  for  the  Ga  vacancy  in  the  GaAS)_,P;t  alloy  were  ob¬ 
tained  through  Watson-sphere  terminated  17-atom  clusters  centered  at  the  substi¬ 
tutional  Ga  site  [6,7].  The  cluster  models  consist  of  a  central  atom  surrounded  by 
4  nearest  neighbors  and  1 2  next-nearest  neighbors.  By  starting  with  the  perfect  host 
alloy  and  con.sidcring  the  extremum  jc  =  0,  i.e.,  the  GaAs,  the  cluster  then  comprises 
a  central  Ga  atom,  the  first  shell  of  4As  atoms,  and  the  second  shell  of  1 2Ga  atoms. 
The  P  concentration  x  in  the  alloy  is  simulated  by  replacing  some  of  the  4  As 
nearest  neighbors  by  P,  keeping  the  second  shell  of  1 2  Ga  atoms  unchanged.  Within 
the  molecular  cluster  model,  if  n  of  the  4  nearest  neighbors  atoms  arc  As,  then 
(4  -  //)  atoms  arc  P.  This  configuration  will  correspond  to  a  P  concentration  x  = 
(4  -  n)/4  in  the  alloy.  Therefore,  by  changing  the  configuration  of  the  nearest 
neighbors,  we  simulate  the  undoped  GaAS|_jrPjr  alloy  for  x  =  0,  0.25,  0.5,  0.75, 
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and  I.  The  correspondent  calculated  clusters  are  GaAs  {x  =  0),  3As  +  lP(x  = 
0.25),  2As  +  2P  (x  =  0.5),  lAs  +  3P  (x  =  0.75),  and  GaP  (x  =  1 ).  The  cluster 
models  are  schematically  shown  in  Figure  1.  The  group  symmetries  are  Tj,  Csv, 
C2v.  Csv,  and  Tj  for  clusters  GaAs,  3As  +  IP,  2As  +  2P,  lAs  +  3P  and  GaP, 
respectively.  In  order  to  simulate  the  Ga  vacancy  in  the  alloy  (system  GaAsi-jtP;^: 
Voa)  the  central  Ga  atom  is  removed  in  each  of  the  five  configurations  above,  and 
all  other  parameters  are  kept  the  same  as  in  the  perfect  host  alloy.  The  radii  of  the 
muffin-tin  spheres  utilized  for  the  GaAs  and  GaP  clusters  are  the  covalent  radii  as 
reported  in  previous  works  [  6,7  ] .  For  the  3As  -t-  1 P  and  2 As  +  2P  clusters  we  have 
used  the  same  muffin-tin  parameters  as  for  GaAs;  for  the  1  As  -t-  3P  cluster  the  GaP 
parameters  have  been  adopted.  The  model,  as  it  has  been  applied  to  study  defects 
in  GaAs  [8,9]  and  GaP  [7]  is  described  elsewhere.  Recently,  we  have  also  used  the 
cluster  models  adopted  here  to  describe  impurity  levels  induced  by  copper  and 
manganese  in  the  GaAsi_;tP;f  alloy  [5]. 

The  calculations  carried  out  here  were  performed  within  the  framework  of  the 
spin-unrestricted  approximation,  by  including  all  the  electrons  except  those  filling 
inner  atomic  shells,  up  to  3d  in  As  and  Ga  which  were  kept  frozen.  Electronic  levels 
were  calculated  by  taking  into  account  the  Slater  transition-state  concept  [10]. 

Results  and  Discussion 

We  start  by  analyzing  the  results  obtained  for  the  extrema  of  the  alloy.  The 
neutral  Ga  vacancy  in  GaAs  and  GaP  gives  rise  to  a  deep  threefold  degenerate  h 
level  within  the  band  gap.  Since  three  valence  electrons  are  removed  from  the  lattice 
when  a  Ga  vacancy  is  created,  this  level  is  partially  occupied  with  three  holes  and 
therefore  the  defect  can  act  as  deep  acceptor  and/or  donor  centers.  Its  electrical, 
optical,  and  magnetic  properties  are  then  related  to  the  broken  sp^  chemical  bonds 
which  originate  a\  and  t2  defect  levels,  the  former  appearing  as  a  resonance  below 
the  top  of  the  valence  band  while  the  latter  is  introduced  into  the  band  gap.  For 
the  3As  +  IP  cluster  (x  =  0.25)  the  t2  level  splits  into  two  levels,  ni  and  e  of  the 
Csv  group  symmetry  which  are  occupied  with  two  and  one  electrons,  respectively. 
In  the  case  of  the  2As  -I-  2P  cluster  (x  =  0.5 )  in  C2v  symmetry,  three  levels  a\,b\, 
and  62  appear  within  the  band  gap,  with  the  uppermost  occupied  being  a  62  level 
with  one  electron.  For  the  lAs  +  3P  cluster,  again  in  Csv  symmetry,  an  a\  level 
fully  occupied  and  an  e  level  occupied  by  one  electron  are  found  within  the  band 
gap.  The  charge  distributions  obtained  for  the  vacancy  t2  gap  level,  and  t2-derived 
gap  levels,  show  that  they  are  mostly  localized  in  the  first  shell  of  nearest  neighbors, 
with  practically  the  same  percentage  of  charge  in  all  cluster  configurations  ( 52% 
inside  the  first  shell).  Similar  results  have  been  predicted  by  previous  theoretical 
calculations  for  the  Ga  vacancy  in  GaAs  [11,12]  and  GaP  [13]. 

The  trend  observed  for  the  highest  occupied  t2-derived  energy  level  indicates  that 
the  vacancy  gap  level  remains  nearly  constant,  in  relation  to  the  top  of  the  valence 
band,  as  x  the  concentration  of  P  in  the  alloy  varies  from  0  to  1 .  This  practically 
independent  behavior  of  the  vacancy  defect  gap  level  with  x  is  consistent  with  its 
bonding  state  character  of  a  Ga  p-atomic  orbital.  It  is  worth  mentioning  that  a 
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completely  different  picture  is  obtained  for  transition-metal  impurities  replacing 
Ga  atoms  in  the  GaAsi-;tPji  alloy.  It  has  been  shown  that  the  deep  transition-metal 
impurity  levels  in  III-V  compounds  do  not  follow  any  of  the  host  band-edges; 
instead,  they  are  pinned  to  the  vacuum  level,  due  to  their  antibonding  nature  with 
respect  to  the  impurity-host  orbital  bondings  [14].  Contrary  to  previous  suggestions 
[15],  our  results  for  vacancies  in  GaAsi_;tPjt  reveal  a  host  valence-band  pinning  of 
the  deep  vacancy-related  levels. 

In  Figure  2  we  present  the  trend  in  the  donor  (0/-I-)  and  acceptor  (— /O)  energy 
levels  as  a  function  of  x  in  GaAsi_;tP:,.  The  calculated  electronic  levels  shown, 
evaluated  through  the  transition-sate  concept  [10],  do  not  include  spin-polarization 
effects.  We  should  mention,  however,  that  all  eigenvalues  within  the  gap  are  lowered 
by  less  than  0.2  eV,  for  all  cluster  configurations,  when  spin-unrestricted  calculations 
are  performed.  As  can  be  seen  in  the  figure,  a  similar  trend  for  (-/O)  and  (0/+) 
levels  along  the  alloy  is  observed.  The  difference  in  energy  between  the  (-/O)  and 
(0/  +  )  levels,  which  accounts  for  the  Mott-Hubbard  potential  energy  U,  is  of  the 
order  of  0.05  eV.  This  is  consistent  with  the  expected  multiple  charge  states  character 
of  the  vacancy  [16].  According  to  our  results,  at  least  three  charge  states  (V  V°, 
V")  are  predicted  for  the  Ga  vacancy  in  GaAsi-;,P^,  which  can  occur  stable  de¬ 
pending  on  the  position  of  the  Fermi  level.  However,  it  is  interesting  to  note  that 
due  to  the  valence-band  pinning  of  the  donor  and  acceptor  transition  energies,  for 
a  fixed  Fermi  level  position  the  vacancy  charge  state  is  independent  of  the  alloy 
composition  x. 

In  summary,  we  have  shown  theoretically  that  the  donor  and  acceptor  energy 
levels  induced  by  an  ideal  Ga  vacancy  in  the  GaAsi-;,P^  alloy  are  pinned  to  the 
host  valence  band  edge.  Therefore,  these  levels  cannot  be  used  to  determine  GaAs/ 
GaAS;tP;f  heterostructures  band  offsets,  as  has  been  the  case  of  transition  metal 
impurity  levels  [14]. 


Figure  2.  Calculated  donor  (0/+)  and  acceptor  (-/O)  energy  levels  for  the  Ga  vacancy 
in  GaAsi-,P,  as  a  function  of  the  alloy  concentration  x.  Energies  are  related  to  the  top  of 
the  valence  band.  The  solid  lines  were  drawn  to  guide  the  eyes. 
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Abstract 

In  our  previous  work,  the  stopping  properties  of  metallic  and  covalent  films  were  investigated.  Here 
we  consider  an  lonically  bound  film.  The  energy  loss  of  a  proton  in  an  LiF  monolayer  (LiF-lL)  is 
calculated  in  orbital  fashion,  based  on  kinetic  theory.  The  required  momentum  density  and  mean  excitation 
energy  are  obtained  from  the  local  density  approximation  and  local  plasma  approximation  respectively. 
For  comparison,  the  LiF  molecule  is  treated  by  use  of  a  large  intermolecular  distance  in  the  film.  We 
find  the  stopping  cross  section  of  the  LiF  molecule  to  be  only  slightly  larger  than  that  for  the  LiF-lL. 
The  Bragg  rule  (additivity  of  stopping  for  the  corresponding  atoms)  is  not  valid  for  the  ionically  bound 
molecule  nor  the  corresponding  extended  system,  but  may  be  valid  if  additivity  of  stopping  of  atomic 
ions  is  assumed.  ^  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

While  many  stopping  calculations  have  been  performed  in  the  last  half  century, 
so  far  as  we  know  none  has  treated  stopping  in  an  ionically  bound  crystal  or  mol¬ 
ecule,  except  by  using  Bragg’s  rule  [1],  Mean  excitation  energies  have  been  calculated 
from  the  local  plasma  approximation  (LPA)  [2],  but  the  electron  densities  p(f) 
were  estimated  roughly  from  the  p(7)s  of  corresponding  atomic  neutral  states  and 
ionic  bond  parameters.  Some  experimental  stopping  results  for  LiF  are  also  known 
(that  of  Bader  et  al.  [3]  is  rather  old;  Biersack  et  al.  [4]  measured  1-2  MeV  alpha 
stopping  powers  in  LiF;  Kubo  [5]  measured  the  depths  of  regions  damaged  by  a 
proton  in  an  LiF  crystal).  In  our  previous  work,  the  stopping  properties  for  ultrathin 
metallic  and  covalent  films  were  investigated  [6,7],  Here  we  extend  consideration 
to  an  ionically  bound  film.  Since  the  alkali  halides  are  the  prototypes  of  ionically 
bound  systems,  and  the  calculated  structural  and  electronic  properties  for  the  ground 
state  of  an  LiF- 1 L  are  available  [8],  we  considered  the  electronic  stopping  powers 
for  a  proton  ’  i  an  LiF  monolayer  and  in  isolated  LiF  molecules.  Electron  capture 
by  the  proton  (charge  state  changes),  and  ionic  dissociation  of  LiF  were  leglected. 
The  section  next  describes  the  method  which  we  used,  with  results  and  .cussion 
in  the  section  after  next. 


Methodology 

The  linear  energy  loss  of  a  swift,  massive,  charged  particle  in  matter  is  known  as 


1  dE 
N  a.\ 


(1) 
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where  N  is  the  number  density  of  scatterers  (here  the  unit  scatterer  is  an  LiF  ion 
pair),  5  is  the  stopping  cross  section  per  scatterer,  v  is  the  velocity  of  the  incident 
particle,  E  its  energy,  and  .v  its  path  length.  We  use  Hartree  atomic  units  in  Eq.  (1) 
and  throughout  this  paper.  Because  electrons  in  different  energy  levels  have  different 
stopping  properties,  the  stopping  cross  section  must  be  calculated  in  a  state-depen¬ 
dent  fashion  (that  is,  via  an  energy-based  decomposition  [9]).  Following  Sabin  and 
Oddershede,  we  employed  Sigmund’s  kinetic  theory  of  stopping  [10]  in  the  form 

5(u)  = - 1 —  L(v) ,  (2) 

L(v)  =  2  lav)  ,  (3) 

/ 

•J.  -*■  -♦  V 

In  Eqs.  (2).  (3),  and  (4),  Z|  is  the  projectile  chaise,  Zi  the  number  of  electrons  per 
molecule,  p/(  v:)  is  the  momentum  density  of  the  /th  energy  subband  and  l/(»?/A^eeii) 
is  the  normalization  factor  for  p/(  V2).  We  discuss  the  choices  of  subband  below. 

In  Eq.  (4)  L?  is  of  Bethe  form  with  a  low  energy  cutoff 


where  n  is  the  number  of  molecules  per  unit  cell.  The  orbital  mean  excitation 
energy  //  is  determined  solely  from  the  orbital  charge  density  through  the  orbital 
local  plasma  approximation  (OLPa) 

.  .  I  r  .  f  .  r.../47re^p(7)\''n 


In  7/  =  -^  f  dz  \  dapt(f) 
Vi  J-x  Ja 


/4 

In  X*l- 


For  simplicity  and  clarity  the  scaling  parameter  X  was  chosen  to  be  unity  throughout. 

The  required  momentum  density  is  constructed  from  the  Fourier  transform  of 
the  Kohn-Sham  (KS)  eigenfunctions  in  real  space: 


P,(?)  =  _2  «ulV'a(p)l' 
XcBZ 


'if a  (P)  = 


l27rN„„A]'f^ 


4-a,  Jk, 


da  e''’’ VJ(^)  • 


The  Lam-Platzman  correction  [11]  was  omitted.  The  KS  functions  were  obtained 
in  the  local  density  approximation  (lda)  .dlizing  the  Hedin-Lundqvist  (HL)  model. 
The  atomic  calculations  used  the  same  OL?a  and  lda  but  were  spin-polarized  in 
the  central  field  approximation. 

The  electronic  structure  calculations  used  the  code  FILMS  [  1 2, 1 3].  In  it  the  KS 
functions  are  expanded  in  a  linear  combination  of  Gaussian-type  orbitals,  the  "KS” 
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basis  [14],  The  charge  density  and  exchange-correlation  energy  density  are  expanded 
in  two  auxiliary  Gaussian  bases  sets,  the  “Q”  and  “XC”  fitting  bases.  The  various 
basis  sets  are  tabulated  in  Ref.  [8]. 

The  lattice  structure  utilized  corresponds  to  the  (100)  plane  of^he  NaCl  structure. 
The  irreducible  triangle  is  1/8  of  the  Brillouin  zone  (bz);  ten  k  points  in  that  ir¬ 
reducible  triangle  were  used  [8]. 

The  energy  subbands  were  chosen  as  follows:  Atomic  stopping  was  used  for  the 
Is  bands  (in  tests,  the  discrepancy  between  atomic  and  IL  stopping  for  this  band 
is  in  the  second  decimal  place  of  the  stopping  cross  section  at  its  maximum).  The 
three  energy  bands  of  2p-like  character  are  all  in  a  similar  energy  range,  and  hence 
were  grouped  as  one  energy  subband.  This  grouping  was  tested  by  dividing  the 
energy  range  of  those  three  energy  bands  into  three  windows  (£/-i  to  £/)  according 
to 


Et  =  +  /[(£Six  -  £S:„)/3] ,  /  =  1 ,  2,  3,  (8) 

and  then  calculating  the  stopping  cross  section  at  one  velocity  (near  the  maximum 
position  of  the  stopping  cross  section).  Again  the  discrepancy  between  the  two 
calculations  is  only  in  the  second  decimal  place  in  the  stopping  cross  section  at 
that  t). 

In  the  momentum  integral  (Eq.  (4)],  the  domain  can  be  reduced  to  an  irreducible 
region  in  momentum  space  by  use  of  symmetry  properties.  Since  the  KS  functions 
are  only  available  within  the  irreducible  part  of  the  BZ,  one  must  rotate  the  wave 
vector  into  the  irreducible  part  and  make  the  corresponding  rotation  for  the  cor¬ 
responding  momentum  based  on  the  symmetry  properties  to  evaluate  the  momen¬ 
tum  density.  A  useful  result  (for  computationaUpeed)  is  that  ^,,*(p)  is  a  nonvanishing 
function  if  and  only  if  p  =  -  fc,  where  {X„}  are  reciprocal  lattice  vectors. 

To  treat  the  isolated  molecule,  we  used  FILMS  but  with  a  large  distance  (15.0 
a.u.)  between  the  molecules  in  a  square  lattice.  The  molecular  orientation  was  along 
the  edge  of  the  square.  We  refer  to  this  calculation  as  the  LiF  “quasimolecule.” 

Results 

For  convenience,  the  relevant  basis  set  parameters,  total  energies,  structural 
parameters,  energy  subbands,  mean  excitation  energies,  and  stopping  cross  sections 
for  the  LiF-lL,  the  LiF  quasimolecule,  the  Li  atom,  and  the  F  atom  are  all  shown 
in  Table  I.  The  structural  properties  and  energy  bands  reported  here  are  from 
Ref  [8]. 

Figure  1  shows  the  stopping  cross  section  as  a  function  of  proton  velocity.  An 
interesting  qualitative  feature  is  that  the  stopping  cross  section  for  the  LiF-lL  is 
very  close  to  that  of  the  LiF  molecule.  The  molecular  cross  section  is  only  slightly 
bigger,  about  3%  at  most.  This  is  very  different  from  the  behavior  of  both  metallic 
and  covalent  systems.  However,  it  is  unsurprising  since  the  total  energy  of  the 
molecule  is  close  to  the  total  energy  per  formula  unit  of  the  1 L  and  the  valence 
band  for  the  1 L  is  rather  flat.  These  features  all  reflect  the  fact  that  most  of  the 
cohesive  energy  with  respect  to  isolated  atoms  is  in  the  molecular  ionic  bonding. 
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Table  I  Companson  of  the  phase  difference  m  proton  stopping  by  the  targets  of  LiF-lL,  LiF 

molecule,  and  Li  atom+  F  atom.* 


LiF 

Li  atom  +  F  atom 

quasimolecule; 
molecule  along 

F  atom  Li  atom 

X  direction 

LiF  monolayer 

Ks  bases 

1 3s8p  1 3s 

13s8plpz  for  F 

13s8plpz  for  F 

1 3s4p  Ibr  Li 

I2s4p  for  Li 

Q  and  XC  bases 

I3s7d3p.r  for  F 

13s7dforF 

12s4d2p,  for  Li 

1  Is4d  for  Li 

Distance  between  Li  and 

Infinity 

2.80 

3.50 

F(a.u.) 

F-like  Is  band 

-106.47322 

-106.71320 

-106.81508 

Energy  range  (H) 

-24.19804 

-23.92276 

From  -24.00958 

-24.16738'’ 

to  -24.00958 

/(eV) 

483.498 

483.682 

483.651 

5  at  ti  =  2.25  a.u. 

Li-like  Is  band 

0.160 

0.160 

0.160 

Energy  range  (H) 

-1.87535 

-1.81928 

From  -1.75448 

-1.86729'’ 

to -1.75041 

/(eV) 

71.424 

72.493 

72.580 

S  at  11  =  2.25  a  u. 

2.745 

2  745 

2.778  (film) 

2.745  (atom) 

2s  band 

Energy  range  (H) 

-1.11586  -0.12463 

-0.89822 

From  -0.99472 

-1.05527'’  -0.04807'’ 

to  -0.97130 

/(eV) 

62.550  3.420 

62.943 

62.465 

5  at  ti  =  2.25  a.u. 

2p  band 

3.991  8.072 

4.054 

4.108 

Energy  range  (H) 

-0.44410 

From  -0.25208 

From  -0.38650 

-0.38525'’ 

to  -0.22604 

to  -0.28574 

/(eV) 

54.363 

48.184 

50.688 

5  at  ti  =  2.25  a  u. 

8.034 

11.314 

10.923 

^toui 

91.148  25.935 

79.308 

81.134 

S'°'  at  u  =  2.25  a.u. 

23.002 

18.273 

17.969 

*^mix 

40.489 

18.290 

17.962 

t)at5,„„ 

0.61 

2.32 

2.33 

•  The  unit  of  5  is  lO"”  eV  cmVitiolecule. 
•*  Two  energy  levels  are  for  two  spins. 


This  dramatic  influence  of  binding  upon  stopping  also  is  illustrated  in  Figures  1 
and  2.  There  we  have  applied  the  Bragg  rule  for  a  compound,  say  AB,  to  wit 

SM  «  5a(u)  +  Sb[v)  ,  (9) 

with  A  and  B  chosen  as  the  Li  and  F  atoms,  respectively.  The  Bragg  rule  stopping 
drastically  overestimates  both  the  1 L  and  isolated  molecule  stopping,  as  shown  in 


eVcm  /molecule j 
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Figure  I ,  The  comparison  for  the  stopping  cross  sections  for  the  LiF-1  L.  LiF  quasimo¬ 
lecule.  and  Bragg  rule  (Li  atom-f  F  atom). 


W 
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Figure  2.  Isolated  atom/monolayer  diflerence  of  the  stopping  cross  section  for  LiF. 
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Figure  2.  which  displays  the  difference.  Since  the  latter  two  are  close,  it  is  clearly 
molecular  binding  effects  (missing  from  the  Bragg  rule)  which  are  the  cause  of  the 
large  suppression  of  stopping. 

Application  of  Bragg’s  rule  with  A  and  B  as  ionic  species  is  another  matter. 
Experimental  results  [4]  for  the  stopping  power  for  boron-doped  LiF,  in  thin  evap¬ 
orated  layers,  are  found  to  agree  with  Andersen  and  Ziegler’s  [  1 5]  predictions  within 
a  few  percentage  points,  if  the  validity  of  Bragg’s  rule  for  ions  is  assumed.  We  could 
not  perform  a  similar  test  using  addition  of  isolated  constituent  ions  because  of  a 
well-known  problem  in  lda.  As  a  consequence  of  improper  self-interaction  can¬ 
cellation,  LDA  does  not  yield  a  properly  bound  negative  ion  without  introduction 
of  some  arbitrary  confining  potential.  However,  since  the  LiF- 1 L  has  a  very  localized 
charge  distribution  which  is  quite  similar  to  that  for  the  LiF  molecule,  it  is  reasonable 
to  predict  that  Bragg  rule  additivity  based  upon  the  corresponding  ions  will  be  valid 
for  both  ionically  bound  films  and  molecules,  consistent  with  the  experimental 
results  just  mentioned. 

Further  insight  may  be  obtained  by  comparison  of  the  2s  and  2p  stopping  for 
the  F  and  Li  atoms  with  that  of  the  LiF-1  L  as  shown  in  Figure  3.  The  figure  makes 
apparent  how  much  the  2s  and  2p  orbitals  of  LiF  are  dominated  by  F  contributions. 
The  2s-like  stopping  for  the  LiF-lL  is  almost  the  same  as  that  of  the  F  atom.  The 
2p-like  stopping  in  the  IL  has  the  same  behavior  but  is  bigger  than  that  for  the  F 
atom,  a  consequence  of  charge  transfer,  (there  are  six  electrons  involved  for  the  1 L 
whereas  there  are  five  for  the  isolated  F  atom). 


<u 


Figure  .t.  Comparison  of  the  2s  stopping  and  2p  stopping  for  the  atom  F.  atom  Li,  and 

LiF-lL. 
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Table  11.  Analytic  fitting  parameters  and  standard  deviations  for  LiF-lL,  F  atom,  and  Li-atom 

2s  orbital. 


Range 

(a.u.) 

fli 

ai 

flj 

04 

Standard  deviation 

LiF-lL 

1.0-20.0 

6.490 

97.317 

76.429 

F  atom 

1.0-20.0 

7.103 

69.998 

57.256 

Li  atom  2s 

0.35-20.0 

0.168 

9.735 

0.00283 

0.28 

As  another  means  of  tabulating  stopping  and  to  provide  an  efficient  summary 
of  our  calculated  points,  we  consider  analytic  fitting.  As  shown  in  Figure  4,  a  simple 
four-parameter  fit  modeled  on  the  Bethe  logarithm  works  quite  well.  The  fit  is  so 
good  for  the  Li  2s  atomic  stopping  that  the  original  and  fitted  curves  are  essentially 
in  distinguishable.  The  general  form  is 

When  fit  over  the  range  1.0  a.u.  sv  ^  20.0  a.u.  (or  0.35  a.u.  u  s  20.0  a.u.  for 
the  Li  is  atomic  stopping).  We  get  the  parameters  and  standard  deviations  tabulated 
in  Table  II. 
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Abstract 

Results  of  first-pnnciples,  density-functional,  lmto  calculations  on  quasi-one-dimensional  systems 
are  presented.  Special  emphasis  is  put  on  the  problems  related  with  studying  the  properties  of  real 
materials  by  means  of  electronic  structure  calculations.  As  examples  we  discuss  undoped  and  doped 
trans  polyacetylene,  an  MX  chain  compound,  CaNiN,  and  a  comparison  between  polycarbonitrile  and 
hydrogen  cyanide.  It  is  argued  that  single-chain  calculations  on  undoped  polyacetylene  predict  a  bond- 
length  alternation,  whereas  this  may  be  suppressed  in  multichain  calculations.  For  an  MX  chain  we 
demonstrate  the  importance  of  including  the  full  ligand  structure,  whereas  many  properties  of  CaNiN 
can  be  understood  as  intrinsic  to  single  chains.  A  comparison  between  the  covalently  bonded  polycar- 
bonitnle  and  the  hydrogen-bonded  hydrogen  cyanide  reveals  differences  between  delocalized  and  localized 
electrons  but  also  inaccuracies  in  the  relative  total  energies.  Finally,  results  for  doped  trans  polyacetylene 
indicate  relatively  large  electronic  dopant-polymer  hopping  integrals  as  well  as  problems  in  describing 
electron  transfers  for  weakly  interacting  systems  when  using  a  local-density  approximation.  In  total  it  is 
demonstrated  that  reliable  and  useful  informations  are  obtained  in  most  cases,  but  also  that  one  has  to 
be  careful  in  choosing  the  idealized  system  that  is  to  represent  the  real  material  and  that  certain  aspects 
of  the  local-density  approximation  should  be  improved.  ©  1993  John  Wiley  &  Sons.  Inc. 

Introduction 

One  of  the  goals  of  materials  science  is  to  understand  the  physical  and  chemical 
properties  of  materials  with  the  purpose  of  being  able  to  predict  the  properties  of 
new  materials  and  to  select  materials  with  predefined  properties.  This  requires  de¬ 
velopment  of  general  models  that  describe  the  properties  and/or  materials  of  interest. 
These  models  may  be  derived  by  analyzing  experimental  or  theoretical  results  for 
a  number  of  specific  materials.  However,  the  interpretation  of  the  experimental 
outcomes  may  not  be  simple  or  obscured  by  unwanted  effects  like  those  of  inhomo¬ 
geneities,  impurities,  surfaces,  etc.  Similarly,  detailed  theoretical  studies  of  realistic 
materials  are  often  beyond  the  limits  of  any  computer  facility,  and  one  has  to  study 
idealized  systems  that  may  or  may  not  be  related  to  the  real  systems. 

We  shall  here  report  results  of  theoretical  studies  of  the  electronic  properties  of 
quasi-one-dimensional  materials.  We  shall  concentrate  on  general  problems  related 
to  theoretical  studies  of  materials  properties  by  discussing  some  few  typical  examples. 
The  focus  is  not  on  the  chosen  examples  but  on  the  general  problems.  The  materials 
of  our  interest  are  characterized  by  being  extended  in  at  least  one  dimension  and 
having  interatomic  interactions  that  typically  are  substantially  stronger  in  this  di- 
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rection  than  in  the  other  two.  We  have  previously  developed  a  first-principles  method 
for  quasi-one-dimensional,  periodic,  infinite  systems  [  1  ] .  This  method  will  be  briefly 
reviewed  in  the  next  section.  There  we  also  discuss  a  procedure  for  studying  the 
properties  of  the  material  when  the  periodicity  is  disrupted.  The  third  through 
seventh  sections  contain  various  examples  of  applications.  In  the  third  section  the 
well-known  and  thoroughly  studied  conjugated  polymer  irons  polyacetylene  is  dis¬ 
cussed.  We  shall  here  concentrate  on  whether  the  material  can  be  idealized  as  one- 
or  three-dimensional  and  on  problems  associated  with  estimating  the  size  of  cor¬ 
relation  effects.  The  fourth  section  is  devoted  to  a  so-called  mixed-valence  halogen- 
bridged  transition-metal  linear  chain  compound  which  has  features  resembling  those 
of  polyacetylene.  This  is  chosen  as  an  example  of  the  systems  for  which  omitting 
apparently  less  important  aspects  may  lead  to  significant  effects.  In  the  fifth  section 
we  discuss  another  metal-containing  material  CaNiN.  A  comparison  between  a 
hydrogen-bonded  polymer  (hydrogen  cyanide)  and  a  conjugated  polymer  (poly- 
carbonitrile)  is  the  topic  of  the  sixth  section,  and  the  seventh  section  is  devoted  to 
doped  polyacetylene.  These  two  systems  exemplify  problems  of  the  local-density 
approximation  but  it  is  also  demonstrated  that  useful  information  can  be  obtained 
despite  these  problems.  A  conclusion  is  presented  in  the  last  section. 


Computational  Method 


Periodic  Chains 


Using  the  Born-Oppenheimer  approximation  and  the  Hohenberg-Kohn  [2] 
density  functional  formalism,  we  calculate  the  ground-state  properties  of  the  system 
of  our  interest  by  solving  the  Kohn-Sham  equations  [  3  ] 


2m 


=  eri'iCr). 


(1) 


The  exchange-correlation  parts  of  the  effective  potential  1  cir  are  approximated 
using  a  local  approximation.  Thereby  tne  potential  l\ff  becomes  a  well-defined 
functional  of  the  totai  electron  density  which  in  turn  is  the  sum  of  the  single-particle 
densities. 


occ 

p(7)  =  2  ltf',(7)|' . 

1“  I 


(2) 


Equations  ( 1 )  and  (2)  are  solved  by  expanding  the  eigenfunctions  tZ-T  r)  in  a 
basis  consisting  of  linearized  muffin-tin  orbitals  (LMTOs)  defined  as  follows.  The 
nuclei  are  assumed  occupying  the  centers  of  nonoverlapping  (muffin-tin)  spheres. 
An  LMTO  centered  at  the  nucleus  at  R  is  outside  all  spheres  defined  as  a  Hankel 
function  times  a  harmonic,  iK'*'[{2l  -  l)!!]~'/t}"(x|7  -  7?|)T/,(r^-^)  with  L 
=  (l,m)  and  <0.  Inside  any  sphere  this  function  is  augmented  with  linear 
combinations  of  the  so-called  4>  and  <i>  functions  such  that  the  resulting  function 
becomes  continuous  and  differentiable  everywhere.  The  functions  0  of  a  specific 
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site  are  obtained  by  replacing  I  e(r  of  Eq.  ( 1 )  with  its  spherically  symmetric  com¬ 
ponent.  In  that  case  Eq.  ( 1 )  can  be  transformed  into  a  one-dimensional  differential 
equation  that  can  be  solved  numerically  when  choosing  c  =  e,  “reasonable”  ( i.e., 
in  the  energy  region  for  which  that  particular  function  has  it  largest  components). 
0  is  the  derivative  of  </>  with  respect  to  the  energy  c,.  <f)  and  0  are  specific  for  the 
atom  at  R  and  the  angular  quantum  number  /  and  depends  parametrically  on  c,. 
For  more  details  the  reader  is  referred  to  Refs.  [1,4-8]. 

Since  the  lmtos  are  unfamiliar  to  many  and  only  rarely  depicted,  we  show  in 
Figure  1  the  lmto  s  and  P:  basis  functions  for  a  CO  molecule  placed  along  the  r- 
axis.  Especially  the  augmentation  inside  “other”  spheres  makes  the  LMTO  basis  set 
different  from  many  basis  sets.  Thereby  the  well-known  basis-set  superposition 
error  is  essentially  lacking.  In  this  figure  we  also  show  the  potential  and  its  spherically 
symmetric  part  inside  the  spheres.  Since  the  deviations  from  spherical  symmetry 
is  only  small,  we  expect  the  LMTOs  per  construction  to  be  good  approximations  to 
the  true  eigenfunctions  of  the  Kohn-Sham  equations  inside  the  spheres. 

Before  proceeding  it  may  be  worthwhile  discussing  advantages  and  disadvantages 
of  the  present  approach.  7  his  discussion  may  be  separated  into  two  parts:  the  use 
of  the  density  functional  formalism  and  the  choice  of  the  lmtos  as  basis  functions. 

The  density-functional  formalism  is  in  principle  exact,  but.  due  to  the  local  ap¬ 
proximation,  one  introduces  right  from  the  beginning  an  approximation  that  may 
or  may  not  be  good.  On  the  other  hand,  the  single-particle  form  of  Eq.  ( 1 )  allows 
for  larger  systems  a  much  more  accurate  solution  than  the  exact  many-particle 
Schrddinger  equation  does. 

By  choosing  LMTOS  as  basis  functions  these  are  to  a  large  extent  potential-adapted 
such  that  the  basis  set  may  be  kept  at  a  limited  size  without  loss  of  accuracy.  Using 
Gaussians  or  plane  waves  instead  of  lmtos  would  correspond  to  approximating 
the  potential  by  harmonic  potentials  or  constants,  respectively,  which  clearly  is  a 
worse  approximation  than  the  potential  used  in  constructing  the  lmtos.  On  the 
other  hand,  since  we  are  including  the  full  potential  (and  not  only  its  muffin-tin 
parts)  the  calculations  become  more  evolved,  partly  due  to  the  separation  of  space 
into  two  parts,  for  which,  furthermore,  all  quantities  are  given  analytically  inside 
one  part  (the  interstitial  region )  and  numerically  inside  the  other  part  (the  muffin- 
tin  spheres). 

Finally,  for  a  quasi-one-dimensional  system  that  is  assumed  being  infinite  and 
periodic  in  one  dimension,  we  form  Bloch  waves  from  the  atom-centered  basis 
functions.  This  procedure  can  with  advantage  be  applied  also  for  polymers  with  a 
general  helical  symmetry  (see,  e.g..  Refs.  [1.5.8]). 

Struclural  Deji’cls 

Structural  defects  may  have  crucial  impacts  on  transport  properties  (e.g.,  charge 
and  proton  conductivities)  of  the  materials,  and  it  would  therefore  be  highly  desirable 
to  include  these  in  the  first-principles  description  of  the  systems.  However,  they 
destroy  the  penodicity,  and  one  would  have  to  study  systems  of  finite  size,  to  consider 
periodically  repeated  defects,  or  to  use  a  Greens  function  technique.  As  a  simpler 
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placed  at  the  origin  and  the  O  atom  at  (0,0.2.3)  a.u.  All  quantities  are  depicted  along  the 
molecular  axis  and  in  a.u  In  (a.b)  we  show  the  full  potential  (the  solid  curve)  and  its 
spherically  symmetric  part  (the  dashed  curve)  inside  (a)  the  carbon  and  (b)  the  oxygen 
muffin-tin  sphere,  (c.d)  and  (e.f )  show  x  and  P;,  respectively,  basis  functions  centered  on 
(c,e)  the  carbon  atom  and  (d.f )  the  oxygen  atom.  Here,  the  vertical  dashed  lines  mark 
the  boarders  of  the  muffin-tin  spheres,  the  full  curves  are  the  basis  functions,  and  the 
dashed  curves  the  Hankel  functions 


alternative,  we  have  decided  to  use  model  calculations  in  the  study  of  the  structural 
defects,  for  which  the  models  are  based  on  results  of  first-principles  calculations  on 
defect-free  systems. 

In  Figure  2,  we  show  a  chain  consisting  of  a  number  of  almost  identical  units. 
These  units  may  have  internal  structure  and  will  in  most  cases  consist  of  a  smaller 
number  of  atoms.  The  internal  structure  is  in  that  case  related  to  the  relative  position 
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Figure  2.  Schemaiic  representation  of  an  infinite  chain  consisting  of  repeated  units  with 
internal  structure.  The  regions  R  and  L  are  each  regular,  whereas  D  corresponds  to  a 
domain  wall  (a  defect  region). 


of  the  atoms.  In  order  to  describe  the  structure  of  the  chain,  we  will  ascribe  a  single 
structural  parameter  Xn  to  the  nth  unit.  It  is  thereby  assumed  that  all  other  structural 
parameters  of  the  nth  unit  are  relaxed  upon  variation  of  x„  and  are  irrelevant  in 
describing  the  defect-containing  system. 

The  central  region  in  Figure  2  (denoted  “D” )  separates  two  parts  ( “R”  and  “L” ) 
where  the  chain  is  regular.  Thus,  x„  is  a  constant  in  each  of  the  regular  parts  and 
varies  from  one  value  to  the  other  inside  the  D  region.  If  the  R  and  L  regions  are 
different  but  energetically  degenerate,  the  D  region  is  a  domain  wall  that  can  easily 
move  up  and  down  the  chain.  D  is  in  that  case  often  denoted  a  soliton.  If  the  R 
and  L  regions  are  identical,  D  is  a  polaron,  and  also  this  is  highly  mobile.  The 
importance  of  the  defects  stems  from  the  fact  that  they  may  be  the  carriers  of 
charge,  energy,  protons,  etc.  For  certain  systems  they  occur,  moreover,  in  the  ground- 
state  configuration  of  the  charged  system. 

In  all  cases  x„  vanes  inside  the  D  region  smoothly  between  two  values  X/  and  x„. 
In  order  to  study  the  system  of  Figure  2,  we  perform  first-principles  calculations 
on  periodic  chains,  i.e.,  x„  =  Xo  =  const,  X/  ^  Xo  ^  Xu-  The  results  of  these  are 
subsequently  mapped  onto  a  suitable  model  Hamiltonian  with  which  we  finally 
study  the  defect-containing  system.  Since  doping  or  low-energy  photoexcitation 
leads  to  changes  only  in  the  occupation  of  the  frontier  orbitals,  the  model  Hamil¬ 
tonian  will  typically  contain  a  tight-binding  part  for  their  description  plus  a  simple 
function  of  the  structural  parameters  describing  the  remaining  part  of  the  total 
energy.  The  on-site  energies  and  hopping  integrals  of  the  first  part  will  depend  on 
the  local  environment  as  given  by  {x„}.  A  typical  example  is  trans  polyacetylene 
to  be  discussed  in  the  next  section.  It  should  be  stressed  that  the  success  of  this 
approach  depends  on  a  number  of  assumptions  including  that,  for  a  given  x„,  all 
other  structural  parameters  are  relaxed  in  the  same  way  independent  of  whether 
the  nth  unit  is  a  part  of  a  periodic  chain  or  of  a  defect-containing  chain,  and  that 
effects  specific  for  the  defects  (e.g.,  long-range  Coulomb  effects  or  effects  due  to  an 
increased  localization  of  electrons)  can  largely  be  ignored.  This  is  discussed  in  detail 
in  Ref.  [9]. 


Trans  Polyacetylene 

Trans  polyacetylene  ( Fig.  3 )  is  the  prototype  of  the  conjugated  polymers/synthetic 
metals  that  exhibit  large  changes  in  their  properties  upon  doping  or  photoexcitation 
(see,  e.g..  Ref.  [10]).  There  is  accordingly  a  vast  literature  dealing  with  this  material. 
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Figure  3.  Trans  polyacetylene,  (a)  and  (b)  show  the  regular,  energetically  degenerate 
phases,  (c)  a  soliton  The  black  and  white  circles  represent  carbon  and  hydrogen  atoms. 

respectively. 


From  a  theoretical  point  of  view  it  is  an  ideal  compound  since  its  unit  cell — when 
considering  only  one  isolated  infinite  chain — contains  only  four  atoms,  C2H2. 

The  doping  or  photoexcitation  is  supposed  to  lead  to  the  creation  of  solitons 
[Fig.  3(c)]  which  are  the  domain  walls  in  the  language  of  Figure  2.  The  two  regular 
chains  (R  and  L  in  Fig.  2)  differ  only  in  the  way  the  C — C  double  and  single  bonds 
alternate,  and  each  unit  in  Figure  2  consists  of  one  CH  unit. 

The  experimentally  studied  material  is  far  from  homogeneous.  It  contains  longer 
and  shorter  chains  that  are  arranged  partly  parallel  and  partly  at  random.  On  the 
other  hand,  in  the  parameter-free  studies  idealizations  are  investigated.  These  are 
either  infinite,  isolated,  periodic  chains;  infinite,  periodic,  three-dimensional  crystals; 
or  isolated,  finite  molecules.  This  may  be  one  reason  for  the  scattered  agreement 
between  theory  and  experiment  regarding  the  C — C  bond-length  alternation. 
Whereas  experiment  finds  a  clear  alternation  (bond  lengths  alternating  equal  to 
2.57  and  2.72-2.73  a.u.  [1 1,12]),  the  theoretical  results  range  from  good  agreement 
with  this  to  a  vanishing  bond-length  alternation  (see,  e.g..  Ref.  [6]  and  references 
therein).  In  our  own  study  [6]  on  an  isolated,  .nfinite  chain  we  found  bond  lengths 
equal  to  2.58  and  2.76  a.u.  which  are  seen  to  agree  well  with  experiment.  Moreover, 
it  turns  out  that  the  (almost)  vanishing  bond-length  alternation  mainly  has  been 
found  with  local-density  calculations  on  three-dimensional  crystals.  Since  the  local- 
density  approximation  tends  to  overestimate  the  strength  of  chemical  bonds  [13], 
one  may  speculate  that  also  the  interchain  interactions  are  overestimated,  which 
in  turn  may  lead  to  smaller  inaccuracies  in  the  intrachain  interactions.  This  might 
ultimately  lead  to  the  (almost)  lacking  bond-length  alternation.  On  the  other  hand, 
all  density-functional  calculations  on  single,  finite  or  infinite  chains  have  predicted 
a  bond-length  alternation. 
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In  Figure  4  we  show  our  calculated  [6]  total  energy  as  a  function  of  C — C  bond- 
length  alternation  for  a  single  chain  with  the  unit-cell  length  kept  fixed  at  the  value 
of  the  optimized  structure.  We  see  here  that  the  energy  gain  upon  bond-length 
alternation  indeed  is  small  and  thus  that  it  may  be  strongly  modified  upon  smaller 
changes  in  the  assumptions.  Therefore,  the  results  may  depend  crucially  on  whether 
one  studies  an  isolated  chain  or  one  embedded  in  a  multichain  environment. 

We  next  turn  to  the  discussion  of  the  solitons  and  apply  the  procedure  outlined 
in  the  previous  section.  We  accordingly  map  results  of  first-principles  calculations 
on  periodic  chains  with  different  C — C  bond-length  alternations  onto  a  suitable 
model  Hamiltonian.  To  this  end,  it  is  useful  to  follow  the  ideas  of  Su  et  al.  [14- 
16].  They  realized  that  the  orbitals  split  into  energetically  low-lying  a  orbitals  formed 
by  carbon  sp~  hybrids  and  hydrogen  s  functions  as  well  as  tt  oibitals  generated  by 
carbon  p  functions  around  the  Fermi  level.  This  separation  is  recovered  in  Figure 
5,  where  we  show  our  calculated  band  structures  of  trans  polyacetylene  [6]. 

Su  et  al.  included  a  tight-binding  description  of  the  tt  orbitals  (the  frontier  orbitals) 
whereas  the  remainder  of  the  total  Hamiltonian  was  written  as  a  harmonic  function 
of  nearest-neighbor  bond  lengths.  This  corresponds  for  the  periodic,  neutral  chain 
to  separating  the  total  energy  as  done  in  Figure  4.  Applying  this  model  on  a  doped 
chain  (i.e.,  a  chain  with  more  or  fewer  electrons),  Su  et  al.  found  that  chains 
containing  one  soliton  per  charge  were  the  stabler  ones  in  the  dilute  doping  limit. 
The  solitons  lead  according  to  their  model  to  gap  states  exactly  at  midgap.  However, 
according  to  experiments  the  gap  states  are  found  somewhat  off  (about  0.2  eV )  the 


-0.15  0.00  0.15 

Ad  (A) 


Figure  4.  Total  energy,  the  ts  electron  energy  (defined  as  the  sum  of  the  single-particle 
energies  of  the  occupied  tt  orbitals),  and  the  remainder  (the  a  energy)  in  eV/QHj  unit 
as  functions  of  C — C  bond-lengtn  alternation  Ad  (defined  as  the  difference  in  the  lengths 
of  the  two  nearest-neighbor  C — C  bonds). 
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Figure  5  The  band  structures  of  the  optimized  Iran';  polyacctylenc  chain.  The  dashed 
line  marks  the  Fermi  level,  and  k  -  0  and  A  =  1  is  the  zone  center  and  zone  edge,  respectively. 


midgap  position  [17-19],  The  origin  of  this  discrepancy  is  far  from  clear.  It  has 
been  argued  that  correlation  effects  are  the  reason  [20],  but  also  single-particle 
effects  may  explain  at  least  parts  of  the  difference  [6],  The  first-principles  calculations 
can  be  used  in  refining  the  model  and  its  parameters.  Thereby,  we  showed  [6]  that 
one  may  extend  the  simplest  model  of  Su  et  al.  with  some  realistic  single-particle 
effects  that  ultimately  lead  to  a  shift  of  the  gap  states.  The  thereby  obtained  shift 
was.  however,  only  half  as  large  (about  0.1  eV)  as  that  observed  by  experiments. 

An  MX  Chain 

Another  class  of  matenals  that  recently  has  attracted  considerable  attention  ( see. 
e.g..  Ref.  [  10])  is  the  so-called  mixed-valence  transition-metal  halogen-bridged  lin¬ 
ear-chain  MX  compounds.  As  a  prototype  of  those  we  shall  here  focus  on 
Pt2Brf,(NH3)4.  This  material  consists  of  linear  parallel  chains  with  alternating  Pt 
and  Br  atoms  along  the  chain  axis.  To  each  Pt  atom  are  two  further  Br  atoms  and 
two  NH3  molecules  attached  as  sideligands. 

According  to  experiments  the  Br  atoms  of  the  cnain  backbone  are  placed  pairwise 
closer  to  every  second  Pt  atom  resulting  in  an  alternating  valence  of  the  Pt  atoms. 
We  may  now  relate  the  structure  to  that  of  Figure  2  with  every  unit  being  one 
Pt;Br6(NH3)4  unit.  The  occurrence  of  two  energetically  degenerate  structures — 
differing  in  the  valence  alternation — leads  hence  to  the  possibility  of  the  existence 
of  solitons.  as  originally  proposed  by  Baeriswyl  and  Bishop  [21]. 
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The  present  interest  in  this  class  of  materials  stems  from  the  possibility  of  tuning 
the  properties  by  using  different  types  of  metal  atoms  (e.g.,  Pd,  Ni,  Pt)  and/or 
halogen  atoms  (e.g..  Cl,  1,  Br)  (see,  e.g..  Refs.  [10,22,23]).  Also  by  varying  the 
sideligands  or  applying  pressure  the  properties  can  be  changed. 

We  have  studied  the  Pt2Br6(NHj)4  compound  using  both  single-chain  calculations 
(with  the  method  of  the  second  section)  and  full-potential  LMTO  calculations  on  a 
three-dimensional  material  [24].  Model  calculations  as  described  in  the  second 
section  have  not  yet  been  carried  through,  but  it  was  shown  [24]  that  the  parameters 
extracted  from  the  first-principles  calculations  agree  well  with  those  usually  assumed 
realistic  for  these  materials. 

According  to  the  simplest  picture,  one  would  expect  the  frontier  orbitals  to  be 
generated  by  Pt  </;2  and  Br  orbitals,  where  the  z-axis  is  assumed  parallel  to  the 
chain  axis.  The  alternation  in  valence  is  then  ascribed  to  a  Peierls  distortion  which 
doubles  the  unit  cell.  This  picture  is  only  partly  verified  by  calculations  on  PtBr 
chains  for  which  all  sideligands  have  been  neglected.  Both  according  to  the  single¬ 
chain  calculations  and  according  to  the  crystal  calculations,  the  optimized  Pt-Br 
distance  was  roughly  13%  too  small  compared  with  experiments  [25]  (i.e.,  9.14 
a.u.  vs.  10.49  a.u.).  Moreover,  several  bands  were  found  to  cross  the  Fermi  level, 
and  there  was  no  sign  of  a  dimerization. 

One  might  expect  that  these  results  were  obscured  by  the  exclusion  of  the  Br 
sideligands.  Since  Br  is  an  electronegative,  open-shell  atom,  these  atoms  may  remove 
orbitals  of  the  PtBr  chain  from  the  region  around  the  Fermi  level.  Although  this 
was  confirmed  by  the  calculations,  the  lowest  total  energy  was  still  found  for  an 
undimerized  structure.  Only  by  including  also  the  NH3  sideligands  was  the  exper¬ 
imentally  observed  dimerization  found,  as  shown  in  Figure  6.  Moreover,  only  then 
we  found  Pt  and  Br  p.  orbitals  as  the  only  frontier  orbitals.  Although  NH3  is  a 
closed-shell  molecule,  it  is  polarized,  and  one  may  understand  the  result  as  due  to 
interactions  between  nitrogen  lone  pairs  and  Pt  orbitals. 

CaNiN 

CaNiN  [Fig.  7(a)]  was  synthesized  some  few  years  ago  [26]  and  is  interesting 
since  it  combines  one-,  two-,  and  three-dimensional  features.  Thus,  it  consists  of 
two-dimensional  layers  of  one-dimensional  NiN  chains  separated  by  layers  of  Ca 
atoms  such  that  the  whole  system  becomes  three-dimensional  [Fig.  7(a)].  The 
question  we  here  shall  address  is  whether  the  basic  features  can  be  understood 
"within  a  one-dimensional  picture.”  If  so,  the  material  would  be  interesting  as  a 
polymer  based  to  a  large  extent  on  transition  metals. 

We  first  performed  single-chain  calculations  on  an  isolated  NiN  chain  [27].  This 
gave  an  optimized  NiN  bond  length  equal  to  3.28  a.u.,  which  is  somewhat  smaller 
than  the  experimental  value  of  3.38  a.u.  [26].  Including  one  Ca  atom  per  NiN  unit 
as  shown  in  Figure  7(b)  led  to  an  optimized  Ni — N  bond  length  of  3.50  a.u.,  i.e., 
larger  than  given  by  experiment.  According  to  Mulliken  gross  populations,  0.4 
electrons  have  been  transferred  from  Ca  to  NiN,  but  due  to  the  presence  of  the  Ca 
atoms  the  electrons  of  the  NiN  chains  have  to  some  extent  been  redistributed  and 
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Figure  6.  Relative  total  energy  per  PtsBr^  unit  as  a  (unction  of  the  dimerization  amplitude 
f  for  the  MX  chain  without  (dashed  curve)  and  with  (solid  curve)  inclusion  of  the  NH3 
sideligands.  {'is  the  shift  of  the  Br  atoms  of  the  chain  away  from  the  midpoint  measured 
in  units  of  the  Pt-Pt  interatomic  distance. 


the  transferred  electrons  do  accordingly  not  only  enter  empty  orbitals  of  the  NiN 
chain.  The  distance  between  the  Ca  atoms  and  the  NiN  chains  is  found  to  be  much 
larger  (6.05  a.u.)  than  the  experimental  value  (4.72  a.u.).  Due  to  the  fairly  large 
Ca — NiN  distance,  the  main  interactions  are  of  electrostatic  nature.  Since  these 
are  long-ranged,  the  relative  position  of  the  Ca  atoms  and  the  NiN  chains  becomes 


•Ni  OCa 

Figure  7.  Schematic  representation  of  (a)  the  crystal  structure  of  CaNiN  and  (b)  of  one 

CaNiN  chain. 
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very  sensitive  to  treating  a  single  chain  or  the  complete  crystal,  explaining  the 
difference  between  theory  and  experiment. 

In  order  to  compare  the  single-chain  results  with  those  of  the  crystalline  material, 
we  finally  performed  one  calculation  for  the  experimental  geometry  using  the  lmto- 
ASA  programs  for  crystals  [4,28-30].  In  Figure  8  we  show  the  densities  of  states  as 
obtained  for  the  NiN  chain,  the  CaNiN  chain,  and  the  CaNiN  crystal.  We  observe 
here  close  similarities  for  all  three  systems.  Thus,  the  low-energy  region  in  Figure 
8(c)  resembles  those  of  Figures  8(a)  and  8(b)  except  for  some  shoulders  due  to 


ui  -20  -15  -10  -5  0 


Energy  (eV) 

Figure  8.  Total  density  of  states  of  (a)  a  single  NiN  chain,  (b)  a  single  CaNiN  chain, 
and  (c)  a  CaNiN  crystal.  The  vertical  dashed  lines  mark  the  Feimi  level.  Only  in  (a)  and 
(b)  is  an  energy-zero  defined  (infinitely  far  away  from  the  system). 
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three-dimensional  effects.  Also  closer  to  the  Fermi  level  there  are  similarities;  the 
peak  around  -8--6  eV  (-5  eV)  in  Figures  8(a)  and  8(b)  corresponds  to  that 
around  -4- -2  eV  (0  eV)  in  Figure  8(c).  However,  the  band  structures  for  the 
crystal  (not  shown)  reveal  also  some  dispersion  (i.e.,  interactions)  in  the  direction 
perpendicular  to  the  chains  especially  for  the  upper  valence  bands.  This  is  responsible 
for  the  differences  in  the  density  of  states  closest  to  the  Fermi  level.  Finally,  the 
differences  in  the  ^''idth  of  the  lowest  shown  peak  can  be  related  to  different  lattice 
constants. 


Polycarbonitrile  vs.  Hydrogen  Cyanide 

Upon  replacing  every  second  CH  unit  of  trans  polyacetylene  with  an  N  atom, 
one  arrives  at  polycarbonitrile  which  may  be  classified  as  another  conjugated  poly¬ 
mer.  The  unit  cell  of  this  polymer  contains  the  same  three  atoms  (H,  C,  and  N) 
as  hydrogen  cyanide,  which  is  a  hydrogen-bonded  polymer  consisting  of  covalently 
bonded  linear  HCN  monomers  linked  through  weak  hydrogen  bonds  (cf.  Fig.  9). 
A  comparison  between  these  two  thus  gives  information  on  the  differences  between 
systems  with  delocalized  electrons  and  those  with  localized  electrons.  As  a  third 
polymer  that  can  be  included  in  this  comparison,  we  include  the  hydrogen-bonded 
(CNH  )j:  polymer  of  Figure  9  ( 3 1  ] .  We  shall  here  concentrate  on  the  first-principles 
results  and  refer  the  reader  to  Ref.  [31]  for  a  discussion  of  the  structural  defects 
which  were  studied  using  methods  as  described  in  the  second  section. 

The  optimized  structure  of  polycarbonitrile  was  found  to  be  very  realistic,  although 
precise  experimental  information  was  lacking.  On  the  other  hand,  the  unit  ceil 
length  of  ( HCN  )x  was  found  to  be  about  8%  smaller  than  the  experimental  value 
( 7.62  a.u.  vs.  8.20  a.u.),  and,  moreover,  the  polymer  was  found  to  be  only  metastable 
compared  with  dissociation  into  isolated  monomers.  A  similar  metastability  was 


(a) 

(b)  ^  ^ 

(c) 

Figure  9.  Schematic  representation  of  (a)  hydrogen  cyanide  ( HCN ),.  (b)  (CNH and 
(c)  polycarbonitrile  (CHN);,.  (c)  is  a  conjugated  polymer;  the  other  two  hydrogen-bonded. 
The  filled  circles  and  squares  represent  carbon  and  nitrogen  atoms,  respectively,  and  the 
empty  circles  hydrogen  atoms. 


DENSlTY-ajNCTlONAL  LMTO  STUDIES 


239 


obtained  for  (CNH  for  which  we  found  no  experimental  information.  We  believe 
thi.s  to  be  a  consequence  of  the  tendency  of  the  local-density  approximation  to 
overestimate  strengths  of  chemical  bonds,  i.e..  also  of  a  small  covalent  component 
of  the  otherwise  weak  hydrogen  bonds.  This  overestimate  may  lead  to  too  short 
hydrogen  bonds  and  in  turn  modify  the  covalent  bonds  of  the  HCN  and  CNH 
monomers,  thereby  interchanging  the  energetic  order  of  the  monomers  and  the 
polymers.  Some  recent  results  [32.33]  have  indicated  that  this  failure  is  relatively 
general  for  local-density  calculations  on  hydrogen-bonded  systems,  although  cal¬ 
culations  on  hydrogen  halides  have  given  very  realistic  results  (see,  e.g..  Ref  [»]). 
However,  by  using  a  so-called  nonlocal-density  approximation  proper  structural 
properties  are  obtained  [  32,33  ] .  We  have  not  yet  studied  this  for  the  present  systems. 

We  believe  the  local-density  approximation  also  to  be  responsible  for  our  hnding 
that  the  conjugated  polymer  is  the  stablest  isomer  [31].  We  shall  discuss  this  finding 
in  some  detail.  In  Figure  10  we  show  our  calculated  relative  energies.  However,  the 
overestimate  in  the  strength  of  a  covalent  bond  due  to  the  local-density  approxi¬ 
mation  depends  mainly  on  tlie  type  of  bond  and  is  between  less  than  1  and  more 
than  2  eV  per  bond  [13].  For  the  sake  of  simplicity,  we  shall  here  assume  it  to  be 
2  ±  0.5  eV  per  covalent  bond.  For  the  conjugated  (CHN),  polymer,  we  have  three 
covalent  bonds  per  unit  cell  (one  C — H  and  two  C — N  bonds),  giving  an  overes¬ 
timate  in  the  binding  energy  of  about  6  eV.  For  the  other  two  systems,  the  number 
of  covalent  bonds  per  unit  cell  is  only  two,  giving  an  overestimate  of  roughly  4  eV. 
Thereby  the  relative  total  energies  get  modified  as  indicated  in  Figure  10.  In  total, 
we  thus  see  that  the  calculations  do  not  permit  us  to  make  any  reliable  statements 
of  the  relative  stability  of  these  compounds  due  to  the  different  types  of  bonds  of 
the  d'lferent  systems.  Related  arguments  should  also  be  applicable  for  the  relative 
stabi  i’y  of  the  hydrogen-bonded  polymers  and  the  corresponding  isolated  monomers 
for  v/hich  we  find  the  total  energies  of  the  polymers  to  be  about  0.4-0. 8  eV  per 
monomer  above  those  of  the  isolated  monomers. 


Figure  10.  Relative  total  energies  per  monomer  of  the  different  H— C — N  compounds 
as  calculated  ( lower  part )  and  when  modified  as  dcscr:lx-d  in  the  text  ( upper  part ) . 
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Despite  the  problems  related  to  obtaining  accurate  structural  informations,  the 
calculations  can  be  used  in  providing  insight  into  differences  between  localized  and 
delocalized  electrons.  In  Figure  1 1  we  show  the  band  structures  of  the  three  polymers. 
The  localization  in  the  hydrogen-bonded  systems  results  in  considerably  smaller 
band  widths,  which  show  up  in  the  density  of  states  and  photoelectron  spectra  as 
significantly  narrower  peaks  ( see  Ref.  [31]).  Nevertheless,  similarities  are  observed. 
Thus,  the  lowest  band  for  the  conjugated  polymer  is  due  to  C — N  a  orbitals  and 
is  to  some  extent  recovered  for  the  other  two  polymers.  The  uppermost  a  valence 
band  in  Figure  1 1  (c)  is  due  to  N  lone  pairs  and  can  also  be  found  for  the  HCN 
polymer.  Also  the  ir  band  in  Figure  1 1(c) — due  to  C — N  jr  bands — is  observed 
for  the  other  two  systems. 

The  discussion  of  localization  versus  delocalization  can  be  extended  to  include 
electron  densities  in  position  and  momentum  spaces.  We  shall  not  discuss  this 
point  further  here  but  refer  to  Ref.  [31].  Moreover,  a  similar  comparison  can  be 
carried  through  for  polyacetylene  and  polycarbonitrile  [34]. 

Doped  Polyacctylene 

In  the  third  section,  we  mentioned  that  the  properties  of  polyacetylene  can  be 
changed  significantly  upon  doping.  Within  models  such  as  that  of  Su  et  al.,  the 
dopants  are  merely  treated  as  a  passive  electron  reservoir  donating  or  accepting 
electrons  to  or  from  the  polyacetylene  chains,  and  all  changes  are  intrinsic  to  the 
polymers.  However,  electron  spin  resonance  ( ESR )  experiments  [  35-37  ]  on  doped 
polyacetylene  t.avc  indicated  that  the  role  of  the  dopant  is  more  active.  We  therefore 


Figure  1 1.  Band  structures  of(a.b)  the  hvdrogen-bonded  (a)  (HCN), and  (b)  (CNH), 
polymer  and  (c)  the  conjugated  (CHN),  polymer.  The  dashed  lines  represent  the  Feimi 
level,  and  A  =  0  and  k  =  I  are  the  zone  center  and  zone  boundary,  respectively. 
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Studied  a  single  polyacetylene  chain  doped  periodically  33%  with  monatomic  do¬ 
pants  ranging  from  Li  to  Na  (i.e.,  Z  =  3-1 1)  [38].  Although  the  dopant  concen¬ 
tration  is  higher  than  experimentally  achievable  (up  to  15-20%,  see,  e.g..  Ref.  [10]), 
dopant-dopant  interactions  are  largely  excluded  such  that  the  results  should  ah,, 
be  relevant  for  lower  dopant  concentrations. 

Our  main  interest  was  to  see  at  which  energies  dopant  orbitals  occurred  (i.e., 
whether  they  could  be  expected  to  be  important  in  the  transport  processes)  and  to 
estimate  the  electronic  dopant-polymer  hopping  integrals.  However,  due  to  inac¬ 
curacies  in  the  local-density  calculations,  this  was  related  with  some  uncertainties 
as  we  shall  see. 

It  turned  out  that  the  frontier  orbitals  for  all  systems  except  for  the  Ne-doped 
polymer  had  large  components  on  the  dopants.  Moreover,  the  Mulliken  populations 
indicated  a  fractional  electron  transfer  ( -0.3-  +0.8  electrons)  between  the  dopants 
and  the  polymer.  This  implies  that  the  dopant-polymer  interactions  have  a  signif¬ 
icant  covalent  component.  Whereas  most  transfers  in  principle  agree  with  chemical 
intuition,  we  found — surprisingly — that  Li  and  Na  receive  electrons  instead  of  do¬ 
nating  them. 

In  principle,  the  eigenvalues  t,  of  the  Kohn-Sham  equations  ( 1 )  are  not  related 
to  electronic  excitation  energies  except  for  the  uppermost  occupied  orbital  which — 
for  the  exact  density  functional — should  be  related  to  the  first  ionization  potential 
FiP  [39].  However,  the  local-density  approximation  gives  a  much  too  weak  depen¬ 
dence  of  this  as  a  function  of  the  atomic  number  Z  [40,13]  for  isolated  atoms. 
Therefore,  the  uppermost  occupied  orbitals  of  weakly  interacting  systems  will  tend 
to  have  noteworthy  components  on  all  subsystems,  as  we  found  for  doped  poly¬ 
acetylene.  Moreover,  small  changes  in  Vip  as  a  function  of  atomic  number — as,  for 
instance,  caused  by  effects  due  to  the  presence  of  more  than  one  atom — will  easily 
lead  to  an  electron  transfer  in  the  wrong  direction.  We  believe  that  this  explains 
our  results  for  Li  and  Na. 

These  results  imply  that  some  features  for  weakly  interacting  systems  may  be 
related  with  significant  uncertainties  when  calculated  with  a  local-density  method. 
On  the  other  hand,  we  believe  the  occurrence  of  fractional  electron  transfers  and 
of  significant  dopant-polymer  hopping  integrals  (we  estimated  them  to  be  ~0.5 
eV  [38])  to  be  true  such  that  our  results  also  may  be  considered  a  theoretical 
verification  of  the  experimental  esr  results  of  Bernier  and  coworkers  [  35-37  ] . 

Conclusions 

In  the  present  paper  we  have  discussed  how  first-principles,  density-functional 
calculations — mainly  on  quasi-one-dimensional  systems  and  when  using  an  lmto 
approach — can  be  used  m  understanding  the  chemical  and  physical  properties  of 
specific  materials.  These  results  may  in  turn  be  used  in  developing  more  general 
descriptions  of  larger  classes  of  materials  and/or  properties.  Most  real  materials 
are,  however,  too  complicated  to  be  the  subject  of  any  parameter-free  study,  and 
one  has  accordingly  to  examine  idealized  systems.  The  main  result  of  the  present 
paper  was  that  the  outcome  of  such  studies  might  depend  critically  on  the  choice 
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of  the  model  system.  Thus,  the  bond-length  alternation  observed  experimentally 
for  trans  polyacetylene  v/as  not  reproduced  by  some  other  theoretical  studies.  We 
suggested  this  to  be  related  to  differences  in  treating  three-d  inensional  structures 
vs.  single  chains.  As  another  result  the  simplest  chemical  picture  of  the  MX  chain 
was  only  recovered  when  including  the  full  ligand  structure.  On  the  other  hand, 
the  electronic  properties  of  CaNiN  were  well  described  as  a  qaasi-one-dimensional 
metal-containing  material  but  three-dimensional  effects  were  important  when  ad¬ 
dressing  the  question  of  the  position  of  the  Ca  atoms  relative  to  the  NiN  chains. 

As  a  further  example  of  the  importance  of  having  the  proper  description,  we 
considered  models  for  investigating  the  soliton-induced  gap  states  for  trans  poly¬ 
acetylene.  The  simplest  single-particle  model  of  Su  et  al.  predicted  these  to  appear 
exactly  at  midgap  whereas  experiments  found  these  off  midgap.  Effects  beyond 
those  of  a  single-particle  picture  can  describe  this  but  also  further  single-particle 
effects  can  be  (parts  of)  an  explanation  of  the  discrepancy. 

In  the  last  two  sections,  we  addressed  some  more  fundamental  problems  of  the 
local-density  approximation.  We  found  that  the  structural  properties  of  the  cova¬ 
lently  bonded  polycarbonitrile  were  well  described  but  those  of  the  hydrogen-bonded 
systems  were  less  accurate.  Moreover,  the  relative  stabilities  were  not  well  described. 
Results  of  some  recent  studies  indicate,  however,  that  a  nonlocal  approximation 
within  the  density-functional  formalism  may  solve  this  problem.  Despite  these  in¬ 
accuracies  the  calculations  could  give  interesting  information  on  localization  vs. 
delocalization  of  the  electrons. 

Finally,  the  results  for  doped  polyacetylene  indicated  nonnegligible  dopant- poly¬ 
mer  interactions  but  also  problems  for  local-density  calculations  in  obtaining  reliable 
electron  transfers  for  weakly  interacting  systems.  This  problem  may  be  related  to 
the  fundamental  problem  of  local-density  calculations  in  describing  the  gap  between 
occupied  and  unoccupied  orbitals  accurately. 
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Abstract 

We  propose  a  number  of  rules  based  on  topological  and  structural  considerations  for  the  synthesis 
of  intnnsically  conducting  polymeric  materials.  These  rules  can  be  summarized  as  follows:  (i)  The 
polymer  has  to  have  a  glide  plane  or  a  twofold  screw  axis  when  infinitely  extended;  (ii)  the  unit  cell 
must  contain  4N  +  2  valence  electrons;  (lii)  the  packing  into  the  solid  is  restricted  by  symmetry  rules 
that  conserve  the  twofold  degeneracy  caused  by  rule  (i).  These  rules  are  necessary  (but  not  sufficient) 
for  superconductivity.  The  inorganic  polymer  poly(sulfur-nitnde),  which  is  a  metallic  superconductor 
with  a  critical  temperature  of  TV  =  0.33  K,  is  identified  as  a  striking  example  of  these  rules.  ©  1993 
John  Wiley  &  Sons,  Inc. 

In  1 964  Little  [  1  ]  proposed  the  attractive  idea  that  there  may  exist  certain  organic 
polymers  which  can  become  superconducting  at  high  temperatures.  Such  organic 
molecules  should,  according  to  Little’s  original  proposal,  consist  of  two  parts:  a 
conducting  chain  (the  backbone)  and  a  series  of  side  groups  attached  to  this  back¬ 
bone.  If  these  side  groups  are  easily  polarized,  the  electrons  in  the  spine  can  be 
coupled  by  the  polarization-induced  interaction  of  the  side  groups.  This  interaction 
can  provide  the  attractive  coupling  necessary  for  superconductivity  [2],  In  spite  of 
many  attempts  to  synthesize  conducting  polymers  based  on  conjugated  backbones, 
no  significant  progress  has  been  made  in  the  synthesis  of  intrinsically  conducting 
polymers.  Most  of  the  organic  conductors  synthesized  to  date  are  semiconductors 
that  become  conducting  only  after  heavy  doping  (see,  e.g.,  the  recent  publications 
in  Ref.  [3]).  Unfortunately,  the  structural  and  mechanical  properties  of  these  doped 
polymers  are  not  always  useful  for  large  scale  application.  More  progress  has  been 
made  in  the  field  of  molecular  organic  (super-)  conductors  [4],  Values  of  Tc  have 
now  reached  a  value  of  12.5  K  [5]  and  will  likely  continue  to  rise.  The  discovery 
of  the  superconducting  properties  of  Li-,  Cs-,  and  Rb-doped  Buckminster-Fullerenes 
also  indicates  that  superconductivity  of  organic  materials  is  far  more  common  than 
originally  believed  [6-9]  (Ref.  [9]  gives  results  for  mixed-Cs,  Rb  compounds). 

So  far,  only  one  intrinsically  conductive  polymer  has  been  synthesized.  There 
arc  several  possible  reasons  for  this  fact.  First,  there  is  a  technical  one:  Up  to  now 
the  techniques  for  synthesis  of  polymers  with  high  crystallinity  have  been  insufficient 
to  create  conductive  polymers  in  which  the  charge  carrier  mobility  is  comparable 
to  that  in  metals  or  semiconductors.  Second,  a  one-dimensional  conductor  is  sup- 
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posed  to  undergo  a  Peierls  distortion,  which  will  result  in  a  gap  at  the  Fermi  level 
[10].  In  that  case,  the  density  of  states  at  the  Fermi  level  will  vanish  and  so  will 
the  metallic  conductivity.  The  resulting  polymer  will  only  become  conducting  after 
doping.  However,  doped  polymers  usually  do  not  crystallize  in  well-defined  crystal 
structures  due  to  the  disorder  caused  by  the  doping.  It  therefore  seems  worthwhile 
to  investigate  the  possibility  of  the  existence  of  a  class  of  intrinsically  conducting 
polymers  that  do  not  undergo  a  Peierls  distortion. 

If  we  consider  an  isolated  polymer  chain,  we  can  immediately  state  the  conditions 
under  which  it  will  be  conducting. 

1 .  The  polymer  structure  should  have  an  almost  rigid  backbone,  with  glide  plane 
symmetry  and/or  twofold  screw  axis  symmetry  when  infinitely  extended.  This 
rule  is  a  consequence  of  Herring’s  theorem.  This  theorem  implies  that  a  one¬ 
dimensional  system  with  a  glide  plane  or  a  twofold  screw  axis  along  its  spine 
will  show  a  double  degeneracy  at  the  zone  boundary  for  all  one  electron  levels 
[11,12] 

2.  There  should  be  +  2  electrons  per  unit  cell.  Under  this  condition  the 
highest  occupied  MO  will  be  filled  exactly  up  to  the  zone  boundary.  This  rule 
could  be  called  a  superaromaticity  rule,  as  it  is  the  polymer  extension  to  the 
Hiickel  stabilization  rule  for  aromatic  molecules.  It  is  this  degeneracy  at  the 
Fermi  level  that  may  offset  a  Peierls  distortion  [10].,  Moreover,  it  assures  a 
high  density  of  states  at  the  Fermi  level. 

A  polymer  that  obeys  both  rules,  and  does  not  undergo  a  Peierls  distortion,  is  a 
conductor.  If  we  crystallize  this  polymer  into  a  solid  and  succeed  in  packing  the 
polymer  in  such  a  way  that  the  above-mentioned  rules  still  apply,  then  the  resulting 
system  will  still  be  conducting.  In  that  case  the  interchain  interaction  will  not  reduce 
the  polymer  to  a  semiconductor.  This  represents  our  third  rule. 

In  fact,  the  interchain  interaction  may  be  important  for  more  than  one  reason. 
If  these  interactions  are  strong  enough,  so  that  the  resulting  solid  polymer  is  not  a 
one-dimensional  system,  but  has  to  be  regarded  as  a  highly  anisotropic  three-di¬ 
mensional  system,  a  Peierls  distortion  may  be  avoided. 

This  seems  to  be  the  case  with  poly  (sulfur-nitride).  From  calculations  and  ex¬ 
periments  one  can  conclude  that  this  remarkable  polymer  obeys  the  two  rules  stated 
above.  The  crystal  symmetry  is  P2\lc[\'i].  The  S-N  chains  are  oriented  along  the 
b~a.\K,  which  happens  to  be  the  twofold  screw  axis  of  the  unit  cell.  There  are  two 
crystallographically  equivalent  chains  per  unit  cell,  which  are  related  by  screw  axis 
symmetry  around  the  />-axis  of  the  monoclinic  unit  cell.  Moreover,  there  is  proof 
that  the  interchain  interaction  is  almost  as  strong  as  the  intrachain  interaction  due 
to  the  close  proximity  of  S  and  N  atoms  in  neighboring  chains,  and  therefore  (SN);^ 
has  to  be  seen  as  an  anisotropic  three-dimensional  solid  and  not  a  one-dimensional 
polymer  [14].  This  strong  interchain  interaction  is  apparently  the  reason  (SN);^ 
does  not  undergo  a  Peierls  distortion,  although  ab  initio  lcao-CO  calculations  by 
Dovesi  et  al.  show  that  the  energy  difference  between  the  symmetric  and  the  sym¬ 
metry  broken  state  is  exceedingly  small,  indicating  that  the  absence  of  a  Peierls 
distortion  may  also  be  an  intrinsic  quality  of  ( SN  )x  [  1 5  ] .  The  conductivity  parallel 
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to  the  molecular  axes  at  room  temperature  is  an  =  4  X  10^(12  cm)"‘  [16,17].  At 
4.2  K  this  value  is  about  1000  times  as  high.  At  low  temperatures  the  ratio  aj  a ^ 
is  of  the  order  of  1000  [18].  Furthermore,  (SN),  undergoes  a  transition  to  a  su¬ 
perconducting  state  at  0.33  K.  It  is  believed — at  this  moment — to  be  a  “normal” 
type  II BCS  superconductor.  Both  a  and  Tc  increase  under  pressure  [19-21]. 

A  key  issue  is  the  preparation  of  conducting  polymers  with  a  high  degree  of 
crystallinity.  Experiments  with  doped  polyacetylenes  show  that  high  crystallinity 
increases  the  conductivity  dramatically  by  increasing  the  effective  conjugation  of 
the  chains.  In  these  systems  the  crystallinity  can  be  improved  by  annealing  tech¬ 
niques  [22]. 

Further  possibilities  for  achieving  high  crystallinity  lie  in  the  synthesis  of  soluble 
polymers,  which  can  be  crystallized  from  solution. 

Finally,  the  question  has  to  be  asked:  Can  conducting  polymers  that  obey  our 
two  rules  be  superconductors?  According  to  Little’s  model,  this  is  certainly  a  pos¬ 
sibility  [1].  We  can  imagine  that  the  conductive  backbone  has  a  number  of  side 
groups  that  are  highly  polarizable,  thus  inducing  attractive  interaction  between  the 
conduction  electrons,  leading  to  a  Bcs-type  pairing  [2].  Due  to  the  one-dimen¬ 
sionality  of  the  polymers,  the  density  of  states  at  the  Fermi  level  will  be  high,  leading 
to  a  stronger  coupling  of  the  electrons  and  hence  a  higher  Tc.  This  situation  can 
be  compared  to  that  observed  in  the  A- 15  compounds,  where  the  one-dimensionality 
of  the  transition  metal  chains  is  considered  to  be  responsible  for  the  rather  high 
values  of  Tc  [23].,  On  the  other  hand,  a  mechanism  involving  the  interactions 
between  the  conduction  electrons  in  one  chain  with  the  neighboring  chains  could 
also  occur.  A  similar  mechanism  was  recently  discussed  as  a  possible  explanation 
of  the  high-Tc  superconductivity  in  the  layered  cuprates  [24,25].  This  additional 
fourth  rule  is  more  quantitative  than  the  first  three.  The  actual  unit  cell  composition 
and  chemical  structure  must  be  involved,  and  a  predictive  theory  must  recognize 
these  factors  explicitly  [26]. 

Polymers  that  contain  large  side  groups  and  sheet  {2D)  materials  that  resemble 
the  Cu-O  type  materials  in  structure  and  meet  the  original  topological  requirements 
of  Little’s  model  for  excitonic  superconductivity  are  currently  being  investigated. 
The  main  focus  of  these  studies  is,  however,  on  organometallic  systems  [27].  Also, 
a  number  of  sandwich  materials  have  been  produced.  They  are,  however,  very  hard 
to  process  because  of  their  air  sensitivity,  in  volatility,  and  insolubility  [28].  At  this 
moment,  a  number  of  experimental  groups  are  studying  the  possibilities  of  syn¬ 
thesizing  polymers  that  obey  the  rules  presented  in  this  paper.  The  results  of  these 
efforts  will  give  us  opportunity  to  test  and,  if  needed,  tune  our  rules.  We  believe 
the  three  rules,  stated  here,  are  qualitatively  correct.  Feedback  from  experimental 
results  will  be  important  to  turn  these  qualitative  rules  into  a  quantitatively  predictive 
theory  of  (super)  conductive  polymers. 

In  summary,  we  propose  a  number  of  simple  topological  and  structural  rules, 
that  will  lead  to  intrinsically  conducting  polymers.  If  these  polymers,  in  addition 
to  our  mles,  have  side  chains  of  high  polarizability,  they  can  become  superconducting 
at  fairly  high  temperatures.  Because  the  proposed  class  of  polymers  is  intrinsically 
conducting,  none  of  the  disorder  problems  arrising  from  the  usually  applied  heavy 
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doping  occurs.  In  addition,  our  rules  can  be  used  as  practical  guidelines  in  the 
search  for  intrinsically  conducting  polymers. 
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Abstract 

We  evaluate  binding  energies,  ionization  energies,  and  second-order  energy  differences  as  functions 
of  valence  electron  number  for  small  sphencal  clusters  of  stabilized  jellium,  using  the  Kohn-Sham 
equations  with  the  local-spin-density  (LSD)  approximation.  Cohesive  energies  are  also  reported.  A  com¬ 
parison  IS  made  with  semiciassical  formulas  (liquid  drop  model  and  Fade  approximant,  with  surface 
and  curvature  coefficients  derived  from  first  pnnciples).  These  formulas  nicely  average  the  shell-structure 
oscillations  of  the  energy,  which  are  found  to  be  almost  the  same  as  for  ordinary  jellium.  Sphencal 
clusters  with  1  ,■  7,  and  9  electrons  have  binding  energies  very  close  to  those  of  the  semiciassical  predictions. 
©  1993  John  Wiley  &  Sons,  Inc. 


Introduction 

In  the  simple  metals,  the  delocalized  valence  electrons  experience  a  weak  ionic 
pseudopotential,  which  in  simplified  treatments  is  often  replaced  by  the  potential 
of  a  uniform  positive  background  with  a  sharp  boundary  (jellium) . 

The  stabilized  jellium  model  [1]  for  surface  and  cohesive  properties  of  simple 
metals  is  intended  to  cure  well-known  deficiencies  of  the  ordinary  jellium  model:, 
unrealistic  results  for  the  binding  energies  at  all  densities,  for  the  compression  prop¬ 
erties  at  low  densities  (negative  bulk  modulus  for  the  density  of  cesium),  and  for 
the  surface  properties  at  high  densities  (negative  surface  tension  for  the  density  of 
aluminum).  All  these  unrealistic  results  follow  from  the  fact  that  jellium  is  unstable, 
except  at  a  density  close  to  that  of  sodium.  At  that  density,  jellium  and  stabilized 
jellium  have  identical  surface  properties.  The  stabilized  jellium  model  retains  the 
simplicity  and  universality  of  jellium.  With  only  two  inputs,  the  density  and  valence, 
this  model  provides  an  accurate  overall  description  of  the  binding  energies  of  all 
simple  metals,  in  the  framework  of  density-functional  methods.  Given  only  the 
density,  it  delivers  reasonable  overall  surface  properties  (work  function,  surface 
tension,  etc.).  This  result  is  achieved  by  subtracting  from  the  energy  functional  of 
the  jellium  model  the  spurious  self-interaction  of  the  bulk  positive  charge  and  by 
introducing  inside  the  solid  a  constant  potential,  the  average  value  in  the  Wigner — 
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Table  1  Liquid  drop  model  coefficients  a,  and  a^.  The  surface  and  curvature  coefficients  are  from 
a  Kohn-Sham  calculation  for  the  planar  surface  of  stabilized  jellium  [8]  Mso  indicated  are  the  work 
function  W'  and  the  distance  Xq  from  the  image  plane  to  the  jelhum  edge  of  the  planar  surface,  from  [9]. 


n 

(bohr) 

(eV) 

a, 

(eV) 

Oc 

(eV) 

W 

(eV) 

xo 

(bohr) 

Al 

2.07 

-10  58 

0.87 

0  65 

4.21 

1  01 

Na 

3.99 

-6  26 

0.59 

0  24 

2.94 

1.27 

Cs 

5.63 

-4.64 

0.42 

0.13 

2.26 

1.55 

Seitz  cell  of  the  difference  between  a  local  pseudopotential  and  the  jellium  back¬ 
ground  potential.. 

The  jellium  model  should  have  the  same  drawbacks  for  metallic  clusters  as  for 
the  bulk  system.  For  example,  it  does  not  yield  realistic  binding  energies,  and  it  is 
completely  inadequate  to  describe  compression  effects  of  finite  systems.  Notwith¬ 
standing  these  deficiencies,  the  jellium  model  with  the  help  of  density-functional 
methods  has  been  extensively  used  for  understanding  metallic  clusters  [2,3,4].  Those 
studies  support  the  image  of  the  cluster  as  a  system  of  electrons  bound  in  a  one- 
particle  potential  due  to  the  spherical  (or  deformed )  jellium  background  and  the 
other  electrons.  Clusters  with  “magic”  numbers  of  electrons  (2,8,  1 8,  20,  etc.)  are 
more  stable  than  their  neighbors,  due  to  the  presence  of  larger  gaps  above  the 
highest  occupied  level..  The  occurrence  of  ionization  energies  with  local  maxima 
for  magic  systems  corroborates  that  picture. 

Here  we  shall  evaluate  the  effect  of  the  “stabilization”  of  jellium  on  static  prop¬ 
erties  of  clusters,  such  as  binding  energies,  ionization  energies  and  second-order 
energy  differences.  In  other  work  [5],  the  stabilized  jellium  model  will  be  applied 
to  describe  compression  effects  for  clusters  at  any  density. 

This  work  calculates  energies  of  small  spherical  stabilized  jellium  clusters,  as 
functions  of  electron  number,  using  the  Kohn-Sham  equations  of  density-functional 
theory  (within  the  local-spin-density  approximation.  LSD).  By  “small,”  we  mean 
that  the  number  of  valence  electrons  is  A'  <  20.  The  quantal  energy  of  the  cluster 
is  compared  with  a  classical  liquid  drop  formula.  A  variant  of  the  liquid  diop 
formula,  using  a  Fade  approximant,  which  has  been  proposed  recently  for  the  study 


Table  II.  Fade  coefficients  for  the  stabi¬ 
lized  jellium  model  (5}.  The  r,  values  are 
those  of  Table  I 


b, 

b. 

b. 

Al 

0.742 

0.326 

0  123 

Na 

0  399 

0.284 

0  146 

Cs 

0  308 

0  227 

0.147 
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Figure  I .  Non-bulk  binding  energy  per  valence  electron  E/N-a^  for  clusters  of  sodium 
(rj  =  3.99 ),  within  the  stabilized  jellium  model.  The  symbols  •  and  O  represent,  respectively, 
the  Kohn-Sham  results  for  stabilized  and  ordinary  jellium.  The  dashed  line  represents  the 
liquid  drop  model,  and  the  dashed-dotted  line  the  Fade  approximant.  The  bulk  binding 
energy  is  =  -6.26  eV  for  stabilized  jellium  and  a..  =  -2.10  eV  for  jellium. 


of  small  voids  inside  solids  [6],  is  also  considered.  The  enhancement  of  stability 
due  to  shell  effects,  which  is  known  for  jellium,  is  examined  for  stabilized  jellium. 
Ionization  energies  of  stabilized  jellium  clusters  are  evaluated.  Finally,  the  energies 
of  stabilized  jellium  atoms  are  used  together  with  bulk  binding  energies  to  obtain 
cohesive  energies. 

In  the  next  section,  we  present  the  Kohn-Sham  equations  applied  to  spherical 
clusters  of  stabilized  jellium.  Results  are  then  displayed  for  the  energetics  of  three 
metals  ( Al,  Na  and  Cs),  which  cover  the  whole  range  of  metallic  densities.  In  the 
last  section,  further  work  with  stabilized  jellium  is  proposed. 

Kohn-Sham  Equations  for  a  Cluster  of  Stabilized  Jellium 

Let  us  adopt  the  atomic  system  of  units  {e=  m  =  h  =  1 ) .  We  consider  a  spherical 
uniform  positive  background  of  finite  density  n+  such  that 

1  {r^R) 

0  (r^R), 


«+(r)  =  «  0(R  -  r),  0(R  -  r)  = 


(2.1) 
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Na  (r,=3.99) 


0.0  5.0  10  0  15.0  20.0 

N 


Figure  2.  lonizatioii  energies  for  clusters  of  sodium.  The  symbols  *  and  O  represent,  re¬ 
spectively,  stabilized  and  ordinary  jellium  results.  The  dashed  line  represents  the  result  of 
formula  (2.22)  for  the  stabilized  jellium  model.  Experimental  data  are  indicated  by  X  [22]. 


with 

3  _  kf 
”  4irrs  ’ 

and  R  the  cluster  radius.  From  the  neutrality  condition 

J  dh  n+(r)  =  N , 


(2.2) 

(2.3) 


the  radius  of  the  background  sphere  must  be 

R  =  .  (2.4) 

The  self-consistent  Kohn-Sham  equation  is 

[-  ^  +  Ueft{r,  ,  (2.5) 


where  a  denotes  a  set  of  quantum  numbers  and  o  is  a  spin  number  (up  f  or 
down  I). 

In  Eq.  (2.5),  the  effective  potential  is 
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Na  {r.=3.99) 


Figure  3.  Second-order  energy  differences  for  clusters  of  sodium.  The  symbols  •  and  O 
represent,  respectively,  stabilized  and  ordinary  jellium  results.  The  dashed  line  represents 
the  liquid  drop  model  for  stabilized  jellium. 


«cfr(r,  f)  =  «+{r)  +  J  +  MJc(r) ,  (2.6) 

where  tJ+  is  the  electrostatic  potential  for  the  interaction  between  the  background 
and  the  electrons,  n(r )  is  the  total  electronic  density,  and  ulc(r)  the  spin-dependent 
exchange-correlation  potential.  We  have 


u+(r)  = 


WS  {l"  <  R) 


ir>R), 


with 


(5w)ws  — 


Icf  ^  dcc 
4ir  3  drs ' 


(2.8) 


the  average  potential  difference  which  appears  in  the  stabilized  jellium  model  [1], 
(cc  is  the  correlation  energy  per  electron  of  the  uniform  electron  gas  [7].)  The 
second  term  in  (2.6)  is  the  Hartree  potential.  The  last  term  is  the  exchange-cor¬ 
relation  potential 


2^4 
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Figure  4.  The  same  as  Figure  1  for  aluminum  (r,  =  2.07).  The  bulk  binding  energy  is 
=  -10.59  eV  for  stabilized  jellium  and  a„  =  -0.21  eV  for  jellium. 


«*(>■))  =  T-  [«evc(«t>  ”<)]  >  (2-9) 

on, 

where  n^(r)  and  «i(r)  are  the  up  and  down  spin  densities,  respectively.  Here  e^c  = 
Cx  +  Be,  with  Cx  =  ~3ki.  /4w  the  exchange  energy  per  electron  of  the  uniform  gas. 
The  total  electronic  density  is  given  by 

n(r)=  2  «a(r),  (2.10) 

with 

n,(.r)  =  X  ■  (2.11) 

a 

The  fa,  are  occupation  numbers  of  the  orbitals  defined  by  the  quantum  numbers 
{fac  =  ©(cf  -  Ba,),  with  Bp  the  Fermi  energy,  and  la^  fa^  =  N). 

The  total  energy  of  the  stabilized  jellium  cluster  is 

E{N)  =  2 -  |(^[«]  +  2  J  d^rnUn^,  «i)«»(r)j  +  UbM 
+  J  d^rBxfn^,  «i)n(r). 


(2.12) 
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10.0 


8.0 

(U 

^  6.0 

HH 

4.0 


2.0 

no  5.0  10.0  15.0  20.0 

N 

Figure  i.  The  same  as  Figure  2  for  aluminum.  Note  that  only  N  =  3.  6,  9.  12,  15,  18 
are  physically  meaningful.  Experimental  data  are  taken  from  [23]. 


A1  (r,=2.07,  z*=l) 


r  T—i  I  I  I  I  I  I  I  I  I  I  I  I  I — I — I  r 


where 


is  the  electronic  Coulomb  repulsion  energy,  and 

,,,  ,  3N(N-N'/^) 


(2.13) 


(2.14) 


the  background  Coulomb  repulsion  minus  the  self-repulsion  energy  within  each 
V  'igner-Seitz  cell.  We  have  adopted  the  version  of  stabilized  jellium  in  which  the 
effective  valence  is  unitv  (z*  =  1 ),  since  this  version  gives  the  more  realistic  bulk 
modulus.  The  ordinary  jellium  model  is  recovered  by  dropping  the  last  term  of 
Eqs.  (2.7)  (r<i?)  and  (2.14). 

For  a  spherical  system,  be  self-consistent  Eq.  (2.5)  is 


i>c(r(r,  cr)  + 


/(/+!) 

2r' 


(2.15) 


where  is  the  radial  wave  function.  As  an  approximation  and  to  oblige 

the  system  to  have  spherical  symmetry,  we  .eplace  the  true  electronic  spin  density 
by  the  spherical  average 
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Figure  6.  The  same  as  Figure  I  for  cesium  (r,  =  5.63).  The  bulk  binding  energy  is  o„  = 
-4.64  eV  for  stabilized  jellium  and  Oc  =  - 1 .98  eV  for  jellium. 


n«(r)  =  2  .  (2.16) 

In  order  to  solve  (2.15),  with  the  approximation  (2.16),  we  have  used  an  atomic 
code  where  the  nucleus  with  constant  charge  has  been  expanded  to  the  desired 
cluster  radius. 

It  is  known  that  the  energy  of  a  finite  quantal  system,  such  as  a  cluster,  can  be 
described  in  an  average  way  by  the  liquid  drop  model  (LDM) 

Ei.dm{N)  =  a,N  +  +  •  •  •  (2.17) 

The  coefficients  (volume),  Qj  (surface),  and  (curvature)  for  the  stabilized 
jellium  model  have  been  obtained  with  the  aid  of  the  so-called  “leptodermous 
expansion”  [8].  They  are  given  in  Table  I  for  Al.  Na,  and  Cs. 

We  can  write  (2.17)  in  the  form 

£'ldm(A^)  =  a,N  +  as{N)N^f\  (2.18) 

The  Fade  formula  for  a  stable  cluster  is  [6] 

aAN)  =  a<(l  -  -  byN-']-'  . 


(2.19) 


CLUSTERS  OF  STABILIZED  JELLIUM 


257 


The  coefficients  b\,  62,  and  are  given  in  Table  II.  This  formula  was  developed 
to  connect  the  liquid  drop  expansion  for  a  spherical  void  of  large  radius  with  the 
perturbation  expansion  for  a  void  of  small  radius.  It  effectively  sums  the  leptoder- 
mous  expansion  to  all  orders,  and  has  been  adapted  to  clusters  by  changing  the 
sign  of  the  curvature  7?"' . 

We  compare  the  liquid  drop  prediction  with  the  quantal  result,  looking  for  special 
stability  (shell  closures)  for  numbers  such  that 

£(A^)  <  £ldm(A^)  .  (2.20) 

Relative  stability  is  also  indicated  by  local  maxima  of  the  ionization  energy 

im  =  En{N  -  \)  -  En{N)  ,  (2.21) 

where  the  subscript  N  uenotes  the  fixed  positive  charge.  A  smooth  formula  for  the 
ionization  energy  is 


I{N)=  W  + 


1 

2(r,A-'^  +  xb)  ’ 


(2.22) 


with  W  the  work  function  and  Xq  the  location  of  the  image  plane  for  the  planar 
surface.  (The  values  in  Table  1  are  taken  from  [9].)  Eq.  (2.22)  is  an  approximation 
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Table  III.  Cohesive  energies  for  the  stabilized  jeihum  (SJ)  and  jelhum  (J)  models  (in  eV),  using  different 
appro.Nimations.  “lsd"  means  local-spin-density,  “lda”  local-density  approximation,  “Sic”  self-interaction 
correction,  "ldm”  liquid  drop  model.  “Fade"  Fade  approximant.  and  “Exp"  expenment  (from  [18]). 
Here  we  use  densities  which  are  slightly  different  from  those  of  Table  I.  The  values  m  this  table  refer  to 
zero  temperature,  while  the  previous  ones  are  room  temperature  values. 


r, 

(bohr) 

LSD 

LDA 

SIC 

LDM 

Fade 

Exp. 

II 

< 

2.07 

SJ 

3.79 

3.93 

3.21 

2.75 

3.02 

3.34 

J 

0.72 

0.84 

0.21 

-0.25 

-0.07 

M(N=  1) 

2.07 

SJ 

1.91 

2.41 

1.61 

1.52 

1.90 

— 

J 

0.48 

0.91 

0.24 

0.08 

0.19 

Na 

3.93 

SJ 

0.92 

1.18 

081 

0.85 

0.82 

1.13 

J 

0  88 

1.15 

0.78 

0.82 

0.78 

Cs 

5.62 

SJ 

0  56 

0.74 

0.51 

0.59 

0.54 

0.83 

J 

0  62 

0.81 

0.57 

0.66 

0.76 

to  the  liquid  drop  expression  [  1 0] ,  in  which  a  weakly  size-dependent  LDM  chemical 
potential  m(^)  *=  -H'' appears. 

A  third  measure  of  stability  is  the  second-order  energy  difference,  which  has  been 
used  to  interpret  mass  spectra  of  metallic  clusters: 

A2(iV)  =  £( A'  -F  1 )  -F  E(N  -  1 )  -  2£:(A^) .  (2.23) 

A  peak  of  this  quantity  means  an  enhancement  of  stability  with  respect  to  neigh¬ 
boring  values  of  N.  In  the  liquid  drop  model,  Eq.  (2.23)  becomes  A2(A^)  - 
d^ELDM(N)/dN^.  We  shall  see  whether  the  criteria  for  relative  stability  based  on 
(2.20),  (2.21),  and  (2.23)  agree. 


Results 

Bulk  jellium  is  stable  at  the  density  =  4.2,  where  <(6v)ws  =  0.  Sodium  {r,  = 
3.99)  has  therefore  been  used  as  an  application  of  the  jellium  model  [11,12],  Since 
the  ‘  tabilized  jellium  model  yields  a  small  value  of  (6u)ws  for  Na,  the  surface 
properties  of  jellium  and  stabilized  jellium  should  be  practically  the  same  for  this 
metal,  although  the  bulk  binding  energy  per  particle  is  very  different  in  the  two 
models.  Small  clusters  of  Na  are  more  bound  in  the  stabilized  jellium  description 
than  in  jellium  ( see  Fig.  I ,  noting  the  different  for  stabilized  jellium  and  jellium) . 
Nevertheless,  the  shell  structure,  i.e.,  the  oscillation  around  the  smooth  curve  given 
by  the  liquid  drop  formula,  is  essentially  the  same,  with  magic  numbers  N  =  2,  i, 
18,  20.  The  binding  energy  is  also  well  below  the  ldm  curve  for  N  =  \9,  where  the 
2s  shell  is  half-filled.  For  the  electron  numbers  N  =  1,7,9,  17,  the  quantal  result 
agrees  closely  with  the  LDM  prediction.  These  “tell-tale”  numbers  might  be  used 
to  estimate  the  exact  (beyond  lsd)  surface  and  curvature  energies  from  careful 
quantum-chemical  calculations  for  small  spherical  clusters.  For  Na,  the  Fade  rep¬ 
resentation  practically  coincides  with  the  liquid  drop  formula. 


CLUSTERS  OF  STABILIZED  JELLIUM 


259 


The  shell  structures  of  jellium  and  stabilized  jellium  are  also  displayed  by  the 
ionization  energies.  The  ionization  energies  of  the  two  models  agree  closely  (Fig. 
2).  The  ionization  energy  for  TV  =  19  is  bigger  than  that  for  N  =  20;  this  fact, 
known  for  jellium  m  LSD  [13],  holds  for  stabilized  jellium  as  well.  The  local  maxima 
at  half-filled  shells  (N  =  5,  13)  are  a  characteristic  of  LSD,  and  are  not  seen  in 
the  local-density-approximation  (lda)  [14].  (lda  is  LSD  with  Wf,  n;  replaced  by 
n/2,  n/2.) 

Figure  2  also  presents  experimental  ionization  energies  for  Na  clusters.  For  the 
smallest  clusters,  the  spherical  stabilized  jellium  results  are  not  in  good  agreement 
with  experiment.  In  fact,  clusters  of  five  or  fewer  atoms  are  typically  planar  [15,16]. 
Moreover,  the  instability  of  many  spherical  open-shell  ;lusters  is  evident  from  the 
total  energies  of  Figure  1.  For  example,  the  total  energy  for  A'  =  3  is  greater  than 
the  sum  of  total  energies  for  =  2  and  1.  These  unstaole  clusters  will  start  to 
deform  away  from  spherical  shape  toward  fission.  In  many  cases,  they  will  reach 
energy  minima  before  fission  occurs.  Spheroidal  distortions  can  correct  most  of  the 
difference  between  our  ionization  energies  and  experimental  ones  [17], 

The  second-order  energy  difference  is  almost  identical  in  jellium  and  stabilized 
jellium  (Fig.  3).  The  liquid  drop  result  for  AiiN)  is  practically  constant  (A2(,V) 
sw  0),  providing  a  good  average  of  the  quantal  values. 

Turning  to  Al,  we  find  that  the  shell  fluctuations  are  bigger  than  for  sodium, 
although  the  two  patterns  are  similar  (Fig.  4).  The  Fade  approximant  and  the  LDM 
formula  agree,  except  for  the  smallest  clusters  ( A^  ^  8 ) .  For  A'  =  1 ,  the  Fade  formula 
reproduces  the  quantal  energy;  recall  that  N  is  the  number  of  valence  electrons  and 
not  the  number  of  atoms.  The  ionization  energies  of  stabilized  jellium  ( Fig.  5 )  are 
bigger  that  those  of  jellium,  although  the  peak  structure  is  the  same.  For  A^  1 2, 
our  results  agree  with  experiment. 

In  the  case  of  Cs  (Figs.  6  and  7),  the  nonbulk  binding  energies  E/N  ~  and 
the  ionization  energies  of  stabilized  jellium  are  below  those  of  jellium.  As  expected, 
a  negative  value  of  <5u)ws  (as  in  Al)  increases  the  nonbulk  binding  energy  E/N 
-  a„,  the  ionization  energy,  and  the  amplitude  of  Ai^N)  oscillations,  while  a  positive 
value  (as  in  Cs)  has  the  opposite  effects. 

The  cohesive  energy  is  the  energy  difference  between  the  free  atom  and  an  atom 
in  the  bulk.  It  can  be  written  as 

Ccoh  =  —a^  +  E{N  =  z) ,  (3.1) 

where  z  is  the  valence  of  the  atom.  This  quantity  can  be  read  directly  from  Figures 
1 , 4,  and  6.  Table  III  gives  values  for  the  cohesive  energy  of  stabilized  and  ordinary 
jellium,  in  different  approximations. 

In  the  case  of  Al,  an  atom  of  stabilized  jellium  is  a  cluster  with  <6u)ws  constructed 
from  an  effective  valence  z*  =  1,  but  with  AT  =  3  electrons  ( real  valence).  We  have 
also  included  in  Table  III  the  case  AT  =  1,  since  this  provides  a  fair  test  of  the  ldm 
and  Fade  formulas. 

We  see  from  Table  III  that  stabilization  of  jellium  drastically  improves  the  cohesive 
energy  of  Al,  in  comparison  with  experiment.  Table  III  also  shows  that  the  Fade 
successfully  predicts  the  LSD  cohesive  energies  of  monovalent  metals  (A  =  1 );  the 
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discrepancy  for  TV  =  3  may  be  attributed  to  shell  effects.  Our  LDA  values  agree  with 
the  “ideal  metal”  results  of  Rose  and  Shore  [18].  The  surface  and  cohesive  properties 
of  the  ideal  metal  are  the  same  as  those  of  stabilized  jellium;  the  small  difference 
we  find  arises  from  the  use  of  a  different  correlation  energy.  We  note  that  the  LDA 
cohesive  energies  are  close  to  experimental  ones.  However,  this  close  agreement  is 
due  to  a  cancellation  of  errors  between  the  LDA  and  the  stabilized  jellium  approx¬ 
imation.  The  exact  cohesive  energy  for  monovalent  stabilized  jellium  is  given  by 
the  self-interaction  correction  (SIC)  [19],  and  is  smaller  than  the  LSD  value. 

Conclusions 

We  have  analyzed  the  properties  of  small  clusters  of  stabilized  jellium.  Stabilized 
jellium  is  similar  to  the  pseudojellium  model  of  Utreras-Diaz  and  Shore  [20],  but 
unlike  the  latter  it  does  not  use  phenomenological  information  other  than  the  den¬ 
sity.  Binding  energies,  ionization  energies,  second-order  energy  differences,  and 
cohesive  energies  have  been  calculated.  The  shell  structure  is  essentially  the  same 
for  ordinary  and  stabilized  jellium.  The  various  measures  of  stability  tend  to  agree. 
The  cohesive  energies  are  more  realistic  in  the  stabilized  jellium  model  than  in  the 
ordinary  one,  particularly  for  Al. 

Future  work  with  stabilized  jellium  could  investigate  different  breakup  channels 
for  charged  clusters  (a  problem  which  has  been  studied  within  the  ordinary  jellium 
model  [14]),  self-compression  effects  (since  a  cluster  should  have  a  central  density 
bigger  than  the  bulk  value  [5  ] ),  large  clusters  (e.g.,  the  analysis  of  supershells  [21]), 
and  the  static  and  optical  response  of  clusters. 
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Abstract 

Nonlocal  density-functional  and  correlated  ab  initio  methods  have  been  used  to  compute  singlet- 
triplet  separations  for  the  Pd  atom  and  Pd^,  as  well  as  the  dissociation  energies  of  the  latter  in  its  ground 
and  firs*,  excited  states.  The  results  are  in  good  agreement  with  the  available  expenmental  data  and  with 
other  high-level  calculations.  Single-point  local  density-functional  computations  were  earned  out  for  Pd 
clusters  up  to  Pd22  (1012  electrons ) .  The  interaction  energy  per  Pd — '  bond  appears  to  be  approaching 
a  limiting  value  for  Pd22.  ©  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

We  are  interested  in  applying  density-functional  theory  (dpt)  to  small  clusters 
of  transition  metal  atoms.  One  of  our  objectives  is  to  determine  how  large  a  cluster 
is  needed  to  realistically  simulate  a  metal  surface,  in  order  to  permit  the  investigation 
of  chemisorption  and  catalytic  processes;  and  to  determine  the  ground  state  atomic 
and  electronic  structure  of  small  transition  metal  clusters. 

In  the  present  work,  we  have  focused  specifically  upon  palladium,  and  have 
proceeded  on  two  levels:  (a)  We  have  compared  sophisticated  nonlocal  dft  and 
ab  initio  methods  in  computing  the  singlet-triplet  separations  for  the  palladium 
atom  and  the  Pd2  molecule,  as  well  as  the  bond  length  and  dissociation  energy  of 
the  latter  in  each  state;  and  (b)  we  have  used  a  local  dft  approach  to  carry  out 
single-point  runs  on  clusters  as  large  as  Pd22  (1012  electrons). 

Methods 

E>ensity-functional  theory  [1-4],  as  it  has  been  applied  in  this  investigation,  is 
based  on  the  Hohenberg-Kohn  theorem  [5]  and  the  Kohn-Sham  procedure  [6]. 
In  the  latter,  the  total  energy  of  an  N-electron  system  (as  would  be  obtained  by 
solving  the  Schrddinger  equation  exactly)  is  found  by  minimizing,  with  respect  to 
the  electronic  density  p(r),  the  total  energy  functional  £[p], 

E[p]  =  J  n„,(r)p(r)  dr  -I-  r,[p]  +  Kdassfp]  +  E^Ap]  •  ( 1 ) 

Pe«(r)  is  the  external  potential  due  to  the  nuclei,  T^Ip]  is  the  kinetic  energy  of  a 
hypothetical  set  of  non-interacting  electrons  having  the  same  density  p(r)  as  the 
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system  under  consideration,  and  l^ciasslp]  is  the  classical  energy  calculated  using 
the  density  p(r). 

,,  r  ,  If  p(r)p{r')  dr  dr' 

(2) 

The  exchange-correlation  functional,  is  expressed  exactly  in  terms  of  the 

adiabatic  connection  variable  X  by  Eq.  (3): 

£x.[p]  =  f'  <^xl  d\  -  K,M  .  (3) 

Jo 

The  functions  are  the  solutions  of, 

//x^'x  =  £x^x  (4) 

where, 

//x  =  f+Px  +  XF,,.  (5) 

In  Eq.  (5),  the  electron-electron  interaction  operator,  V^e,  is  linearly  scaled  from 
zero  to  one  by  the  variable  X  in  a  manner  such  that  vx  yields  the  same  p(r)  for  any 
X  in  the  interval  [0,1],  (According  to  the  Hohenberg-Kohn  theorem,  the  potential 
determines  the  density  [5].) 

The  energy  quantity  as  given  by  Eq.  (3)  is  clearly  not  equivalent  to  the  sum 
of  the  exchange  and  correlation  energies  as  defined  in  ab  initio  formalism.  Unfor¬ 
tunately,  the  expression  for  E^Ap]  in  Eq.  (3)  can  only  be  evaluated  in  special  cases. 
One  of  these  is  an  uniform  electron  gas  (jellium),  for  which  exact  ground-state 
solutions  have  been  obtained  by  stochastic  simulations  [7 ] ,  fro  n  which  functionals 
have  been  constructed,  primarily  by  fitting  procedures  [8].  These  functionals  cor¬ 
respond  to  the  local  density  approximation.  First-order  corrections  can  be  made 
using  gradients  of  the  density,  resulting  in  so-called  nonlocal  approximations  [1,2]. 

For  the  Pd  atom  and  for  Pd2,  we  have  used  the  sophisticated  nonlocal  exchange- 
correlation  functional  of  Perdew  and  Wang,  the  generalized-gradient  approximation 
(GGA)  [11,12].  This  was  developed  entirely  from  first  principles,  and  has  no  fitted 
parameters.  We  have  recently  coded  this  functional  into  the  dft  program  deMon 
[  1 3, 14] .  The  single-point  runs  on  Pd  clusters  were  carried  out  with  the  dft  program 
DMol  [15],  which  uses  a  local-density  approximation  [16].  We  have  used  the 
standard  double  numerical  (dn)  basis  set.  The  ab  initio  results  that  we  report  were 
obtained  with  Gaussian  92  [17]. 


Results 


Pd  Atom 

The  standard  deMon  Gaussian  DZVPP  basis  set  for  Pd,  ( 18s,  12p,  9d)  contracted 
to  (6s,  5p,  3d),  yields  a  singlet-triplet  separation  of  33.5  kcal/mol,  which  differs 
considerably  from  the  experimental  18.8  kcal/mol  [18].  Accordingly  we  used  Hu- 
zinaga’s  (17s,  1  Ip,  8d)  basis  set  [19],  uncontracted,  to  compute  the  singlet-triplet 
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separation  by  both  ab  initio  and  dft  procedures.  The  two  dft  calculations  were 
carried  out  with  the  Perdew-Wang  ftclp]  functionals  GGa-pw86  [9,10]  and 
GGA-PW91  [1 1,12].  The  results  are  in  Table  I. 

Most  of  the  ab  initio  correlated  results  and  both  of  the  dft  are  within  3  kcal/ 
mol  of  the  experimental  value.  It  is  seen  that  the  effect  of  including  the  core  electrons 
can  be  quite  significant,  as  much  as  5.3  kcal /mol.  It  is  relevant  to  mention  that  a 
QCi  (full  core)  calculation  was  terminated  after  having  used  more  than  one  hour 
of  CPU  time  on  a  CRAY  C-90  for  only  two  iterations  (out  of  possibly  15)  of  the 
triple  substitutions;  in  contrasi,  the  DFT  calculations  despite  slow  convergence  (more 
than  1500  iterations  for  the  triplet)  required  only  about  10  min. 

Pd2 

Initial  results  obtained  with  the  DZVPP  basis  set  were  qualitatively  incorrect,  so 
we  have  continued  with  the  Huzinaga  [19].  Our  computed  bond  lengths,  total 
energies  and  dissociation  energies  for  the  triplet  ground  state  and  the  first  singlet 
excited  state  of  Pdj  are  given  in  Table  11,  as  are  the  singlet-triplet  separations,  T^. 
Experimental  and  other  calculated  data  are  also  included.  Our  dft-gga-pw  86 
ground  state  dissociation  energy  and  triplet-singlet  separation  are  in  good  agreement 
with  the  MCSCF(CASSCF)MRSDCI  results  and  with  one  of  the  experimental  values 
for  Df. 

Pd  Clusters 

The  Pd  clusters  that  were  investigated  are  shown  in  Figure  1.  They  were  given 
geometries  corresponding  to  the  [1 1 1]  surface  of  the  metal,  with  the  closest  inter- 


Table  I.  Calculated  energies  of  the  singlet  ground  state  and  first  excited  triplet  state  of  the 
palladium  atom,  using  the  Huzinaga  ( 1 7s,  I  Ip,  8d)  basis  set. 


Method 

Singlet 

(hartrees) 

Triplet 

(hartrees) 

Separation 

(kcal/mol) 

Hartree-Fock 

-4937.79120 

-4937.78582 

3.4 

mp2  (frozen  core) 

-4937.91174 

-4937.87904 

20.5 

MP2  (full  core) 

-4938.75759 

-4938.71649 

25.8 

mp3  (frozen  core) 

-4937.90090 

-4937.87540 

16.0 

mp3  (full  core) 

-4938.68426 

-4938.66039 

15.0 

MP4SDQ  (frozen  core) 

-4937.90689 

-4937.87748 

18.5 

mp4sdq  (full  core) 

-4938.71916 

-4938.68446 

21.7 

QCisDt  (frozen  core) 

-4937.90824 

-4937.87999 

17.7 

DFr-GGA-PW86 

-4940.67375 

-4940.64130 

20.4 

dft-ck;a-pw91 

-4940.66042 

-4940.62611 

21.5 

Experimental* 

188 

MCSCF(CASSCF)/FOCI  +  P.Cl'’ 

13  8 

•  Reference  [18], 
Reference  [20], 
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Table  II.  Results  for  the  palladium  dimer  ground  slate  tnplet  and  first  excited  state  singlet  using  the 

(17s,  lip.  8d)  basis  set. 


Method 

Tnplet 

Singlet 

Separation 

T, 

(kcal/mol) 

Energy 

(hartrees) 

R, 

(A) 

(kcal/mol) 

Energy 

(hartrees) 

R. 

(A) 

D. 

(kcal/mol) 

Harlree-Fock 

-9875.57302 

2  78 

59 

mp2  (frozen  core) 

-9875.81501 

2.46 

-5.3 

-9875.82866 

2.9 

33 

-8.6 

DFT-GOA-PW86 

-9881  37740 

2.62 

188 

-9881.36716 

2.7.  -2.8“ 

12.3 

6.4 

Experimental* 

16.9. 26.0 

MCSCFtCASSCFlMRSDCl" 

2  48 

19.6 

2  87 

3.6 

DFT  (model  core 

potential)' 

2.46 

31.3 

2.64 

21.9 

9.5 

*  Reference  (21]. 

"Reference  [22]. 

‘  Reference  (23). 

“*  A  very  shallow  minimum  was  obtained 


Figure  I.  Palladium  clusters  that  v'ere  studied  using  a  local  density-functional  theory 

approach. 
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Table  HI.  Local  density-functional  theory  results  for  palladium  clusters. 


Cluster 

Number  of 
Pd— Pd 
bonds 

Energy 

(hartrees) 

AE  =  £(Pd„) 

-  n£(Pd) 

A£  per 
Pd— Pd  bond 
(kcal/mol) 

(hartrees) 

(kcal/mol) 

Pd, 

0 

-4935.1926 

-0.0 

Pd2 

I 

-9870.4412 

-35.1 

-35.1 

Pdj 

3 

-14805.7079 

-0.130 

-81.6 

-27.2 

Pd4 

6 

-19741.0160 

-0.2456 

-154.1 

-25.7 

Pd, 

12 

-34546.8208 

-0.4724 

-296.5 

-24.7 

Pd,o 

24 

-49352.7822 

-0.8561 

-537.2 

-22.4 

Pd,2 

24 

-59223.2213 

-0.9099 

-23.8 

Pd„ 

57 

-93770.5585 

-1.8988 

-1191.5 

Pd22 

69 

-108576.53 

-2.29 

-1438.5 

-20.8 

atomic  distance  in  each  case  being  set  equal  to  the  experimentally  determined  2.75 
A  [24] .  The  calculated  total  and  interaction  energies  are  given  in  Table  III. 

The  dissociation  energy  of  Pd2  is  predicted  from  Table  Ill  to  be  35.1  kcal/mol, 
which  is  considerably  greater  than  our  dft-GGa-pw86  value  in  Table  II,  12.3 
kcal/mol.  This  could  reflect  both  the  difference  in  bond  length  and  also  the  well- 
known  tendency  of  local-density  approximations  to  overestimate  dissociation  ener¬ 
gies  [  25-27  ] .  However  a  key  point  that  emerges  from  Table  III  is  that  the  interaction 
energy  pei  bond  appears  to  be  nearing  a  limiting  value  for  Pdjj.  This  suggests  that 
clusters  of  this  size  may  be  reasonable  models  for  a  metal  surface  (including  the 
underlying  atoms). 


Discussion 

We  have  demonstrated  the  feasibility  of  local  density-functional  calculations  on 
systems  having  as  many  as  1012  electrons  (Pd22).  At  the  local /DN  level,  660  basis 
functions  are  required  for  the  latter  computation.  The  total  energy  is  in  the  neigh¬ 
borhood  of -108576  hartrees.  At  this  order  of  magnitude,  the  error  in  the  energy 
due  to  the  use  of  REAL *8  precision  is  about  6  kcal/mol.  The  use  of  REAL ’"16, 
which  would  solve  the  problem  due  to  truncation  in  the  floating  point  operations, 
is  not  practical  because  there  are  not  commercial  floating  point  processors  of  128 
bytes;  accordingly  the  REAL '"16  arithmetic  must  be  software-implemented.  Ma¬ 
chines  like  the  CRAY  YMP  then  require  about  10  times  more  CPU  time  to  process 
REAL  ""1 6  operations.  This  is  in  addition  to  the  inherent  problem,  in  these  Pd 
clusters,  of  a  large  number  of  low-lying  states  near  the  ground  state,  which  makes 
the  SCF  convergence  very  slow  and  tedious.  The  convergence  problem  is  independent 
of  the  program.  On  the  CRAY  YMP8-864,  each  scF  cycle  required  764  seconds 
of  CPU  time;  the  numoer  of  cycles  needed  for  a  respectable  energy  was  about  20, 
and  much  more  for  a  good  density.  About  500  Mbytes  of  scratch  disk  space  were 
required,  which  is  not  excessive  in  view  of  the  size  of  the  system. 

Despite  these  figures,  there  is  reason  for  optimism.  With  massively  parallel  com¬ 
puters,  turn-around  times  are  reduced  by  at  least  one  order  of  magnitude,  dft 
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codes  are  more  vectorizable  than  are  ab  initio,  and  run  more  efficiently  by  a  factor 
of  3  to  5.  This  efficiency  can  be  made  much  higher  in  massively  parallel  computers. 
DFT  codes  are  still  in  development,  and  improvements  can  be  anticipated. 
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Abstract 

We  exploit  the  unitary  group  formalitm  in  formulating  a  multiconfigurational  single-reference  coupled 
cluster  method  for  cases  involving  one  or  two  electrons  in  open  shells.  The  linear  version  of  CCSD  theory 
for  the  simple  open  shell  case  and  for  low  lying  singlet  states  of  closed  shell  systems  are  considered  in 
detail.  The  entire  formalism  is  related  to  the  unitary  group  based  CISD  method,  and  explicit  expressions 
for  size-extimsivity  corrections,  leading  to  the  L-ceSD  formalism,  are  given.  An  illustrative  example  of 
the  minimum  basis  set  model  of  the  BeH  radical  is  also  presented.  £>  1993  John  Wiley  A  Sons,  Inc. 

Introduction 

The  connected  cluster  structure  of  an  exact  wave  function  [  1  ] ,  describing  a  non- 
degenerate  closed  shell  ground  state,  is  well  known  from  many-body  perturbation 
theory  (mbpt)  (see,  e.g.,  Ref.  [2]).  This  result,  often  referred  to  as  the  connected 
cluster  theorem  [2]  (in  eontrast  to  a  better  known  linked  cluster  theorem),  im¬ 
mediately  implies  the  standard  coupled  cluster  (cc)  exponential  Ansatz  for  the 
corresponding  wave  operator.  This  cluster  A  nsatz  proved  to  be  very  useful  when 
designing  size  extensive  methods  that  efficiently  account  for  many-electron  corre¬ 
lation  efTects  in  atomic,  molecular,  and  even  in  extended  (solid  state  or  polymer) 
systems. 

An  extension  of  the  CC  formalism  to  degenerate  or  quasidegenerate  closed  shell 
states,  or  in  fact  to  general  open  shell  systems,  proved  to  be  most  exacting.  Although 
the  spin-noiiadapted,  single  reference  CC  formalism  exploiting  the  unrestricted 
Hartree-Fock  (UHF)  reference  can  often  provide  very  useful  results,  even  in  open 
shell  situations,  it  cannot  be  regarded  as  a  bona  fide  theory  in  view  of  its  inherent 
limitations  (spin  contamination  of  the  resulting  wave  functions,  inability  to  distin¬ 
guish  various  components  of  more  complex  multiplet  systems,  nonanalytic  behavior 
of  potential  energy  hypcrsurfaces,  etc.).  It  has  been  generally  recognized  that  a 
proper  account  of  quasidegeneracy  effects  requires  a  multireference  (MR)  veision 
of  a  standard  single  reference  (SR)  approach,  based  on  the  effective  L’'>’’.<iltonian 
concept  and  the  generalized  Bloch  equation.  Two  different  types  of  generalized  MR 
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cc  Ansatze  were  proposed,  leading,  respectively,  to  the  so-called  valence  universal 
or  Fock  space  and  state  universal  or  Hilbert  space  MR  cc  approaches  (see,  e.g.. 
Refs.  [2-4]).  While  initially  only  various  versions  of  the  former  approach  were 
exploited  in  actual  applications  (see,  e.g..  Ref.  [5]),  lately  the  state  universal  for¬ 
malism  is  also  being  explored  [6-9] .  Unfortunately,  our  knowledge  of  cluster  struc¬ 
ture  in  the  general  open  shell  case  is  very  limited,  in  contrast  to  the  closed  shell 
case  where  the  situation  is  much  simpler,  since  this  structure  can  considerably  vary 
from  case  to  case  [e.g.,  states  of  different  multiplicity,  covalent  vs.  ionic  type  states 
in  the  sense  of  the  valence  bond  ( VB )  theory,  multiradicals  or  transition  complexes 
far  away  from  equilibrium  geometry,  etc.].  Nonetheless,  it  seems  plausible  that  a 
satisfactory  description  may  be  achieved  in  most  cases  by  considering  only  pair 
clusters  with  respect  to  each  reference  considered,  as  in  the  closed  shell  case,  as 
long  as  a  suitably  large  model  space  is  employed.  However,  since  a  given  configu¬ 
ration  has  generally  a  different  excitation  order  with  respect  to  different  reference 
configurations,  the  truncation  scheme  for  the  MR  approaches  is  much  more  complex 
than  in  SR  methods  and  much  has  yet  to  be  learned  concerning  this  topic. 

In  any  case,  the  available  limited  experience  with  MR  cc  approaches  leaves  little 
doubt  that  they  are  much  more  demanding  than  the  SR  methods,  and  the  com¬ 
putational  difficulties  and  complexity  grow  with  the  increasing  dimensionality  of 
the  model  space  employed.  For  this  reason,  much  attention  has  been  paid  to  ap¬ 
proaches  based  on  incomplete  model  spaces  [10,1 1]  (it  should  be  noted  that  prac¬ 
tically  all  existing  applications  of  valence  universal  approaches  were  carried  out 
with  incomplete  model  spaces  and  often  with  different  ones  for  different  geometries). 
Thus,  for  purely  pragmatic  reasons,  it  seems  important  to  explore  the  capabilities 
of  SR  CC  approaches  employing  an  open  shell — generally  multideterminantal  or 
multiconfigurational  ( MC) — reference.  Such  an  approach  naturally  arises  when  we 
apply  the  SR  cc  formalism  to  a  spin-adapted  reference  configuration,  such  as  given, 
for  example,  by  the  electronic  Gel’fand  state  of  the  unitary  group  approach  (uga) 
[12-16].  This  approach  can  be  paiticularly  useful  for  various  special  cases,  such 
as,  e.g.,  the  ground  states  of  radicals  (when  in  fact  a  single  determinantal  reference 
suffices)  or  the  low-lying  excited  singlet  or  triplet  states  of  molecules  having  a  closed 
shell  ground  state.  In  designing  this  type  of  mc-SR  open-shell  cc  theories,  the  uga 
[  1 2- 1 6  ]  or  Clifford  algebra  uga  (  cauga  )  [  1 7  ]  formalism  may  be  very  helpful  [  1 8- 
20] ,  since  a  chosen  reference  may  be  simply  characterized  as  an  electronic  Gel’fand 
state,  a  cauga  VB-type  state  or  yet  another  mc  state,  v/hile  the  required  cluster 
operator  can  be  expressed  in  terms  of  unitary  group  U{n)  generators  or,  more 
generally,  in  terms  of  replacement  operators  from  a  universal  enveloping  algebra 
of  (7(«),  forming  the  so-called  external  algebra  [4,18-20].  The  derivation  of  ap¬ 
propriate  CC  equations  and  of  the  energy  expression  then  reduces  to  a  repeated 
application  of  basic  commutation  rules  for  (7(n)  generators  or  replacement  operators 
and  the  rules  for  the  action  of  these  operators  on  a  reference  state.  One  of  the  first 
exploitations  of  this  idea  [21]  was  recently  encoded  for  the  high-spin  reference 
states  by  Janssen  and  Schaefer  [22],  who  designed  general  symbol  manipulation 
code  (written  in  C)  that  generates  the  appropriate  cc  equations  and  automatically 
produces  the  required  FORTRAN  code.  However,  these  authors  employed  the  so- 
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called  generator  basis  [15],  which  produces  a  set  of  nonorthogonal,  overcomplete, 
and  unnormalized  states  which,  in  our  view,  may  lead  to  various  divergence  prob¬ 
lems  when  iteratively  solving  cc  equations.  A  similar  program,  based  on  the  UGA- 
Type  MC-SR  cc  approach  [23]  and  employing  spin-adapted  and  orthonormal  excited 
state  manifold,  was  designed  very  recently  and  is  currently  being  tested  [24]. 

In  this  paper  we  shall  present  a  preliminary  description  of  our  implementation 
of  the  CAUGA  formalism  in  mc-SR  cc  approaches,  concentrating  on  a  few  special — 
though  very  important — cases  of  simple  open  shell  doublets  and  singlet  and  triplet 
excited  states  of  closed  shell  systems.  Following  a  brief  outline  of  a  general  formalism 
in  the  next  section,  we  illustrate  our  approach  on  the  simplest  open  shell  case  at 
the  linear  level  of  approximation  in  the  third  section.  The  required  explicit  expres¬ 
sions  for  the  open  shell  singlet  case  are  also  given  in  this  section.  An  application 
of  this  level  of  theory  to  a  simple  minimum  basis  set  (MBS)  model  of  BeH  may  be 
found  in  the  fourth  section,  and  the  conclusions  concerning  general  MC-SR  CC 
approaches  that  can  be  drawn  at  this  stage  are  presented  in  the  last  section. 

Basic  Formalism 

In  our  implementation  of  the  general  MC-SR  CC  formalism,  we  first  construct  a 
set  of  excitation  operators  { G, }  that  generate  spin-adapted  and  orthonormal  con¬ 
figurations  when  applied  to  the  reference,  namely, 

!<!>,>  =  (7, 1  ^o>,  =  (1) 

Since  these  operators  generate  spin-adapted  configurations  when  acting  on  various 
states  arising  through  the  action  of  G,'s  and/or  of  a  spin-independent  Hamiltonian 
H  on  l4>o)<  the  whole  formalism  is  spin-adapted  and  no  spin-contamination  can 
arise.  The  cluster  Ansatz  employed  takes  the  standard  SR  form, 

|<'>  =  exp(7')|4>o>,  (2) 

and  the  cluster  operator  T  is  represented  as  a  linear  combination  of  excitation 
operators  G, , 

T='Zt.G,,  (3) 

I 

where  the  t,  designate  the  corresponding  cluster  amplitudes.  Note  that,  in  general, 
with  I  $0)  representing  an  MC  reference,  the  excitation  order  of  each  G,  is  not  well 
defined.  Nonetheless,  we  have  to  truncate  the  connected  cluster  components  in  (3 ) 
at  some  reasonable  level,  namely,  an  essentially  biexcited  level,  so  that  we  obtain 
an  approximation  corresponding  to  the  ceSD  method  of  a  standard  SR  case,  when 
we  use  the  approximation  T  =  Ti  +  T2.  Thus,  in  cases  considered  by  us,  there  are 
at  most  two  orbitals  in  |4>o)  that  are  not  doubly  occupied  (we  shall  refer  to  them 
loosely  as  active  or  valence  orbitals  in  analogy  with  the  general  MR  case).  However, 
in  contrast  to  the  general  MR  situation,  all  active  orbitals  (though  not  the  spin 
orbitals )  are  singly  occupied  in  every  configuration  or  Slater  determinant  constituting 
our  MC  reference  !4>o). 
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Thus,  in  our  developments,  we  shall  consider  all  genuine  mono-  and  biexcitations 
from  1  #o) .  as  well  as  the  so-called  pseudomono-  and  pseudobiexcitations  involving 
core-active  and  active-virtual  excitations.  With  this  generalization  in  mind,  we  can 
again  classify  our  approximations  using  the  standard  acronyms  CCSD,  CCSDT,  etc., 
as  in  the  SR  case.  Relying  on  uga,  we  then  express  the  excitation  operators  G, , 
corresponding  to  single  and  double  excitations  from  |  €>0)  in  the  above-mentioned 
generalized  sense,  in  terms  of  linear  combinations  of  t/(n)  generator  products. 
Clearly,  each  reference  |4>o)  will  require  a  specific  definition  of  such  an  oper¬ 
ator  set. 

We  shall  rely  on  a  standard  algebraic  derivation  (see,  e.g..  Refs.  [2]  and  [4])  of 
SR  CC  equations,  obtaining  for  the  energy 

£  =  <4'o|exp(-7')//exp(7')|4>o>  (4a) 

or 

A£=£-<4.o|//|4>o>  =  <-^ol[//,  7-]|<^o>  (4b) 

-1-  K'*‘olI(//,7’],71l<i>o>+  •••  , 
with  the  cluster  amplitudes  given  by  a  solution  of  cc  equations 

«/  +  2  Vy  +  i  2  c,,kt)tk  +  •  •  •  =  0  ,  (5) 

! 

where  the  coefficients  have  the  form 

a,  =  <4>,|//|4-o>  =  <^olC?J//|<l>o>,  (6) 

b„  =  G,]|<l>o>  =  G,]|4>o>,  (7) 

c„k  =  G,],  G,ll<I>o>  =  <4>o|G:1[//,  G,],  Gfcl|#o>,  etc.  (8) 

Assuming  that  |  #0)  represents  the  rhf  reference  for  a  given  state,  we  can  interpret 
A£,  Eq.  ( 5 ) ,  as  the  correlation  energy  for  that  state.  Note  also  that  the  excited  state 
manifold,  onto  which  we  project,  is  given  as  a  linear  span  of  configurations  If*,), 
/  #  0,  defined  by  excitation  operators  G, ,  1  $, )  =  G,  |  #0)  •  For  the  sake  of  brevity, 
we  shall  illustrate  this  formalism  in  the  next  section  on  the  simplest  possible  case, 
involving  one  singly  occupied  orbital  outside  the  closed  shell,  restricting  ourselves 
to  the  linear  level  of  the  cc  theory.  The  open  shell  singlet  case  ( involving  two  active 
orbitals )  was  briefly  described  in  Ref.  [  20  ]  and  the  explicit  expressions  for  the  linear 
level  of  the  theory  are  also  given  in  the  next  section.  A  complete  formulation  for 
all  three  cases  mentioned  above  will  be  given  elsewhere. 

L-CCSD  Formalism  for  Simple  Open  Shells 

For  the  sake  of  brevity,  we  restrict  ourselves  to  the  linear  form  of  CC  theory. 
Thus,  the  development  presented  below  parallels  the  recent  work  by  Neogrady  et 
al.  [25],  employing,  however,  a  very  different  formalism.  For  near  equilibrium 
geometries,  the  linear  approach  represents  already  a  useful  approximation.  An  ex- 
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tension  of  this  approach  to  a  fully  quadratic  case,  as  well  as  to  other  cases  mentioned 
above,  has  been  carried  out  [26]  and  is  presently  being  tested. 

When  considering  the  linear  approximation, 

a. +  2  b,tj  =  0  ,  (9) 

J 

we  can  evaluate  the  required  coefficients  a,  and  b,j  either  directly,  using  expressions 
(6)  and  (7),  respectively,  or  we  can  exploit  the  relationship  between  the  l-ccsd 
and  CISD  methods  [2,27],  The  latter  approach  has  the  advantage  that  we  can  exploit 
existing  codes  for  various  ciSD  implementations,  not  to  mention  a  certain  peda¬ 
gogical  value  that  indicates  explicitly  the  corrections  for  unlinked  cluster  contri¬ 
butions  that  are  present  in  the  ciSD  approach,  and  are  responsible  for  its  lack  of 
size  extensivity.  While  no  corrections  arise  for  the  absolute  terms,  Eq.  (6),  so  that 

a,  =H.o  =  <•!>,  |//|#o>,  (10) 

we  have  for  the  linear  coefficients  that 

b,j  =  H,j-A,,  (11) 

where  H,j  =  |/f|^y)  are  standard  ci  matrix  elements  and  Ay  represent  size 

extensivity  corrections, 

Ay  =  <$olG!c?,//|*o>.  (12) 

Noting  that  the  product  G]Gj  represents  a  linear  combination  of  C/(n)  generator 
products  of  the  type 

•  •  EJi* ,  (13) 

where  the  difference  between  the  subscript  and  superscript  index  sets  L=  {It,  h, 
...  ,lk}  and  U  =  {u\,  u^,  . . . ,  Uk),  respectively,  is  uniquely  determined  by  the 
given  product  gJGj,  we  can  give  the  size  extensivity  corrections  Ay  the  following 
more  explicit  form.  Since  H  involves  at  most  two-body  terms,  the  index  sets  U  and 
L  can  differ  in  at  most  two  orbital  labels  (cf.,  e.g.,  R.ef.  (4) ),  so  that  we  can  distinguish 
the  following  three  nonvanishing  cases: 

(i)  U  =  L,  when 

Ay  =  <4>o|//|#o>5„  =  £o5v-  (14) 

(ii)  7;A£  =  (t/UL)\(C/n£)=  {«,/), when 

Ay  =  [ff  +  (//|/k)5..«,]<<J>o|cIg,£L|-I>o> 

+  Z  {ml\mu){^o\G]GjE'„E1!\^o) ,  (15) 

where  «/  is  the  occupation  number  of  the  /th  orbital  in  4>o  and  the  sum  over  m 
extends  over  all  singly  occupied  orbitals  in  4>o  that  are  distinct  from  u  and  /.  Finally, 

(iii)  UAL  =  { Ml  s  Mj  =  u,  /,  =  /,  /j  =  A:},  when 

Ay  =  [(1  +  6„,)(1  +  fi,*)]-'^,„,(/M|fct;)<<I>olGJC;£!,£S!'fo>  ,  (16) 
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where  is  a  symmetrizer  in  u  and  v  labels, 

^^=l+(uv),  (17) 

with  (wn)  designating  the  transposition  of  u  and  v.  Further,  we  defined  that 

/“  =  2/“  +  2  [«m(wm|/M)  -  (ml\mu)] ,  (18) 


with  the  sum  extending  over  all  orbitals  occupied  in  |$o)  and  z"  designating  the 
one-electron  integral  <M|f  |/).  Finally,  we  note  that  we  employ  here  Mulliken  no¬ 
tation  for  two  electron  integrals,  (ij\kl)  =  (iklvljl).  Thus,  once  the  excitation 
operators  G,  are  defined,  we  can  calculate  the  size-extensivity  corrections  Ay  as 
mean  values  of  multigenerator  products  in  the  reference  state  |$o).  Indeed,  this 
is  the  case  for  all  coefficients  arising  in  cc  equations,  as  Eqs.  ( 6 )-  ( 8 )  indicate.  The 
explicit  expressions  for  these  quantities  in  terms  of  cluster  amplitudes  and  one-  and 
two-electron  integrals  can  be  worked  out  either  “by  hand”  or  with  the  help  of  a 
computer  program.  Our  results,  including  those  shown  below,  were  obtained  in 
both  ways. 

As  mentioned  above,  we  shall  present  the  relevant  excitation  operators  G,  and 
the  corresponding  correction  terms  Ay  for  the  simple  open  shell  case,  i.e.,  for  a 
doublet  state  with  one  electron  outside  the  closed  shell.  Thus,  schematically,  we 
have  that 


14>o>=  |[core]0;,>,,  (19) 

Obviously,  there  is  j  great  flexibility  in  defining  the  excitation  operators  G, ,  since 
our  singly  and  doubl>  excited  configurations  involve,  generally,  three  and  five  singly 
occupied  orbitals  and  we  are  free  to  choose  any  coupling  scheme  we  like.  In  view 
of  the  fact  that  we  are  using  the  CI-CC  connection,  as  well  as  uga  formalism,  to 
formulate  the  l-ccsd  equations,  we  shall  systematically  employ  the  Gel’fand-Tsetlin 
basis  and  thus  define  our  excitation  operators  in  such  a  way  that  they  produce 
electronic  Gel’fand  states  when  acting  on  our  reference  configuration.  In  this  way 
the  existing  Cl  programs,  employing  Gel’fand-Tsetlin  basis,  can  be  simply  modified 
to  obtain  l-ccsd  formalism  once  the  corrections  A,;,  are  known. 

To  simplify  our  notation,  we  designate  core  orbitals  by  the  letters  from  the  be¬ 
ginning  of  the  alphabet,  i.e.,  as  a,  b,  c,  etc.,  while  reserving  the  letters  r,  s,  t,  etc. 
from  the  end  of  the  alphabet  for  virtual  orbitals.  The  only  valence  (active)  orbital 
is  labeled  by  p  as  already  indicated  in  Eq  (19;.  Moreover,  we  shall  assume  that 
our  reference  1 4>o),  Eq.  ( 19),  represents  an  rohf  wave  function  and  that  the  SCF 
orbitals  are  numbered  in  the  order  [core]  [virtual]  [actn  e].  The  desired  excitation 
operators  then  take  the  form: 

(i)  Single  and  pseudosingle  excitations: 


g;  =  e',, 

G'L  =  -E'’a, 

(1/V2)£i  (m=l), 

(1/V6)(£SE;  +  £^£S)  (m  =  3). 


(20a) 

(20b) 

(20c) 
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Here  and  in  the  following,  the  intermediate  spin  multiplicity,  designated  by  m 
(»t  =  1.3).  is  employed  to  label  different  components  of  the  same  multiplet. 

( ii )  Double  and  pseudodouble  excitations: 

"‘G'Jl,  =  (-1)*-^'  '"^„„/f!;£T/V2m(l  +6„,) 

(in  =  I  (or  a  =  h:  111  =  1,  3  for  a  <  /») ,  (20d) 

"'g::,  =  (-1  )*''*' Vi  +5rAi-z +  (/»  -  1 

(m  =  1  for  r  =  .v;  m  =  1,  3  for  r  <  .v) ,  (20e) 


IH/'K  ^ 

tlU  ' 

l/V"  1  p-r  V-' 

^  aa  2  ^ 

'(I/V2  )/:■'/:;; 
-(i/\/6)X,/:';/:y^;, 

>1  II 

(20f) 

(20g) 

ni/'H  _ 

- 

T  T 

(in  =  1) 

(m  =  3) 

(20h) 

cy 

II 

■  -  -1 

(1/2V3)  -X,  'S,„E':,E',E', 

(m=  1), 

(m  =  2) . 

(w  =  3),  (20i) 

(l/6)-X/.(/i:. 
,  (l/6\/2)X/.( 

E),^2E'X,>E'r) 

-E'aE},  +  E'aElE'p+  3E 

'aElE',) 

( w  =  4 ) , 

(m  =  5) . 

In  the  last  case,  when  five  singly  occupied  orbitals  are  involved,  we  distinguish  the 
five  resulting  doublets  by  the  superscript  w,  m  -  1.  2,  3.  4.  5.  The  spin-adapted 
symmetrizer  is  defined  as 


1  +(2-m)(y)  (w=l,3).  (21) 

The  corrections  associated  with  excitation  operators  (20).  are  given  in  Table 
1,  while  the  required  coefficients,  appearing  in  these  expressions,  may  be  found  in 
Tables  II  and  III.  It  should  be  noted  that  these  expressions  reflect  the  symmetry 
that  is  involved  in  the  excitation  operators  employed.  Consider,  for  example,  the 
most  complicated  case  of  doi'bly  excited  states  involving  five  open-shell  orbitals, 
I  =  "'Gah  1 4>o).  The  resulting  five  doublets  may  be  characterized  by  the  fol¬ 
lowing  Young  or  Weyl  tableaux 


1 

2 

1 

2 

1 

3 

1 

3 

1 

4 

3 

4 

3 

51 

2 

4 

2 

2 

5 

5 

4 

i 

3 

_4_ 

3 

where,  for  greater  simplicity,  we  designated  the  singly  occupied  orbitals  a,  h,  r,  s, 
and  p  by  1,2.  3.  4.  and  5.  The  symmetry,  or  transformation  properties,  of  the'^e 
states  with  respect  to  transpositions  (ah)  and  (r.v)  of  core  and  virtual  orbitals,  re¬ 
spectively,  are  summarized  in  Table  IV.  Note  that  the  symbol  [31]  implies  the 
transformation  property  characterized  by  [  3 1  ]  partition,  namely. 
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1  ^BiJ  1  Size  c.\tensi\  il>  correclion  term  for  the  simple  open  shell  doublet  case  * 


\ 

Footnote 

"’g:, 

Op 

\'m/2  [  /C  +  ';a/»i/v>)) 

(2  - 

"OS. 

GT 

.\s  /a  -  (2  -  wK/ipIr/i)] 

b 

"a; 

A.i<.  (*mn  1./?  +  (f>P\PP)] 

b 

"OS 

G' 

(1)  l  =  r=y.  -I/;-2(«/>|z/>)) 

(ill  1  =  r<s:  \m/2[{ni  -2)11  +  (upl-S/i)) 

(nil  /  =  i  >  r:  +  Vm/2V«/>|r/') 

"OS 

(i)  /  =  r  =  s:  -  (l/V'^I./T 
(11)  /  =  r  <  s;  (l/V«)/' 

(iii)  1  -  .\>  r:(n-  2)<w/f 

'G" 

O'r 

Gzr|o/>) 

OS 

(ur|r;i) 

"O', 

6„i  f',  +  {2-  n)(ap\rp)] 

'o'S 

/? 

'o'S 

-1/S  +  /tvlpp)] 

"OS 

O'r 

(2  -  /«)  V»n/2  (a\|up) 

c 

(r<s) 

G'S 

V»i/2((2  -  /iiXarl.v/))  +  (a\!r/>)) 

"OS 

(2  -  -  /n.,(a/) ;.«/>)) 

c 

'OS 

(2  -  m)Vm/2/S 

c 

"O'S 

(2  -  /»)(«  -  2)<„,[/S  +  (a/>l/v>)] 

"OS 

d'r 

-V«i/2((2  -  m)(ar\hp)  +  {ap\hr)] 

(u  <  h) 

OT 

-(2  -  m)'\fni/2{br\rp) 

b 

"g: 

-  (2  -  m)(2  -  nXV»m/2)(/)/)ir/>) 

h 

"O'S 

is„„  (2  -  III)  r. 

'O'S 

(2  -  m)Vm72(/S  +  {bp\pp)] 

c 

(f  ah 

O', 

(i)  t  =  r:  l^.£)„i|(n\lM  a„{ap\bs)] 

(ii)  I  =  s:  )f2C„,[{ar]bp)  +  a„{ap\br)] 

GT 

(I)  c  =  a:  ^aJi„lC'„i(br\sp)  +  D„i(h'i\ip)] 

(ii)  c  =  b:  ^b„lC„,{ar\sp)  +  /)„,(«' Irp)) 

"g; 

(i)  (•  =  a.  /  =  r  -  A„,„li  +  C„„{bp\.ip) 

(ii'  c  =  a,l  =  y  +  D„„{bp\rp) 

(iii)  <  =  b.l  =  r:  a„\A„,r,  +  C„,{up\sp) 

(i\)  <•  =  /),/  =  s;  a„(/f„„/S  +  D„„[ap\rp)] 

"O'S 

(i)  /  =  r /;„„/S 

(ii)  1  ^  y.  /■■„„/? 

0  ^p 

(i)  4  =  a.  G„„|/j:  +  (/i/>|p/))] 

(ii)  <■  ^  b  a„G„„l/S  +  (a/i|/)/>)) 

*  Wt  define  (  I)'"*'  VI  +  d,.,. 

''When  (  ^  (I.  Ssmmetric  to  («.  />)  when  m  ^  I  and  antisymmetnc  to  ('z.  />)  when  m  =  3. 
‘  When  I  =  r.  Svmmctric  to  (r.  s)  when  ,ii  =  1  and  antisymmetric  to  (r.  s)  when  m  =  3. 
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1  ABi  r  11.  Ctx:tficiems  appearing  in  si7c  cxtcnsiMtv  correction  terms  A„  of  Tables  I  and  V. 


n 

m 

1 

3 

1 

3 

1 

3 

-1 

-1/2 

1 

V'3/2 

1 

V3/2 

1 

1/2 

1/2 

-V3/2 

-1/2^3 

1/2 

1/2 

-V3/2 

l/V'3 

0 

1 

0 

\lli3 

1/2 

■Vmn 

-1/2 

V3/2 

1/2^3 

1/2 

(34) 


1 

3 

2 

_5_ 

4 

1 

3 

2 

_5_ 

4 

l\fl 


J_ 

4 

2 

T 

(23) 


and  similarly  for  the  last  Young  tableau  in  (22).  We  thus  see  immediately  that  the 
symmetry  with  respect  to  the  transposition  {ah)  is  represented  by  the  coefficients 
a„  of  Table  III.  To  understand  the  symmetry  with  respect  to  the  transposition  of 
virtual  orbitals,  {rs),  we  define  the  following  matrix  T, 


T 


'10  0  0  0 

0-10  0  0 

0  0-10  0 

000  -1/3  2V2/3 

.0  0  0  2V2/3  1/3 


(24) 


Tablf.  III.  Coctlicienis  appearing  in  si/c  extcnsitiviiy  correction  terms  A„  of  Tables  I  and  V.' 


n 

1 

3 

m 

1 

2 

3 

4 

5 

1 

2 

3 

4 

5 

1 

1 

-■Vi 

1 

- 1 

0 

-1 

1 

.X2 

-1 

-1 

0 

-1 

-1 

0 

0 

X| 

0 

•Vj 

-A4 

n„„ 

-A| 

0 

-.'2 

0 

0 

0 

--X| 

0 

-.'2 

0 

^  mn 

-.Xf, 

•X(, 

V7 

0 

Aft 

■"Aj 

-Ar 

-A-4 

1 

'6 

X5 

-A^ 

-  X9 

-V|0 

^  mn 

--'6 

.XV, 

v? 

0 

Aft 

AA 

A5 

X'r 

-A  10 

”-'5 

Aft 

-Aft 

-~.u 

1 

Aft 

-A'5 

-■Vs 

--XV 

0 

^‘mn 

.V| 

-  V2 

0 

0 

0 

0 

0 

-V| 

“A: 

0 

^  mn 

■'l 

Ai 

0 

0 

0 

0 

0 

-A| 

A‘l 

-A4 

(in,„ 

“Vj 

-■'6 

.X7 

0 

-.Xft 

As 

Xs 

A4 

’The  following  definitions  are  used  to  simplify  the  table.  .Vi  =  1/^2;  .\2  =  V3/2;  Vi  =  l/Vfi; 
.V,  =  2/V^:  .\j  =  1/2V'2;  =  V'3/2V'2,  .v,  =  3/2^:  .v,'  -  l/2l^;  =  5/2V6:  .x,„  =  1/V3. 
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Table  IV.  Transformation  properties  of  five 
doublets,  characterized  by  Young  tableaux, 
Eq.  (22),  with  respect  to  transpositions 
( 1 2)  s  (ab)  and  (34)  =  («).* 


m 

(I2)  =  (u*) 

(34)  =  (rs) 

1 

S 

S 

2 

S 

A 

3 

A 

A 

4 

A 

[31] 

5 

A 

[31] 

*  S  and  A  designate  the  symmetnc  and  an¬ 
tisymmetric  behavior,  while  [31]  indicates  tne 
transformation  property  of  [31]  partition,  Eq. 
(23).  See  the  text  for  details. 


as  implied  by  Table  IV  and  Eq.  ( 23 ) .  It  is  then  easily  verified  that  the  5  X  2  matrices 
A  =  lAm„],  B  =  ^Ic.,  with  entries  given  in  Table  III,  satisfy  the  following 

relations 


TA  =  B,  TC  =  D,  and  TE  =  F .  (25) 

•  ^e  S’  nmetry  properties  involved  in  the  remaining  Ay  terms  can  be  found  in  an 
3  way. 

At  as  stage  it  is  wort!  .  hile  to  observe  that  the  above  results  immediately  imply 
‘■ize  extensivity  corrections  for  the  closed  shell  case.  The  only  possible  one-  and 
iw>body  excitation  operators  that  arise  in  this  case  are  'G'a,  'Gaa,  ‘GS,,  'G3.  and 
"'G'ib  {m  =  1,3).  Thus,  disreg  rding  the  cases  involving  other  types  of  excitation 
operators  and  eliminating  any  integrals  involving  the  active  label  p,  we  arrive  at 
Ay  corrections  pertaining  to  the  closed  shell  case.  It  is  thus  easily  seen  that  besides 
the  most  important  diagonal  terms  A„  =  Eg,  the  only  nonvanishing  Ay  terms  arise 
letween  double  and  single  excitation  operators.  Thus,  for  example,  for  G,  = 
"'Ga%  and  Gj  =  'Ga,  we  get 

Ay  =  <«i-ol('"(?a)^  =  -(2  -  m)i^n  .  (26) 

In  this  case,  the  result  is  symmetric  with  respect  to  transpositions  (afe)  and  (r5)  for 
m  =  1  and  antisymmetric  when  m  =  3.  Note  that  when  using  rhf  orbitals,  all 
these  correction  terms  vanish,  since  /„  =  0  for  any  r  and  a.  Finally,  in  the  l-ccd 
case,  the  only  non  vanishing  correction  terms  are  the  diagonal  ones,  A„  =  Eq. 

The  above-presented  scheme  may  be  easily — at  least  in  principle — extended  to 
other  open  shell  cases,  provided  that  we  can  define  a  suitable  set  of  excitation 
operators.  For  the  simplest  open  shell  singlet  case,  describing  low  energy  excited 
states  of  closed  shell  systems,  the  required  set  of  excitation  operator  may  be  found 
in  Ref.  [20].  The  size  extensivity  corrections  Ay  corresponding  to  this  set  of  exci¬ 
tation  operators  is  given  in  Table  V.  We  have  also  formulated  the  same  formalism 
for  simple  open-shell  triplet  states. 


Table  V.  Size  extensivity  correction  terms  Aj,  for  the  open  shell  singlet  case.' 


G,  G, 


A 


ij 


(j’k 

G‘ 

v/a 


Gs; 

G- 

oi 


"G 


ft 

ak 


ir  <  s) 


Cf  2(/!S-  +  (A-JiS^)1 

G‘  -2t/{  +  mklc)] 

G'k  4>'!!'(^[f.  +  (ak\kk)-(all\kli)] 

G‘  +  2(A-^|^r)] 

Gil  6„,,V2[/i  +  (A-A:|*^)l 

Gj  NAfU(bl^\l^lh-(bk\kli)] 

G'k  Vl  +  S„[fi  +  2(sk\ki^)] 

Gl  -in  +  {ak\kk)-(ak\kl^)] 

Gi  /*  +  2(U-|rfc) 

"G;  \^[yt-+(*rflM)] 

GT-  (-  1)‘-'J„,  V2/(l  +  +  (fl^lM)] 

Gj  [fi  -  (2  -  L*)  +  2i„,{blc  I  rJ)) 

G*  A',*\W2I(W|rif)  -  2(6ri*^')l 

”G;  0r(-l)*'*\TTi^«™,I/J  +  mkk)  -  {blc\kk)] 

GSK  5„,(l/V2)[/j  +  2(r*lA-{)] 

"G'^;  -i„„2lfi  +  {kk\k1c)] 

G;!c  (.VwV2)«„,[/?  +  -  (WIA:^)] 

Gl  -[/;  +  2(flA-|r*)-(af|/-f)) 

V2[7mi/f  +  Vm/2(aA:|j<:)  -  |8„,(aJ|sS)) 

^iLjf  +  '/^/2(ak\rk)  -  y„,{aJc\rk)] 

Gi:  -[2(ar|^S)-(a^|rA:)J 

V27™i[2(a.tlA:S)  -  (afisA:)] 

V^ff„,[2(arlfe^)-(a^|r^)] 

Gc  -Vm(l  +  5„)/2[(rA:|jt)  +  (rflsfc)] 

"Gi  -V2«,„[/5  +  2«tf|r*)] 

+  2(A-f  is^)] 

T™[/f  +  2(kk\rJc)] 

G"  (1  +  «„)/2 V^[/|  +  (aJf ifS)  -  (afclArJ)] 

"Gi'r  a„„2[/(  +  «cJlJ^)] 

Gil  (-1/2)1/*  + 2(^J|rA:)] 

(l/V2)3„,I/f  +  2{k]<\sk)] 

(l/V2)7„,[/*  +  2(*K|rA-)J 

GT  (ar|aA) 

Gi  (ar|rA) 

"Gi  V^[«„i  2  /;  +  (2  -  nKapIrp)  +  (a^lr?)] 

GS  -V2{rplr9) 

GX  V2(apla9) 

■Gil  [/;  +  2(rA|AA)] 

'Gii  -[/J  +  (aA|AA)-(a/:-|Af)] 

Gil  [2(ar|Af)-(aA|rA)J 

GT  .^?\W2(a5|aA) 

Gi  <t>T  Vm/2[(2  -  mKarIsfc)  +  (as|rfc)] 

"Gi  (2  -  m){6„„ft  +  t„„  i„„[{2  -  n)(2  -  m^apUp)  +  (a^ls?)]) 


Footnote 


b 


d 

d 


d 


d 


f 

t 


h 


h 


(Continued) 


Table  V,  {Omtmued) 


O', 

Footnote 

cs. 

i„,'i/2(ap\aq) 

0fV^iyf +  2(.i^|AJ)J 

h 

"O'", 

<f>?(2  -  n)l„„[n  +  {ak\kk)  -  {aklkk)] 

G'i 

A„,(l/i^)[2(as|A^)-(aA|iA)l 

h 

"az. 

Cii 

-(PT  Vw/2[(2  -  wKarlfiA)  +  (aA-|fer)) 

(a  <  h) 

Of 

-d”V'^(fir|rA) 

d 

"c; 

-  lAmnyf  +  ‘m,  “  n}(2  -  mK*/)|rp)  +  (hq\rq)]\ 

d 

03: 

-i„i]f2{rp\rq) 

o« 

-imiliaplhq)  +  {aq\bp)] 

nr;rk 

^ah 

«"Wyf +  2(rA|AA)) 

'Pc'f^lJi  +  (MIAA)  -  (MIAA-)] 

d 

05 

-i„,(l/V2)(2(Ar|AA)-(M|rA)) 

d 

G1 

))rV2C„,[(ai|/)A)  +  a„(aA-|/>v)] 

f 

))rV2/2»i((ar|M)  +  a^aA|A>r)) 

t 

Of 

)j"V2a„[D„i(fif|.vA)  +  (^.ifMIrA)) 

1 

ijc'fllDUarUk)  +  C„,(aiirA)I 

f 

"0! 

+  C™[(Ap|.s/))  +  (2  -  n)bj,bq\sq)]] 

if 

+  D„„[(bp\rn)  +  (2  -  n)W*^|r</)]| 

>t 

aJ,A„„r,  +  C''„„((ap|jp)  +  (2  -  n)b„{aq\iq)]\ 

)f 

+  6„„Hap\rp)  +  (2  -  n)bdaq\rq)]\ 

J4 

oa 

-«mi[(r/)|i</)  +  (rq\!>p)] 

0;; 

-b„i[(ap\hq)  +  (aq\hp)] 

nf'ik 

^ah 

riTE„„[f'',  +  Kshkl)] 

f 

rtTF„„\fr  +  2(rA|AA)l 

« 

nf;t\ 

^ck 

+  (MIAA)  -  (bhkm 

1 

vTa„G„„lfl  +  (aA  |AA-)  -  (a)f|Af)J 

J 

r'lk 

^ck 

/I„,(2(/>i|AA)  -  {AiA-|.rf)l 

if 

am-4mi[2(/)r|AA)  -  (MlrJ)) 

•  ( 

a„/4„,[2(a.r|AA)  -  (aA|j^)) 

Jf 

.4„i[2(ar|Aj)  -  (aA|rA)l 

J4 

*  The  label  k  designates  either  potq  open  shell  orbitals.  We  also  use  a  convention  that  q  when  k 
=  p  and  H  =  p  when  k  =  q.  The  phase  factors  and  r/"  arc  defined  as  follows:  -  I.  (f>“  =  (2  -  m): 
Vr  =  1.  b;  =  We  also  define  Kf,  =  (-1)*“**'  Vl  + 

•’  When  c  =  a  Symmetric  to  (a,  h). 

'  When  t  -  r.  Symmetric  to  (r,  s). 

When  c  =  a.  Symmetric  to  {a,  h)  when  ni  =  I  and  antisymmetric  to  (a,  h)  when  m  =  3. 

'  t  =  r  =  s. 

'l  =  r  <  s. 

*1  =  s>  r. 

When  /  =  r.  Symmetric  to  (r,  s)  when  m  =  1  and  antisymmetric  to  (r,  s)  when  m  =  3. 

'c  =  a  <b. 

‘c  =  h>  a. 
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Example:  BeH 

We  have  performed  a  number  of  test  calculations  for  various  systems  of  simple 
open  shell  or  open  shell  singlet  types,  using  both  semiempirical  (namely,  ppp)  and 
ab  initio  model  Hamiltonians.  We  shall  illustrate  an  application  of  the  above  outlined 
open  shell  L-CCSD  method  on  the  ^2  state  of  the  STO-3G  minimum  basis  set  (MBS) 
model  of  the  BeH  radical.  We  employed  the  ROHF  reference  and  used  the  resulting 
SCF  orbitals  as  the  MO  basis  in  our  cc  calculations.  The  advantage  of  this  model  is 
the  ease  with  which  we  can  obtain  the  full  ci  (FCi)  results,  representing  the  exact 
solution  for  our  model.  Both  the  rohf  and  the  FCI  results,  as  well  as  the  one-  and 
two-electron  integrals  required  in  L-CCSD  calculations,  were  obtained  with  the  GA- 
MESS  system  of  programs  [28]. 

The  ^2  potential  energy  curves  obtained  with  the  rohf,  l-ccsd,  and  FCi  methods 
are  shown  in  Figure  1.  In  the  vicinity  of  the  equilibrium  bond  length  Re,  the 
L-CCSD  energies  compare  well  with  the  FCi  results.  At  the  equilibrium  geometry. 
Re  =  2.538  a.u.,  L-CCSD  yields  100.10%  of  the  correlation  energy  (-0.022595  a.u. 
vs.  -0.022571  a.u.  obtained  with  FCI).  Similarly  as  in  the  closed  shell  case,  the 
L-CCSD  slightly  overestimates  the  correlation  energy  [27]. 


Figure  I .  Potential  energy  curve  for  the  MBS  model  of  the  BeH  radical  obtained  with 
ROHF.  L-ccstJ.  and  FCI  methods.  Note  that  two  ROHF  solutions  exist  in  the  vicinity  of 
4  a.u.  (sec  the  text  for  details)  and  that  the  Fci  curve  represents  the  exact  solution  for  this 

model. 
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As  we  approach  the  dissociation  limit,  the  l-ccsd  approximation  suffers  a  singular 
behavior  at  about  4  a.u.,  similarly  as  observed  in  quasidegenerate  closed  shell  sit¬ 
uations  [27,29,30].  In  fact,  this  region  corresponds  to  a  singular  behavior  of  the 
ROHF  ground  state:  Between  approximately  3.5  ?.nd  4.5  a.u.,  we  can  obtain  two 
distinct  rohf  solutions  whose  energies  cross  at  about  4  a.u.  ( see  Fig.  1 )  [  3 1  ] .  The 
ROHF  solution  that  can  be  continued  when  proceeding  from  the  equilibrium  ge¬ 
ometry  towards  the  dissociation  limit  corresponds  to  a  mixture  of  covalent  and 
ionic  structures,  similarly  as  the  rhf  solution  for  the  H2  molecule.  The  solution 
that  one  obtains  for  large  intemuclear  separations  corresponds  then  to  the  proper 
dissociation  limit,  the  closed  shell  part  describing  essentially  the  Be  atom  and  the 
open  shell  orbital  being  almost  exclusively  localized  on  the  H  atom  [31].  It  is 
worthwhile  to  note  that  this  difficult  “transition”  region  is  the  source  of  various 
difficulties  even  when  one  employs  the  vb-  rather  than  MO-based  approaches  [31]. 

Although  this  singular  behavior  of  the  L-CCSD  approach  is  overcome  when  we 
include  at  least  quadratic  terms  in  the  CCSD  equations  [32],  a  more  thorough 
investigation  of  the  character  of  this  singular  behavior  would  be  desirable.  We  note 
that  the  l-ccsd  potential  energy  curve  was  obtained  by  employing  the  lowest  energy 
ROHF  solution  throughout.  In  other  words,  the  rohf  reference  is  switched  at  ~4 
a.u.,  where  the  crossing  occurs  (cf.  Fig.  1 ).  When  we  employ  the  ROHF  solution 
coi’tinued  from  the  bonded  region,  much  more  complex  singularity  pattern  of  the 
L-CCSD  energy  arises. 

It  is  of  interest  to  note  that  the  A  =  [  matrix  of  size  extensivity  corrections 
is  much  sparser  than  the  CISD  matrix  H  =  In  fact,  if  we  only  consider  the 
most  important  and  simplest  diagonal  terms,  A„  =  £0  (which,  in  fact,  would  vanish 
if  we  employ  a  properly  defined  normal  product  form  of  the  Hamiltonian,  Hn  = 
H  -  Eo  [2,33])  and  neglect  all  the  remaining  offi-diagonal  terms,  the  resulting 
change  in  the  energy,  at  least  in  the  neighborhood  of  the  equilibrium  geometry,  is 
very  small.  Thus,  using  a  simplified  version  of  the  l-ccsd  theory  based  on  the  CiSD 
matrix,  namely,  defining 

a,  =  H.o,  b„  =  H,j  -  Eod,j ,  (27) 

will  yield  very  close  result  to  the  exact  l-CCSD.  This  is  reminiscent  of  a  procedure 
advocated  by  Hoffmann  and  Simons  (cf,  Eq.  (2.5a)  or  [34]),  assuming  that  we 
employ  a  multiconfigurational  reference,  such  as  given  by  a  solution  of  the  MC  SCF 
equations  or  by  a  suitable  linear  combination  of  Gel’fand  states.  Clearly,  such  a 
method  will  not  be  exactly  size-extensive;  however,  minor  violations  of  size  exten¬ 
sivity  may  be  outbalanced  by  the  simplicity  of  such  an  approach. 

Discussion 

In  this  paper  we  attempted  to  explore  the  possibilities  offered  by  SR,  but  not 
necessarily  single  determinantal,  CC  approaches.  We  have  also  seen  that  when  aiming 
for  a  spin-adapted  version  of  the  theory,  the  uga-  or  CAUGA-based  formalism  can 
be  very  useful.  Although  the  two  simplest  open  shell  cases  presented  here,  namely, 
a  single  electron  outside  a  closed  shell  and  a  singlet  excited  state  of  closed  shell 


COUPLED  CLUSTER  FORMALISM 


283 


systems,  can  be  handled  directly,  without  employing  special  computer  codes,  the 
complexity  of  the  formalism  rapidly  increases  with  the  number  of  open  shells.  For 
this  reason,  a  special  program  (in  FORTRAN )  was  written  that  is  capable  of  han¬ 
dling  more  complex  open  shell  systems.  A  similar  program  was  also  designed  to 
produce  the  nonlinear  form  of  cc  equations  and  a  required  code.  These  programs 
were  tested  on  die  above  presented  special  cases  and  are  further  being  simplified 
in  order  to  achieve  greater  efficiency.  The  description  of  these  codes  and  their 
capabilities  will  be  given  elsewhere. 

Concerning  the  linear  version  of  the  cc  theory  considered  here,  we  must  stress 
again  its  limitations  when  applied  to  general  situations  that  do  not  correspond  to 
a  bonding  region  characterizing  equilibrium  geometries.  Nonetheless,  even  the 
L-CCSD  results  can  be  very  useful  if  properly  employed.  It  is  certainly  very  dangerous 
to  make  any  conclusions  when  considering  only  one  fixed  nuclear  conformation. 
However,  when  used  over  a  wide  range  of  geometries,  the  range  of  its  validity  is 
easily  established.  In  fact,  this  region  of  applicability  is  already  implied  by  the 
character  of  one-electron  MOs  and  of  the  reference  configuration  employed,  even 
though  for  more  general  open  shell  cases  this  problem  should  be  investigated  in 
greater  detail. 

Let  us,  finally,  address  one  additional  aspect  of  the  formalism  described  in  this 
paper.  We  recall  that  in  order  to  classify  and  label  the  excitation  operators  G,  or, 
equivalently,  the  corresponding  excited  states  |  $,  >  =  G,  1 4>o)  that  span  the  excited 
state  manifold  defining  the  level  of  the  cc  approximation  employed,  we  rely,  gen¬ 
erally,  on  some  suitable  chain  of  subgroups.  In  this  paper  we  have  defined  our 
excitation  operators  in  such  a  way  that  they  produce  Gel’fand-Tsetlin  states,  which 
are  adapted  to  the  well-known  canonical  chain 

C/(n)D  G(«- 1)D  •  •  •  D  C/(l).  (28) 

As  already  mentioned,  the  advantages  of  this  choice  are  (i)  the  possibility  to  exploit 
UGA  formalism  and,  at  least  for  the  linear  level  of  theory,  ( ii )  to  rely  on  the  existing 
Cl  codes  and  algorithms. 

Recalling  the  orthogonally  spin-adapted  formalism  for  the  closed  shell  case  [33], 
we  see  that  the  coupling  scheme  implied  by  the  canonical  chain  (28),  or  by  the 
Gel’fand-Tsetlin  states,  leads  to  a  rather  awkward  symmetry  properties  with  respect 
to  permutation  of  hole  or  particle  labels,  as  exemplified  by  Eq.  (23).  It  is  thus 
preferable  to  exploit  the  following  subgroup  chain  [’’6,35] 

G(n)  D  U(n^„)  X  G(/2vinuu)  X  c(«,ct.ve) ,  (29) 

which  is  presently  being  explored.  Since  in  any  practically  viable  scheme  we  have 
to  restrict  ourselves  to  single  and  double  excitations,  the  resulting  configurations 
will  involve  at  most  two  singly  occupied  orbitals  in  the  core  and  at  most  two  singly 
occupied  virtuals.  It  is  then  convenient  to  couple  these  orbitals  into  a  symmetric 
and  an  antisymmetric  pairs  (cf.  pp-hh  coupling  used  in  the  closed  shell  case  [33]). 
Clearly,  this  cannot  be  done  when  relying  on  the  canonical  chain  (28),  as  seen  by 
the  examples  given  above,  e.g.,  Eq.  (23).  Using  such  a  coupling  scheme  leads  to 
CC  equations  having  much  simpler  symmetry  properties  than  when  exploiting  the 
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canonical  chain  (28).  The  details  of  these  developments  will  be  presented  else¬ 
where  [26,35]. 
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Abstract 

A  recently  developed  Full  Configuration  Interaction  method  is  discussed  and  used  to  compute  wave 
functions  for  N2  and  Hg  on  an  IBM  Rise  6000  workstation  As  a  further  application  of  the  formalism  a 
cluster  and  natural  orbital  analysis  of  the  wave  functions  is  performed  for  the  Nj  molecule  at  various 
intemuclear  distances  and  for  the  Hg  model.  ©  1993  John  Wiley  &  Sons.  Inc. 

Introduction 

The  Full  Configuration  Interaction  ( Full  ci  or  FCI )  method  is  the  simplest  among 
quantum  chemical  methods  and  has  the  property  of  giving  the  best  possible  wave 
function  once  a  one-particle  basis  set  is  given.  It  is  not  a  practical  tool  for  studying 
even  moderate  size  molecules,  because  of  the  factorial  growth  of  the  dimensions 
of  the  Cl  space  with  the  number  N  of  orbitals  and  the  number  n  of  electrons. 
However  it  is  extremely  important  to  assess  the  validity  of  approximate  methods 
[  1  ] ,  and  also  to  get  a  general  insight  into  the  structure  of  the  wave  function  in  small 
molecules  and  in  model  systems.  There  are  two  approaches  to  Full  Cl.  The  first 
one  uses  Slater  determinants  as  basis  functions  [2-10].  in  the  second  spin  adapted 
configuration  state  functions  [11,12].  In  the  second  approach  the  dimensions  of 
the  CI  space  are  smaller,  but  the  computer  code  is  more  complex  and  difficult  to 
vectorize.  The  first  approach  seems  to  allow  for  better  efficiency,  in  spite  of  the 
longer  expansions  and  the  lack  of  complete  spin  adaptation.  We  have  recently 
developed  a  ci  method  belonging  to  the  first  class  [13],  which  has  been  used  to 
compute  wave  functions  with  up  to  73  million  components  [14].  In  this  article  we 
pre.sent  medium  size  computations  on  an  IBM  RISC  6000  workstation  and  perform 
a  cluster  and  natural  orbital  analysis  of  the  full  Cl  wave  functions  of  Ni  and  Hg . 
In  order  to  test  the  performance  of  the  method  on  a  vector  supercomputer,  a  few 
tests  on  CRAY  machines  are  also  presented. 

Description  of  the  CI  Method 

Although  the  method  is  described  in  detail  in  [  1 3  ] .  we  briefly  review  here  its 
essential  features.  According  to  an  idea  which  goes  back  to  N.  C.  Handy  [2],  a 
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Slater  determinant  is  represented  by  a  product  of  two  strings  and  S2  specifying 
the  alpha  and  beta  occupation  numbers.  If  we  write  down  an  aj  for  each  1  in 
position  /■  of  the  string,  we  have  an  ordered  product  of  creation  operators  defining 
the  Slater  determinant  completely,  including  its  phase.  If  we  order  the  strings  in 
some  way,  e.g.,  in  lexical  order,  the  ci  vector  can  be  addressed  as  a  two-dimensional 
array  A"  (5i,  52)-  As  usual  in  direct  Cl  methods,  we  try  to  compute  an  eigenvector 
of  a  huge  Hamiltonian  matrix  H  by  an  iterative  procedure  in  which,  at  each  step, 
we  have  to  compute  the  vector  Y  =  HX  directly  from  the  two  electron  integrals 
and  the  vector  X  from  the  previous  cycle.  Our  approach  is  very  simple.  From  the 
second  quantized  form  of  the  Hamiltonian,  one  can  see  that  His  a  sum  of  elementary 
operators  like: 


{i\j)a]^a,^ 

(1) 

{ij\kl)a]^a]^a,^i^ 

(2) 

{ij\kl)a]^a]MiMk^ 

(3) 

1/  =  a  or  V 

Equations  ( 1 )  and  (2)  induce  a  mapping  between  strings  of  the  same  spin,  while 
eq.  (3 )  induces  a  product  mapping  involving  both  alpha  and  beta  strings.  The  idea 
is  to  construct,  for  each  operator,  two  lists  of  strings  mapped  one  into  another 
element  by  element  under  the  action  of  the  operator.  Consider  as  an  example 
a,  at,  which  appears  in  eq.  ( 1 ) .  We  generate  all  strings  in  lexical  order,  incrementing 
a  counter  c  at  each  step.  As  soon  as  a  string  is  constructed,  we  examine  it.  If  we 
find  that  orbital  /  is  empty  and  orbital  /  is  populated,  we  add  the  string  (or,  better, 
the  value  of  the  counter  c)  to  the  list  labeled  by  orbital  indexes  /  and  /,  in  order. 
At  the  end  of  the  process  we  have  a  list  of  all  strings  having  orbital  /  empty  and 
orbital  /  populated.  This  list  will  be  put  in  an  array  OV(m;  i,  1)  where  m  labels 
the  string,  i  the  occupied,  and  /  the  empty  orbital  (the  name  OV  comes  from  the 
words  occupied  and  virtual).  The  numbers  of  strings  accumulated  in  the  OF  lists 
are  also  computed  and  stored  in  an  array  NOV(i,  /).  Now,  as  shown  in  [13],  one 
can  prove  that  the  operator  aju/ sends  the  string  OV(m;  I,  i)  in  the  string  OF(w, 
/,  /)  multiplied  by  a  sign  factor  or  phase.  The  proof  relies  upon  lexical  order  and 
extends  to  operators  of  type  (2).  These  operators  will  use  lists  of  strings  having  two 
orbitals,  and  /,  populated  and  empty;  we  call  these  lists  OOFF(m;  /,  k,j,  i). 
In  order  to  realize  the  correspondence  induced  by  type  ( 3 )  operators,  we  use  the 
OV  lists  twice.  Since  the  lists  actually  contain  addresses  or  pointers,  the  mapping 
in  determinant  space  is  realized  as  follows: 

y(OF(m;  /,  /),  /)  *-  X(OV(m\ /,  /),  I)* phase* integral  (4) 
for  type  ( 1 )  operators,  or: 

Y{OOVV{m\j,  /,  k,  /),  I)  •*-  X(OOVV{m;  k,  l,j,  /),  I)* phase* integral  (5) 
Y(OV(m\  /,  k),  OV{n  J,  /))  •«-  X(,OV(m;  k,  /).  OV(n;  hj))* phase* integral 

(6) 


ANALYSIS  OF  FULL  ci  WAVE  FUNCTION 


289 


for  type  (2)  and  (3)  operators,  respectively.  The  phase  factors  can  be  stored  as 
separate  arrays,  or  can  be  incorporated  into  the  addresses.  These  phases  are  actually 
the  nonzero  values  of  coupling  coefficients  [  5  ] . 

Our  method  is  essentially  integral  driven  and  the  strategy  is  to  scan  the  two 
electron  integral  supermatrix,  to  pick  up  for  each  ( ik \jl  J  all  the  corresponding  X 
values,  and  accumulate  the  product  in  the  proper  location  of  Y  array.  In  this  way. 
integrals  which  are  zero  by  symmetry  or  for  some  other  reason  are  easily  skipped. 
We  will  consider  the  implementation  of  the  two  electron  loops  only.  In  its  simplest 
implementation,  we  keep  in  memory  X,  T,  the  lists,  and  the  two  electron  integrals. 
The  dimensions  of  X,  Y  grow  very  fast  with  the  number  IS  of  orbitals  and  the 
number  n  of  electrons;  the  length  of  the  lists  also  grows  fast,  but  less  rapidly  than 
the  FCi  vector.  It  becomes  something  like  10%  or  so  already  for  vectors  of  dimensions 
1 0’  [  1 3  ] .  As  a  second  step,  in  order  to  treat  larger  problems,  we  can  exploit  symmetry 
and  divide  the  strings  in  blocks.  In  such  a  case,  we  collect  all  strings  of  a  given 
symmetry  in  the  same  block  and  order  them  lexically  within  each  block.  A  third 
step  consists  in  dividing  X  and  Y  arrays  in  arbitrary  blocks  simply  according  to  the 
length  of  the  array,  according  to  how  much  memory  is  available  in  the  computer. 
This  extension  can  be  coupled  to  the  symmetry  blocking  and  requires  no  essential 
modification  of  the  lists.  We  will  not  examine  this  possibility  here. 

Consider  the  structure  of  the  two  main  loops  /3/3  and  a/3,  originating  from  operators 
given  in  eqs.  (2)  and  (3),  respectively.  The  simplest  one  is  the  /3/3,  which,  in  the 
case  of  no  symmetry,  reads: 

i3/3LOOP 

loop  over  orbitals  i,j,  k,  I 
V  =  {ik\Jl)  -  {il\jk) 

loop  m  =  I,  length  of  list  OOVV{i,j,  k,  /) 

I  =  OOVV{m,k,  lj,i) 

S  =  V* phase 

J=  OOVV(m  J,i,  k,l) 

loop  K  =  I,  number  of  a  strings. 

Y(K,J)=  Y{K,J)  +  X(K,I)*S 
endloop  K 
endloop  m 
endloop  i,j,  k,  I 

The  inner  loop  vectorizes  at  unit  stride;  its  length  is  the  a  dimension  of  X,  Y 
arrays;  it  can  be  implemented  using  BLAS  routines  SAXPY  or  DAXPY.  The  char¬ 
acteristic  feature  of  our  loop  is  the  use  of  the  two-electron  four  index  list  OOVV. 
The  overall  operation  count  is  essentially  the  same  as  other  schemes,  like  Olsen’s 
[6]  or  Zarrabian’s  [7,8];  however  the  structure  is  simpler  and  the  intermediate 
states  are  completely  bypassed.  The  structure  of  the  aa  loop  is  similar.  In  this  case 
the  inner  loop  would  vectorize  at  nonunit,  although  constant,  stride.  One  can  per¬ 
form  a  transposition  of  the  X,  Y  arrays  before  the  loop  in  order  to  have  stride  1 , 
which  gives  better  vector  performance.  In  the  case  S';  =  0,  as  pointed  out  by  Olsen 
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[6],  the  contribution  of  the  aa  loop  is  simply  given  by  that  of  times  {— 1)^, 
where  S  is  the  total  spin.  This  means  that  we  can  halve  the  number  of  operations 
and  force  a  definite  spin  reversal  symmetry  from  the  outset. 

Consider  now  the  case  when  we  group  in  the  same  block  all  strings  of  a  given 
(abelian)  symmetry.  The  strings  are  in  lexical  order  within  the  block  and  we  have 
a  list  for  each  operator  and  symmetry  block.  The  addresses  stored  in  the  lists  are 
relative  to  the  blocks.  Moreover,  any  operator  has  a  definite  symmetry,  so  strings 
of  a  given  block  are  sent  to  strings  of  another  block  of  definite  symmetry.  The 
structure  of  /3/3  loop  is  now  embedded  in  an  external  loop  over  symmetry  indices 
of  Y  and  X  blocks;  at  each  step  we  read  block  X  of  symmetries  a:„,  and  compute 
its  contribution  to  block  Y  of  symmetries  y„,  yg;  once  a  block  Y  is  completed,  we 
write  it  on  disk.  Furthermore,  the  four  index  lists  need  to  be  computed  only  for 
those  quadruples  of  indices  which  are  totally  symmetrical.  Therefore  the  string 
symmetry  is  not  changed  and  this  loop  will  be  done  only  in  the  “diagonal  case” 
when  Xa  =  ya,  Xg  =  yg.  The  dimensions  of  blocks  tend  to  be  quite  balanced,  even 
when  the  orbitals  are  quite  unevenly  distributed  among  the  various  symmetries. 

As  far  as  the  afi  loop  is  concerned,  one  should  remind  it  is  the  most  time- 
consuming  part  of  the  algorithm,  due  to  an  higher  operation  count  and  to  its  intrinsic 
complexity.  The  mapping  induced  by  al^jgaigOka  is  a  product  of  an  a  and  a  ^ 
mapping  of  the  type  described  for  the  j3/3  loop.  The  lists  used  are  the  shorter  one- 
electron  two-index  lists.  In  this  case  the  inner  loop  will  not  run  over  all  strings  in 
a  block,  but  on  those  contained  in  the  one-electron  list:  this  means  that  components 
of  X,  Y  arrays  are  not  scanned  one  by  one  in  sequence,  but  there  are  in  general 
random  jumps,  although  some  structure  is  preserved.  In  order  to  achieve  efficient 
vectorization,  one  must  use  gather-scatter  operations  as  pointed  out  by  [6).  In  our 
scheme  we  use  two  auxiliary  arrays  of  dimensions  that  can  accomodate  as  many 
rows  of  X  and  Y  array  as  there  are  elements  in  an  OK  list.  The  loop  structure  is 
given  in  the  following  (G  denotes  the  group  multiplication,  and  we  use  the  index 
of  an  orbital  to  indicate  also  its  symmetry): 

adLCK)P 

loop  over  block  indexes  Xg,  yg 
Xa  =  Symmetry  of  Ci  vector  0  , 

y„  =  Symmetry  of  Cl  vector  0  , 

loop  over  /,  k  orbitals  (a  spin) 

skip  when  {XaO  iO  kO  y^)  not  totally  symmetrical, 
gather  rows  of  X  array  in  a  scratch  XS  multiplied  by  the  a  phase;  the  rows 
correspond  to  the  entries  in  OV{k,  i\  jr„)  (vectorized  at  constant  stride), 
define  NLEN  =  number  of  rows  transferred. 

loop  over  j,  I  orbitals  (/3  spin). 

skip  when  (xgOjO  10  yg)  not  totally  symmetric. 

loop  m  =  I,  length  of  OV{l,j;  Xg) 

S  =  phase*{ik\jl) 
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/  =  l,j\  Xff)\  J  =  OV(m;j,  I,  y^) 

loop  a:  =  l,NLEN 

YS{K,  J)  =  YS{K,  J)  +  X{K,  I)*S 
endloop  K 
endloop  m 
endloop  j,  I 

scatter  NLEN  rows  from  YS  array  to  Y  array;  the  rows  correspond  to  the 
entries  OV{i,  k\ x„)  ( vectorized  at  constant  stride). 

endloop  /,  k 
endloop  Xg, 

Lists  of  all  symmetries  are  needed,  i.e.,  no  reduction  of  list  length  takes  place. 
The  length  of  the  inner  loop  NLEN  affects  the  efficiency  on  vector  machines.  We 
group  together  two  columns  of  X  and  Y,  in  order  to  have  maximum  NLEN,  i.e., 
the  columns  corresponding  to  the  terms  a]a^jaiaK  and  ajala,aj  of  the  Hamil¬ 
tonian. 

In  order  to  assess  the  effectiveness  of  the  method  we  present  here  some  com¬ 
putations  carried  out  on  a  workstation  IBM  Rise  6000,  plus  a  few  tests  on  CRAY 
Y-MP  and  C90.  The  symmetry  adapted  version  of  the  program  was  used.  In  Table 
I  we  report  energy  values  computed  for  water  with  DZ  basis  [3]  at  equilibrium 
geometry.  It  is  a  small  computation  performed  in  order  to  test  the  program  and  to 
illustrate  all  the  possibilities  offered  by  spatial  and  spin  reversal  symmetry.  A  pre¬ 
liminary  SCF  computation  on  the  ground  state  of  water  was  performed  by  using 
the  program  CADPAC  [  1 5  ] ;  the  MO’s  obtained  were  used  to  expand  the  FCi  wave- 
functions  for  all  cases  reported  in  the  table. 

A  larger  and  more  interesting  test  case  is  given  by  the  Ne  atom  in  a  I5s3p2d] 
AO  basis  and  with  eight  active  electrons,  because  it  can  be  compared  with  earlier 


Table  I.  fci  energies  (a.u.)  of  water  with  dz  basis  [3]  with  C:,. 
geometry.  Eight  different  states  are  refjorted  (the  lowest  states  for 
each  symmetry).  For  the  four  different  spatial  symmetnes  both  singlets 
(spin  rev.  =  + 1 )  and  triplets  (spin  rev.  =  - 1 )  are  considered. 


Symmetry 

Spin  rev. 

^toi 

+  1 

-76.157866 

1,002,708 

-I 

-75.797148 

Ai 

+  1 

-75.761016 

1,017,744 

-1 

-75.779894 

B, 

+  1 

-75.670399 

1,001,536 

-I 

-75.721598 

ft 

+  I 

-75.742374 

1.002,016 

-1 

-75.867480 
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calculations  by  Bauschlicher  et  al.  [16]  and  by  Olsen  et  al.  [10].  The  basis  set  used 
in  our  computation  is  slightly  different  from  that  of  Refs.  [16]  and  [10],  because 
we  used  6-component  cf-functions  and  discarded  the  two  highest  scf  orbitals  of  i- 
symmetry,  in  order  to  have  exactly  the  same  number  of  orbitals  as  a  [Sslpld] 
basis  with  5-component  d  orbitals.  The  only  difference  between  our  basis  and  that 
of  Ref.  [16]  and  [10]  is  in  the  radial  part  of  the  s  functions,  not  in  their  number. 
For  this  reason  we  get  a  correlation  energy  slightly  different  from  that  reported  in 
Ref.  [16],  i.e.  -0.245093  hartrees  instead  of -0.244864.  Various  timings  are  re¬ 
ported  in  Table  II.  The  values  reported  for  the  ibm  Rise  were  obtained  by  using 
the  function  MCLOCKf)  and  include  the  contribution  of  I/O,  which  amounts  to 
about  10%.  It  is  remarkable  that  on  a  workstation  we  have  essentially  the  same 
time  per  iteration  as  on  a  CRAY  2.  An  all-in-core  version  of  our  program  was  run 
also  a  CRAY  Y-MP  and  on  a  CRAY  C90,  giving  the  times  reported  which  of  course 
do  not  include  any  I/O.  These  results  demonstrate  that  medium  size  full  Cl  com¬ 
putations  can  be  efficiently  carried  out  on  a  workstation. 

Encouraged  by  this  result,  we  decided  to  consider  a  larger  case,  i.e.,  N2  in  a  split- 
valence  or  VDZ  basis.  Using  18  active  orbitals  and  14  correlated  electrons,  the 
dimension  of  FCi  space  is  126,608,256  determinants,  distributed  in  8  blocks  of  16 
millions  each  in  D2h  symmetry.  The  iterative  process  requires  a  large  amount  of 
I/O  (about  9.5  Gbytes  per  iteration),  and  therefore  the  use  of  asyncronous 
I/O  routines  (BUFFER  IN/OUT)  is  recommended.  The  efficiency  of  the  paral¬ 
lelization  of  our  code  can  be  judged  from  the  following  data.  A  preliminary  cal¬ 
culation  performed  on  CRAY  Y-MP  (4  epu,  64  MW  core  memory)  gave  a  total 
epu  time  of  about  1 125  s  per  iteration  using  the  4  cpu.  with  an  elapsed  time  of  910 
s.  By  running  the  same  calculation  on  a  single  cpu,  we  obtained  a  cpu  time  of  1060 
s  per  iteration,  and  an  elapsed  time  of  1675  s.  Therefore,  as  far  as  cpu  time  is 
concerned,  the  parallelization  efficiency  is  94%,  the  contribution  of  I/O  being  es¬ 
sentially  the  same  in  both  cases  (about  600  s). 

This  problem  was  however  too  large  for  our  RISC  workstation.  Therefore  we 
discarded  the  two  highest  MO’s  of  a  symmetry  and  got  a  dimension  of  16,631,136 
determinants,  partitioned  in  eight  nearly  equal  blocks  in  £>2/,.  In  Table  III  we  sum¬ 
marize  timing  data  on  the  RISC  6000  for  this  and  other  smaller  cases,  measured  by 


Table  II.  Comparison  of  iteration  time  (m  seconds  and  seconds  per  million  determinants)  for  Ne  'S, 
{5s3p2d)  basis  set,  eight  .1  ve  electrons.  G  is  the  symmetry  group  used  for  the  calculation.  the 
number  of  determinants  ol  the  fci  vector  using  this  group,  and  T„  the  iteration  cpu  time.  T,,  is  given  in 
seconds  and  TJNia  in  seconds  per  million  determinants. 


Reference 

G 

A'*, 

Computer 

T„ 

TJN^ 

Bauschlicher  et  al.  [16] 

9.805,897 

CRAY  2 

1368 

139.5 

Olsen  et  al.  [10) 

D,, 

5.502.533 

IBM  3090/vf 

360 

65.4 

Present  work 

9.805.897 

IBM  RISC  6000/950 

1482 

151.1 

Dv, 

9.805.897 

CRAY  Y-MP 

85 

8.7 

D:* 

9.805,897 

CRAY  C90 

45 

4.6 
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Table  III.  Timing  data  (in  seconds  and  seconds  per  million  determinants)  on  ibm  Rise  6(XX)/950  for 

different  systems. 


System: 

HjO 

Ne 

Nj 

H, 

State: 

'A, 

'S 

x'z; 

Basis  set: 

DZ 

I5s3p2d] 

VDZ 

DZ 

Active  orbitals' 

14 

23 

16 

16 

Correlated  electrons: 

10 

8 

14 

8 

Symmetry  group: 

Q,. 

Dik 

D:* 

Dv, 

Symmetry  blocks: 

4 

8 

8 

4 

Number  of  dets  (A(j«) 

1.002,708 

9.805,897 

16.361,136 

828,688 

Iteration  time  (T„): 

126 

1480 

1810 

90 

126.0 

151.2 

110.6 

108.4 

the  AIX  function  MCLOCKO.  They  are  average  values  including  the  contributions 
of  all  CPU  time  used  during  an  iteration  {HX,  I/O,  normalization.  •  •  •)  in  the 
presence  of  another  concurrent  job.  The  CPU  time  per  million  determinants  is 
frequently  reported  as  an  indicator  of  the  efficiency  of  the  program;  in  our  case  it 
is  in  the  range  100-150  s/million  determinants.  The  time  values  reported  for  N2 
are  lower  than  those  given  in  Ref.  [13]  for  a  similar  computation,  due  to  the  use 
of  the  BLAS  routine  DAXPY. 

Analysis  of  the  Wavefunction 

Natural  orbital  analysis  [17]  is  a  well-known  procedure  to  get  some  insight  into 
the  structure  of  the  wave  function.  The  one-electron  density-matrix  (one-matrix) 
p  in  the  MO  basis  is  easily  computed  from  the  ci  vector  and  the  one  electron  lists 
OV.  The  ij  element  of  the  spinless  one-matrix  is  simply  given  by: 

P,=  Z  2  X[OV{mJ,j).K]X[OV(mJ,i),K] 

A,‘«l,Asinngs 

+  XIK,  OV(m;  iJ)]X[K,  OV(m  J,  /)]  (7) 

when  5;  =  0  and  the  wave  function  is  totally  symmetric  in  an  abelian  group;  it  has 
no  a/3  contribution.  The  eigenvectors  of  the  one-matrix  are  the  natural  orbitals  of 
the  system,  and  the  eigenvalues  are  the  corresponding  natural  orbital  occupation 
numbers. 

An  important  underlying  structure  of  the  wave  function  is  associated  with  thc 
cluster  expansion.  This  was  discovered  long  ago  [18],  and  constitutes  the  basis  of 
the  various  coupled  cluster  methods  [19]  which  appear  to  be  powerful  and  promising 
tools  at  our  disposal  for  computing  molecular  properties  [20] .  The  cluster  expansion 
of  a  closed-shell  FCI  wave  function  can  be  written  as: 

|FCI>  =  exp(r,  +  72  +  Tj  +  •  •  OIscf)  (8) 

where  7|,  T2,  T3,  •  •  •  are  cluster  operators  defined  as 
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Tfi  —  2  •  •  •  •  iktPlt  .  •  Pk)^ti  '  *  ’  ^ifip\  '  '  '  ^Pk 

ii . uSocc  Pi . PtSvirt 

They  fulfill  the  following  equations: 

{Tk,T,]  =  0  (10) 

=  0  (11) 

where  n  is  the  number  of  electrons  and  [n/k]  is  the  ratio  rounded  to  the  nearest 
integer  ^  nik. 

A  cluster  analysis  of  the  FCi  vector  can  be  performed  along  the  following  lines 
(see  for  instance  [21]).  First  of  all,  the  matrix  elements  of  the  cluster  operators  are 
extracted  from  the  coefficients  of  single,  double,  triple,  •  •  •  excitations  of  the  FCi 
vector  in  the  following  way.  Denote  Ci,  C2,  C3,  •  •  •  the  operators  transforming 
the  SCF  vector  in  the  FCi  one: 

|Fa>  =  (l +C1  +  C2  +  C3+  --OISCF)  (12) 

in  such  a  way  that  the  matrix  elements  of  C| ,  C2,  C3  are  the  coefficients  of  single, 
double,  triple,  •  •  •  excitations,  respectively,  in  the  intermediate  normalization 
(fci|SCf)  =  1,  (scfIscf)  =  1.  The  operators  Q  are  therefore  given  by  expressions 
analogous  to  those  of  the  T^. 

^k  2  2  ^/c(ll  j  •  •  •  >  ^ki  Ply  •  •  •  y  Pk^^i\  •  •  *  Ui^Qpy  •  •  •  Qp^ 

ii . i*eocc  Pi,  . . .  .PuSvirt 

(13) 

By  equating  eqs.  ( 12)  and  (8)  we  obtain  the  chain  of  equations  [22]: 

r,  =  C„  (14) 

\t]+T2  =  C2  (15) 

and  so  on.  These  equations  are  easily  solved,  and  they  give 

ti(i',p)  =  ci{r,p),  (16) 

t2{i,j'yP,<l)  =  C2{i,j\p,q)+  1/2(1  ~  P,.j){\  -  Pp,g)ci{i;p)ci(j:Q)  (17) 

and  so  on.  Here  and  Pp^  are  permutation  operators  exchanging  the  indexes  /, 
j  and  p,  q,  respectively.  Once  the  matrix  elements  of  the  cluster  operators  are 
known,  we  can  compute  the  vector  resulting  from  the  application  of  any  polynomial 
in  Ti,  T2,  T3,  •  •  •  to  an  arbitrary  vector  of  the  FCI  space.  If  we  restrict  ourselves 
to  Ti  and  T2,  this  requires  nothing  but  the  same  algorithm  described  for  the  ap¬ 
plication  of  the  Hamiltonian  operator.  The  computer  time  required  in  this  casv  is 
much  smaller  than  a  FCi  iteration  because  the  number  of  nonzero  matrix  elements 
of  Ti  and  7’2  is  much  less  than  the  number  of  nonzero  two-electron  integrals.  This 
can  be  seen  from  the  values  of  indexes  i,j,  •  •  •  and  p,q,  •  ‘  ‘  which  do  not  span 
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Table  IV.  Computed  values  of  energies  and  related  quantities  for  the  ground  state  X'^g  of  'I.  at 
various  levels  of  approximation.  R  is  the  internuclear  distance  (/?„  =  2.12,  close  to  the  equilibrium 
distance).  Method  represents  the  level  of  approximation  (see  text);  £,ot  is  the  total  energy;  £j„„  is  the 
difference  with  the  SCF  energy,  cosscr  ts  the  cosine  between  the  approximate  vector  and  the  SCF  vector; 
coSfci  is  the  analogous  cosine  with  the  exact  FCi  vector.  All  values  are  in  atomic  units. 


R 

Method 

%Erci 

cosrei 

COSscF 

0.5 

SCF 

-104.905525 

0.000000 

0.0% 

0.988201 

1.000000 

FCI 

-104.989179 

-0.083654 

100.0% 

1.000000 

0.988201 

SD 

-104.986737 

-0.081213 

97.1% 

0.999803 

0.988396 

exp  SD 

-104.988142 

-0.082617 

98.8% 

0.999886 

0.988315 

Ro 

SCF 

-108.818276 

0.000000 

0.0% 

0.947271 

1.000000 

FCI 

-109.022844 

-0.204568 

100.0% 

1.000000 

0.947271 

SD 

-108.999695 

-0.181419 

88.7% 

0.996714 

0.950394 

exp  SD 

-109.014873 

-0.196597 

96  1% 

0.998655 

0.948893 

1.5 

SCF 

-108.518520 

0.000000 

0.0% 

0.797811 

1.000000 

FCI 

-108.897984 

-0.379464 

100.0% 

1.000000 

0.797811 

SD 

-108.791008 

-0.2''2489 

71.8% 

0.969170 

0.823185 

exp  SD 

-108.871607 

-0  353087 

93.0% 

0.993576 

0.803943 

2.0  R, 

SCF 

-108.226273 

0.000000 

0.0% 

0.442780 

1.000000 

FCI 

-108.825898 

-0.599626 

100.0% 

1.000000 

0.442780 

SD 

-108.588260 

-0  361988 

60.4% 

0.847671 

0.522335 

exp  SD 

-108.769450 

-0.543177 

90.6% 

0.970143 

0.387482 

10.0  R„ 

SCF 

-107.750890 

0.000000 

0.0% 

0.243124 

1.000000 

FCI 

-108.815249 

-1.064359 

100.0% 

1.000000 

0.243124 

SD 

-108.351849 

-0.600959 

56.5% 

0.694635 

0.35001 1 

exp  SD 

-108.672174 

-0.921285 

86  6% 

0.908307 

0.157619 

the  full  range  of  N  orbitals,  but  only  the  occupied  and  the  virtual  orbitals  in  the 
SCF  determinant,  respectively.  We  can  also  easily  compute  the  vector 

lexp  5iD)  =  expCTi  +  r2)|scF)  (18) 

because  the  exponential  series  breaks  down  to  a  polynomial  in  T i  and  Ti  thanks 
to  the  nilpotency  property  eq.  ( 1 1 ).  This  vector  can  be  compared  to  the  FCI  solution, 
and  to  the  other  vector 


|5Z)>  =  (1 +C, +  C2)1scf>  (19) 

i.e.,  with  the  FCI  solution  projected  on  the  space  of  those  determinants  which  are 
at  most  double  excitations  from  the  SCF.  We  note  that  both  |  SD\  and  |exp  SD) 
are  in  the  intermediate  nonnalization.  The  vector  (18)  contains  all  unlinked  higher 
order  excitations  coming  from  the  connected  single-  and  double-excitations  of  the 
FCI  vector. 

From  an  approximate  vector  either  \SD)  or  lexp  5/)),  we  can  compute 
th**  “variational”  upper  bound  to  the  energy: 
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Table  V.  Occupation  numbers  of  the  natural  orbitals  of  A;  at  various  intcrnuclear  distances.  The 
occupation  numbers  of  lo^and  I  natural  orbitals  differ  from  2.0000  by  less  than  5  10  '  at  all  intcrnuclear 

distances. 


z)R  =  0.5  Ro 
Symmetry 

|FOl) 

\SD) 

lexp  SD) 

1  9925 

1.9928 

1.9927 

1  9976 

1.9977 

1.9976 

1.9882 

1.98P5 

1.9884 

!xu 

1.9875 

1  9878 

1.9877 

It, 

0  0150 

0.0146 

0.0148 

3(yu 

0.0078 

0.0076 

0.0077 

4<t„ 

0.0025 

0.0024 

0.0024 

2vu 

0.0017 

0.0016 

0.0016 

40, 

0.0017 

0.0015 

0.0016 

2t, 

0.0007 

0.0007 

0.0007 

b)R  =  R„ 

Symmetry 

|FCI> 

ISO) 

lexpS'D) 

2o, 

1.9896 

1.9910 

1.9903 

2(j„ 

1.9796 

1.9815 

1.9807 

3o, 

1.9733 

1.9773 

1.9754 

1t„ 

1.9227 

1.9307 

1.9266 

0.0809 

0.0728 

0.0770 

3(Tu 

0.0234 

0.0201 

0.0219 

4<r„ 

0.0093 

0.0082 

0.0088 

2t„ 

0.0058 

0.0050 

0.0052 

4o, 

0.0041 

0.0036 

0  0038 

2*-, 

0.0010 

0.0008 

0.0008 

c)R  =  \.5Ro 

Symmetry 

|FCI> 

\SD} 

(exp  SD) 

2o, 

1.9883 

1.9921 

1.9900 

2<t„ 

1.9816 

1.9862 

1.9835 

3o, 

1.8753 

1.9217 

1.8929 

It„ 

1.6596 

1.7348 

1.6756 

>», 

0.3386 

0.265S 

0.3262 

3(t„ 

0.1249 

0.0790 

0.1083 

2t„ 

0.0083 

0.0048 

0.0048 

4it„ 

0.0066 

0.0043 

0.0054 

4o, 

0.0050 

0.0028 

0.0034 

2t, 

0  0027 

0.0015 

0.0018 

d)  =  2.0  Ro 

Symmetry 

IFCI) 

\SD} 

(exp  SD) 

2o, 

1  9941 

1.9980 

1.9960 

2(7u 

1.9916 

1.9966 

1.9945 

3o, 

1.4836 

1.7257 

1.3424 

It^ 

1  1719 

1.4185 

0.9671 

It, 

0  8211 

0.5783 

1.0320 

3<ru 

03132 

0.2738 

06585 
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Tablf  V.  (Continued) 


d)  R-  2.0  R°  (Continued) 
Symmetry 

|FC1> 

\SD) 

1  exp  SD) 

27r„ 

0.0065 

0.0033 

0.0017 

0.0057 

0.0021 

0.0021 

4(r„ 

0.0048 

0.0014 

0.0024 

2’r, 

0.0041 

0.0011 

0.0012 

e)R=  10.0  R„ 

Symmetry 

|FCI> 

\SD) 

|exp5D) 

2<r* 

1.9961 

1.9995 

1.9984 

lOu 

1.9961 

1.9995 

1.9984 

3(r, 

0.9969 

1.4174 

0.6086 

l)r„ 

0.9969 

1.4174 

0.6085 

Itt, 

0.9969 

0.5802 

1.3911 

3<Tu 

0.9969 

0.5802 

1.3911 

2ff„ 

0.0044 

0.0007 

0.0007 

0.0044 

0.0020 

0.0007 

4(;„ 

0.0044 

O.OOOV 

0.0007 

IVg 

0.0044 

0.0020 

0.0007 

=  (20) 

which  gives  information  about  the  ability  of  the  vectors  ( 1 8 )  and  ( 1 9 )  to  approximate 
the  FCI  solution.  Another  measure  of  the  same  property  is  given  by 

Ill'll  IlFCll 

i.e.,  the  cosine  of  the  angle  between  vector  I’l')  and  |FCl). 

Remember  that  our  cluster  analysis  is  only  partial;  the  importance  of  connected 
excitations  of  order  higher  than  2  can  be  obtained  only  indirectly  by  comparison 
with  the  FCI  results. 


Applications  to  N:  and  Hg 

With  this  technique  we  have  investigated  two  systems.  The  first  is  the  N2  molecule 
at  five  intern uclear  distances  and  using  the  VDZ  basis  [23]  truncated  to  16  MO’s. 
The  results  are  collected  in  Tables  IV  and  V.  In  Table  IV  we  collect  total  and 
correlation  energies  for  the  following  functions:  |scf),  |FCI),  the  projection  of 
|fci)  in  the  space  of  singles  and  doubles,  denoted  by  |  SD'),  and  the  exponential 
lexp  SD)  defined  in  eq.  ( 18).  The  cosines  of  the  angles  of  any  vector  with  iFCl) 
and  Iscf)  are  also  reported.  From  these  data  one  can  follow  the  increase  of  cor¬ 
relation  energy  with  the  internuclear  distance.  It  is  remarkable  how  the  |exp  SD) 
manages  to  recover  a  large  fraction  of  the  cori  elation  energy:  85.4%  of  FCl  at  ten 
times  the  equilibrium  internuclear  distance,  i.e..  21.2  bohr.  At  this  distance  the 
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Table  VI  Computed  values  of  energies  and  related  quantities  for  the  ground  state  'Aig  of  Hj  at  various 
levels  of  approximation  a  is  the  distortion  from  the  Z)j*  symmetry,  Method  represents  the  level  of 
approximation  (see  text),  is  the  total  energy;  is  the  difference  with  the  SCF  energy;  cosscr  ts  the 
cosine  between  the  approximate  vector  and  the  SCF  vector;  cosfci  is  the  analogous  cosine  with  the  exact 
FCI  vector.  All  values  are  in  atomic  units. 


a 

Method 

^tot 

Econ 

%Erct 

COSfci 

COSsCF 

l.00(X) 

SCF 

-4.284048 

0.000000 

0.0% 

0.936431 

1.000000 

FCI 

-4.416537 

-0.132489 

100.0% 

1.000000 

0.936431 

SD 

-4.404999 

-0.120951 

91.3% 

0.997102 

0.939153 

expSD 

-4.415626 

-0.131579 

99.3% 

0.999768 

0.936703 

O.KXX) 

SCF 

-4  167991 

0.000000 

0.0% 

0.836074 

1.000000 

FCI 

-4.315812 

-0.147821 

100.0% 

1.000000 

0.836074 

50 

-4.291160 

-0.123169 

83.3% 

0.991633 

0.843129 

exp  50 

-4.308775 

-0.140784 

95  2% 

0.996730 

0.838160 

0.0100 

SCF 

-4.149155 

0.000000 

0.0% 

0.687678 

1.000000 

FCI 

-4.308362 

-0.159206 

100.0% 

1.000000 

0  687678 

50 

-4.265770 

-0.116614 

73.2% 

0.983508 

0.699209 

exp  50 

-4.289768 

-0.140613 

88.3% 

0.991195 

0.691383 

0.0010 

SCF 

-4.147195 

0.000000 

0.0% 

0.666401 

1.000000 

FCI 

-4.308064 

-0.160869 

100.0% 

1.000000 

0.666401 

50 

-4  263109 

-0.115913 

72.1% 

0.982443 

0.678309 

exp  50 

-4.287676 

-0.140481 

83.3% 

0.990397 

0.670158 

0.0001 

SCF 

-4  146999 

0.000000 

0.0% 

0  664227 

1.000000 

FCI 

-4.308040 

-0.161041 

100.0% 

1.000000 

0.664227 

50 

-4.262846 

-0.115847 

71.9% 

0  982336 

0.676171 

exp  50 

-4.287467 

-0.140468 

87.2% 

0.990316 

0.667987 

triple  bond  is  completely  broken,  as  can  be  seen  from  the  occupation  numbers  of 
natural  orbitals  reported  in  Table  V.  However,  the  absolute  error  in  correlation 
energy  is  very  large,  i.e.,  0.155826  hartrees.  These  data  clearly  indicate  the  impor¬ 
tance  of  connected  triples,  quadruples,  and  higher  excitations  in  the  FCi  vector  at 
large  internuclear  distances  in  N2.  These  data  also  suggest  that  a  “variational” 
single  reference  SD  coupled  cluster  wave  function  is  not  adequate  to  compute  a 
quantitatively  correct  dissociation  curve  for  N2  molecule,  even  though  a  variational 
optimization  of  the  matrix  elements  of  Ti  and  T2  would  increase  the  percentage 
of  predicted  correlation  energy.  In  this  connection  is  should  be  reminded  that  the 
usual  Restricted  Hartree-Fock  Coupled  Cluster  truncated  to  Single  and  Double 
excitations  (RHF-ccsd)  gives  for  this  molecule  qualitatively  incorrect  results.  The 
potential  energy  curve  computed  by  Laidig  et  al.  shows  indeed  an  unphysical  max¬ 
imum  in  the  region  of  R  =  4  a.u.  [24],  and  seems  to  decrease  to  very  low  values 
at  large  R.  The  usual  procedure  to  compute  the  energy  in  cc  methods  uses  the 
transition  formula: 


(exp5D|//|scF) 

<expSD|SCF 


(22) 
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which  is  not  variationally  bounded;  both  |exp  SD)  and  1 SD}  give  back  to  the 
exact  FCI  enei^y  at  any  distance  when  plugged  into  eq.  (22).  Our  results  confirm 
the  usefulness  of  the  extension  of  CC  methods  to  higher  excitations  [25]  or  to 
multireference  functions  [26]. 

In  Table  V  we  report  the  natural  orbitals  occupation  numbers  of  N2  at  the 
same  five  internuclear  distances,  for  the  three  wave  functions  |FCi),  1 5£))  and 
I  exp  SD}.  The  first  four  orbitals  (two  Og  and  two  (r„  orbitals)  are  practically 
doubly  occupied  at  any  distance.  The  triple  bond  is  formed  by  the  orbitals  3og, 
Wxu,  and  liTju,  while  3(r„,  lir^g,  and  Itt,.*  are  the  corresponding  antibonding 
orbitals.  It  is  therefore  possible  to  follow  the  dissociation  of  the  triple  bond  with 
the  internuclear  distance.  At  long  distance,  at  FCi  level,  both  the  three  bonding 
orbitals  and  the  three  antibonding  orbitals  are  populated  with  a  single  electron. 
Since  at  large  distance  they  are  mainly  formed  by  p  atomic  orbitals,  this  cor¬ 
responds  to  the  correct  dissociation  of  the  molecule,  with  a  single  electron  in 
each  one  of  the  2px,  2p,.,  and  2p-  ao  of  the  two  N  atoms.  The  |  SD)  vector,  on 
the  contrary,  has  the  bonding  orbitals  more  occupied  than  the  antibonding  or¬ 
bitals.  This  is  a  consequence  of  the  incorrect  dissociation  of  the  closed  shell  SCF, 
since  double  excitations  are  unable  to  produce  two  neutral  atoms  starting  from 
the  closed-shell  reference.  Quite  remarkably,  the  occupation  numbers  corre¬ 
sponding  to  the  I  exp  SD}  vector  are  unbalanced  exactly  on  the  opposite  way, 
with  the  antibonding  orbitals  more  occupied  than  the  bonding  orbitals.  This  is 
due  to  the  absence  of  linked  higher  order  excitations,  which  would  partially 
cancel  the  products  of  diexcitations. 

Similar  computations  and  analysis  were  performed  on  the  Hg  system  (see 
Table  VI ),  originally  studied  by  Jankowski  et  al.  [27]  with  a  minimal  basis  set, 
and  later  investigated  by  Bartlett  and  co-workers  at  D2  and  DZP  level  [28].  In 
this  model  four  H2  molecules  with  fixed  internuclear  distance  (2.0  a.u.)  are 
initially  arranged  into  an  octagonal  Du  structure.  Two  H2  molecules  are  then 
symmetrically  displaced  from  their  initial  positions  by  an  amount  a,  and  the 
symmetry  is  lowered  to  D^h  ■  The  level  of  degeneracy  is  controlled  by  the  param¬ 
eter  a,  which  represents  the  distorsion  from  the  symmetric  Du  structure,  which 
corresponds  to  a  =  0.  When  a  goes  to  zero,  the  highest  occupied  and  lowest 
unoccupied  mo’s  become  quasi-degenerate,  and  two  configurations  are  dominant 
in  the  FCI  wave  function.  !n  fact,  as  we  can  see  from  Table  VI,  the  weight  of  the 
SCFdeterminant  in  the  FCI  vector,  which  is  given  by  cosier  (FCI),  is  close  to  Vi 
as  a  goes  to  zero. 

We  used  the  DZ  basis  set  of  Paldus  et  al.  [  29  ] ,  and  we  performed  FCi  calculations 
followed  by  cluster  analysis  for  five  values  of  the  parameter  a.  This  is  a  less  degenerate 
system  than  a  dissociating  N2  molecule,  and  the  difference  between  the  exact  cor¬ 
relation  energy  and  that  associated  with  lexp  SD)  is  smaller.  However  when  we 
examine  the  percent  of  correlation  energy  recovered  by  |exp  5D),  we  find  for  small 
a  a  value  very  similar  to  the  corresponding  percent  for  N2  at  /?  =  10J?o-  This 
indicates  that  even  in  this  simple  case  connected  triple  and  quadruple  excitations 
are  important,  as  already  pointed  out  by  Bartlett  and  co-workers  in  the  case  of 
single  reference  CC  approach  for  this  system. 
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Abstract 


This  article  gives  an  unproved  derivation  for  t'(«i  +  n^D  VinDy.  L'{  Wj).  D  5,v,  X  5a,,  and 
other  transformation  coefficients  by  using  a  senes  of  elementary  Racah  transformations  These  trans¬ 
formation  coefficients  can  be  expressed  generally  in  terms  of  a  product  of  segments  containing  6j  symbols 
and  easily  calculated.  Possible  segment  factors  are  given  in  Table  1.  @  1993  John  Wiley  &  Sons.  Inc 


Due  to  their  importance  in  applications,  the  U(nx  +  ni)  ^  t^(«i)  X  U{n2)  and 
5,\,+aj  D  Sa,  X  Sa,  transformation  coefficients  have  attracted  numerous  studies 
[1-1  ij.  A  motivation  for  these  studies  is  to  exploit  the  possibility  of  partitioning  a 
large  system  into  smaller  subsystems  in  actual  calculations.  For  this  purpose  it  is 
obviously  desirable  to  find  a  simple  and  effective  approach  of  evaluating  the  coef¬ 
ficients.  For  a  electron  system,  with  which  quantum  chemists  are  particularly  con¬ 
cerned,  the  spin  algebra  approach  based  on  the  spin  characteristic  of  electron  spin- 
adapted  functions  might  be  conceptually  the  simplest  one.  By  using  the  graphic 
technique  of  the  spin  algebra  the  author  of  this  note  in  an  earlier  publication  gave 
the  closed  formulae  for  5a,+a2  ^  -Sa,  X  5aj  and  U{ni  +  z?:)  ^  X  [/(nj) 
transformation  coefficients  [10].  These  formulae  seem  quite  easy  to  handle  and 
have  been  involved  in  the  program  of  ligand  field  calculations.  [12]. 

Althougn  the  application  of  the  spin  graph  approach  is  straightforward,  the  pro¬ 
cedure  obtaining  a  general  expression  is  often  tedious  because  a  series  of  graph 
transformations  should  be  carried  out.  In  this  note  we  shall  show  that  in  some  cases 
the  complex  procedure  of  graph  transformations  may  be  avoided. 

Let  us  take  U{n)  D  C,'(ni )  X  U{n2)  («  =  /?!  +  «:)  transformation  coefficient  as 
an  example.  Suppose  that  |  is  the  spin  function  associated  with  an 

orbital.  Its  explicit  form  depends  on  the  step  number  dr  and  can  be  written  as 


\s(dr)m(dr)') 


d  =  0 

=  1  or  2  ( I ) 


d=3 
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According  to  the  SU(2)  scheme,  and  U(n)  Gelfand  state  (so-called  canonical 
basis)  can  be  constructed  by  gradual  coupling  of  individual  orbital  spin,  i.e., 

\[\]d,  '•(!„)=  |[X]Sr  •  •  4  •  •  •  S„) 

n 

=  /  2  in  \Sr-sMr-Mdr)m{dr)) 

{mM)  (A/,)  r-2 

X{Sr-^M,.,S{dr)m(dr)\SrMr)  (2) 

where  Sr  is  the  intermediate  spin  generated  by  coupling  the  first  r  orbitals,  Mr  is 
the  corresponding  component,  and  /is  a  phase  factor.  This  coupling  scheme  can 
be  symbolized  by 

(•  •  ■md,)s{d2))S2S{dy))Si-  •  ■)S„r‘  ■)£„-, s(d„))S„  =  5  (3) 

or  illustrated  by  labeled  binary  tree  [131  (FiS-  •  )■ 

We  can  similarly  construct  the  noncanonical  basis  or  the  basis  function  symmetry- 
adapted  to  X  Uin-i)  by  coupling  L'(«i)  and  U(n2)  Gelfand  states  into  the 
state  having  total  spin  S, 

\[\\]dr"d„-,[\2]d'„,^^-’d'„) 

=  2  l[X.15r  (4) 

A/„,.w; 

The  coupling  scheme  for  the  noncanonical  basis  is 

(•  •  •{(s(d^)s(d2))S2-  •  •  •  •{5(<4,+  |)5(<//„  +  2))5'„,+2  '  *  *) 

XS'„.,s(d,,)S'„)S„  =  S  (5) 

and  the  corresponding  labeled  tree  diagram  is  shown  in  Figure  2. 

As  is  well  known  that  the  U(n)  D  f/(ni )  X  t/(n2)  transformation  coefficients  are 
contained  in  the  following  expansion. 
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j(d,)  -SR,.!) 


l\2]d'„,^^“^d'„)  (6) 

Comparing  the  coupling  schemes  we  find  that  the  transformation  from  (3)  to 
(5)  can  be  realized  by  a  series  of  similar  elementary  transformations, 

.  .  -({S„,S{d'r.r))S'r-rS{d'r))S'r^  ^  .(S„,{S{d’-,)s{d'r))S'r)Sr'  •  * 

r  =  «!  +  2,  /ii  +  3,  •  •  •  n  (7) 

Each  elementary  transformation  corresponds  a  Racah’s  recoupling  transformation, 

(Uij2)ji2j3)J  “*■  (iJi  (7273)723)-/  (8) 

and  contributes  a  6j  symbol  factor  to  the  U{n)  D  t/(ni)  X  i/(«2)  transformation 
coefficient, 


j  5,  5; 

\s{dr)  S{d'r-,)  5,-, 


(9) 


where 


and 


[Sr-^,S'r]  =  {2Sr-l+lK2S'r+  1) 


Sid'r-,) 


Sr-u 


r=ni  +  2 
r¥=  rii  +2 


(10) 


Thus  the  U(n)  D  U(n\)  X  {/(/iz)  transformation  coefficient  is  obtained  by  multi¬ 
plying  (n  -  Ml  -  1)  67  factors  together.  The  transformation  process  from  (3)  to 
(5)  is  shown  in  Figure  3. 
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5 

Figure  3.  Transformation  from  canonical  basis  to  noncanonical  basis. 


The  coefficient  can  also  be  written  into  the  product  of  segments,  i.e., 

n  S{drd'r-,b„„brb’r)  (11) 

r-ni+2 

The  values  of  segment  factors  are  given  in  fable  1. 

Another  example  is  the  transformation  between  Yamanauchi-Kotani  functions 
and  Jahn-Serber  functions.  The  former  coupling  is  represented  into 

(•••((5,52)5253)53*--5^)5  (12) 

while  the  latter 

(•  •  .((5,52)5'2(S354)5'3)5'4-  •  •(S;v-iW;v-,)SV)5  (13) 

Again  we  found  {N  -  2)/2  elementary  transformations  which  will  change  (11) 
into  (12).  The  elementary  transformation  is 


TRANSFORMATION  COEFRCIENTS 


307 


Table  1.  S[d,d‘,;  b„„  b„  b',). 


d,d’r 

S(d,d',-,  b„„  b„  b',) 

0  0 

1  1 

+b,  +  b'r+  2K-*„,  +  b,  + 

1  4{br+\)b', 

2  1 

,  .  +  ^  ~  -  br+  b',)]''^ 

1  2 

.  ...  J(bn.  -K+bfr+  2){b„.  +b.- 
’  [  4(i,+  IXft',  +  2) 

2  2 

.  .  +  b,  +  b',-^-  4K-fc,.  +  bf-i-  b',  +  2)1'^^ 

1  4(fc, +  !)/>; 

3  3 

1.  d„,  =  1 

rf„,  =  2,3 


The  coefficients  derived  above  are  only  two  examples  of  a  big  family  of  trans¬ 
formation  coefficients  between  different  angular  momentum  coupling  schemes. 
The  matrix  elements  of  {/(«)  generators  and  their  products  can  also  be  put  under 
this  family.  It  is  generally  proven  that  the  transformation  coefficient  between  a  pair 
of  coupling  schemes  is  expressible  in  terms  of  sums  over  products  of  6j  symbol 
factors  or  Racah’s  coefficients  [13].  This  conclusion  clearly  reveals  the  close  rela¬ 
tionship  between  transformation  coefficients  and  3nj  symbols.  As  an  explanation 
of  this  fact  we  point  out  that  the  overlap  part  of  a  matrix  element  (d'\e,j,ki  \  d)  = 
(d'\{E,jE„-5j,E.,  )  I  </)  is  exactly  the  first  or  second  kind  of  3nj  symbol.  It  is  easy 
to  see  that  the  exchange-type  and  direct-type  matrix  elements  correspond  to  the 
first  and  second  kind  of  3nj  symbols,  respectively.  Based  on  this  correspondence 
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we  can  prove  the  relation  between  the  two  kinds  of  matrix  elements,  which  is  of 
course  useful  for  improving  the  efficiency  of  the  Cl  program  [15]. 
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Abstract 

Ah  inilto  electronic  structure  calculations,  earned  out  in  conjunction  with  rigorous  analysis  of  the 
resulting  wavefunctions.  demonstrate  that  the  electronic  factors  responsible  for  the  unusual  stability  of 
the  l,3-dimethylimidazol-2-ylidene  carbene  are  directly  related  to  the  substantial  o-backdonation  from 
the  carbenic  carbon  to  the  adjacent  nitrogen  atoms.  The  ir-donation  is  found  to  play  only  a  minor  role, 
ruling  out  the  presence  of  significant  stabilization  due  to  ylidic  resonance  structures  Tbe  unusually  large 
proton  affinity  of  the  carbene  is  explained  by  the  extra  ir-electron  stabilization  of  the  corresponding  1,3- 
dimethylimidazolium  cation  The  MP2/6-3 1 1  +  +G*  *  level  of  theory  is  found  to  be  sufficient  for  accurate 
predictions  of  the  geometries  of  the  imidazol-2-ylidene  earbenes.  P  199.1  John  Wiley  &  Sons.  Inc 

Introduction 

The  accuracy  achieved  by  electronic  structure  calculations  on  1 CH2 ,  the  prototype 
carbene.  is  widely  regarded  as  one  of  the  triumphs  of  modern  quantum  chemistry 
(for  a  recent  review,  see  Ref.  [1]).  For  larger  earbenes,  such  calculations  are  too 
expensive  to  be  practical,  as  it  takes  over  700,000  configurations  in  the  Cl  procedure 
and  very  large  basis  sets  to  compute  the  singlet-triplet  splitting  m  ICH2  within 
0.1  [kcal/mol]  [2],  In  light  of  this  fact,  the  necessity  of  understanding  the  general 
factors  influencing  stability  of  earbenes  becomes  urgent. 

Following  several  unsuccessful  attempts  by  Wanzlick  and  Schonherr  [3],  very 
recently  Arduengo  and  coworkers  have  succeeded  in  isolating  several  stable  earbenes 
for  the  first  time  [4,5].  These  compounds,  many  of  which  have  been  crystallized 
and  have  had  their  structures  analyzed  by  means  of  X-ray  diflfraction,  are  alkyl  and 
aryl  derivatives  of  imidazol-2-ylidene.  Despite  the  obvious  importance  of  this  dis¬ 
covery,  very  little  effort  has  been  directed  by  theoretical  chemists  toward  explaining 
the  unusual  stability  of  the  imidazol-2-ylidene  earbenes,  with  the  early  low-grade 
EHT  calculations  [6]  and  the  recent  ab  initio  study  [7]  being  the  only  relevant 
electronic  structure  investigations  available  in  the  chemical  literature. 

Several  researchers  extended  conflicting  arguments  concerning  the  mechanisms 
responsible  for  the  wide  variation  in  the  singlet-triplet  energy  gaps  in  earbenes  ( for 
a  recent  review  and  an  attempt  to  reconcile  the  different  theories,  see  Ref.  [8]). 
Regrettably,  most  of  the  previously  published  analyses  rely  heavily  on  nonrigorous 
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interpretive  tools,  such  as  the  notoriously  unreliable  Mulliken  atomic  charges.  Al¬ 
t'-  jugh  the  arguments  arising  from  application  of  such  tools  are  often  qualitatively 
correct  (note,  however,  some  recent  examples  to  the  contrary  [9,10]),  reliable  and 
theoretically  consistent  interpretation  of  electronic  wavefunctions  can  only  be 
achieved  with  techniques  that  do  not  invoke  the  notion  of  basis  functions  as  “atomic 
orbitals.”  Building  upon  the  topological  theory  of  atoms  in  molecules  [  1 1  ] ,  we  have 
recently  proposed  bias-free  definitions  of  localized  natural  orbitals  [12],  covalent 
bond  orders  [13],  measures  of  steric  crowding  [14],  and  electronegativities  of  atoms 
and  functional  groups  in  molecules  [15].  In  this  paper,  we  employ  these  rigorous 
interpretive  tools  in  order  to  pinpoint  the  origins  of  the  unusual  stability  of  1,3- 
dimethylimidazol-2-ylidene  (Fig.  1 ),  which  is  the  simplest  stable  carbene  reported 


Figure  2.  The  molecular  skeleton  of  C5H9N2. 
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thus  far  [  5  ] .  We  also  report  the  results  of  a  similar  analysis  for  its  parent  carbenium 
ion,  the  1 ,3-dimethylimidazolium  cation  (Fig.  2). 

Computational  Techniques 

The  geometries  of  the  systems  under  study  were  optimized  at  both  the  HF/6- 
31 1++G**  and  MP2/6-31 1++G**  levels  using  the  GAUSSIAN  90  suite  of  pro¬ 
grams  [16].  The  minima  on  the  respective  potential  energy  hypersurfaces  were 
characterized  with  the  HF/6-31 1+-I-G**  vibrational  frequencies.  The  total  and 
relative  energies  obtained  from  these  calculations  are  listed  in  Table  I. 

The  one-electron  density  matrices  were  computed  at  both  the  HF  and  mp2  [17] 
levels.  The  atomic  overlap  matrices  ( AOMs)  [18]  were  calculated  with  the  VECAIM 
program  [19  j.  The  LOSSES  program  [20]  was  employed  in  computations  of  the 
localized  natural  orbitals  [121  and  covalent  bond  orders  [13]. 

Results  and  Discussion 

In  the  singlet  |  CHj  with  a  linear  geometry,  the  degeneracy  of  Slater  determinants 
that  result  from  occupying  the  perpendicular  tt  orbitals  necessitates  invoking  a  two- 
configurational  wavefunction  as  the  correct  zeroth-order  description.  However,  de¬ 
creasing  the  H— C— H  angle  to  its  equilibrium  value  of  ca.  105°  significantly  lessens 
the  severity  of  the  degeneracy,  making  the  singly-configurational  description  rea¬ 
sonable  [21].  Therefore,  despite  relatively  large  errors  in  the  calculated  singlet- 
triplet  splittings,  one  expects  the  mp2  level  of  theory  to  yield  geometries  and  electron 
densities  accurate  enough  for  a  reliable  analysis  of  the  electronic  structures  of  singlet 
carbenes.  The  results  of  calculations  on  the  C5H8N2  carbene  (Fig.  1,  Table  II) 
confirm  these  expectations.  When  compared  with  the  experimental  values  for  the 
1,3,4,5-tetramethyl  derivative,  the  calculated  bond  lengths  and  bond  angles  are 
found  to  be  as  accurate  as  the  corresponding  quantities  obtained  with  the  two- 
configurational  SCF  [  7  ] .  The  analogous  comparison  for  the  geometrical  parameters 
of  the  C5H9N2  cation  (Fig.  2,  Table  III)  is  hindered  by  the  unavailability  of  suf¬ 
ficiently  accurate  experimental  data. 

The  C— N  bonds  in  both  the  carbene  and  the  carbenium  cation  are  strongly 
polarized,  as  reflected  by  the  calculated  atomic  charges  (Tables  IV  and  V).  As 


Table  I.  The  total  and  relative  HF/6-31 1++G**  and 
MP2/6-3 1 1 ++G**  eneigies  of  C5HJN2  and  CjHjNJ . 


A£mf: 

System 

£hf  (a.u.l 

Ewn  (s-u.) 

[kcal/mol]' 

(kcal/molj* 

QH.Nj 

-302.915711 

-304.108346 

00(0.0) 

0.0  (0.0) 

-303  346434 

-304.537389 

-270  1  (-260.8) 

-269.2  (-259.9) 

"  Values  corrected  for  the  zero-point  energies  (at  298  [K]  with  vibrational 
frequencies  calculated  at  the  HF/6-31 1++G**  level)  in  parentheses. 
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Table  II  The  calculated  vs.  expenmental  bond  lengths  [A]  and  bond  angles  (°]  in  CjHjNj. 


Bond  length  or 
angle* 

Calculated*’ 

Expenmental 

C 

d 

C.Nz 

1.371  (1.349)  [1.352] 

1.363 

1.370 

C4N: 

1.380  (1.384)  [1.307] 

1.394 

1.384 

C4C5 

1.370(1  334)  [1.350] 

1.352 

1.338 

QNj 

1.453  (1.446)  [n/a] 

1.454 

1.484 

C4H, 

1.080(1.069)  [n/a] 

n/a 

0.964 

C,H,„ 

1.089  (1.079)  [n/a] 

n/a 

n/a 

C,H„ 

1.093  (1.085)  [n/a 

n/a 

n/a 

NjC.Nj 

101.5  (103.0)  [101.9] 

101.5 

102.2 

C,N2C4 

113.4  (112.2)  [113.5] 

113.5 

112.2 

NjC.C, 

105  8  (106.3)  [105.6] 

105.8 

106.7 

C.N-Q 

122.6(123.7)  [n/a] 

122.9 

122.8 

N2C4H4 

123.4  (123.!)  [n/a] 

n/a 

123.5 

N2C8H1O 

107.2  (108.0)  [n/a] 

n/a 

n/a 

NjQH,, 

110.4  (110  8)  [n/a] 

n/a 

n/a 

*  See  Figure  I  for  atom  numbering. 

'’MP2/6-3I I++G**  values  followed  by  the  HF/6-31 1++G**  data  in  parentheses  and  the  TCSCF/ 
DZP  data  from  Ref.  [7]  in  square  brackets 
'Expenmental  values  for  the  l,3,4,5-tetramethylimidazol-2-ylidene  (5). 

Expenmental  values  for  the  1.3-di-l-adamantylimidazol-2-ylidene  [4.  5] 


Table  III.  The  calculated  vs.  expenmental  bond  lengths  [A]  and  bond  angles  [°]  m  C5H9N2. 


Bond  length  or  angle* 

Calculated*’ 

Expenmental' 

CjH, 

1.080(1.070)  [n/a] 

n/a 

C2N3 

1.342  (1.314)  [1,316] 

1.330 

QNj 

1.374  (1.378)  [1  384] 

1.380 

CjCs 

1.374  (1.341)  [1.345] 

n/a 

C,N3 

1.469(1.467)  [n/a] 

n/a 

C5H7 

1.079(1.068)  [n/a] 

n/a 

C,H„ 

1.089  (1.080)  [n/a] 

n/a 

C,H,2 

1.090(1.080)  [n/a] 

n/a 

NjC-N4 

108.4  (109.8)  [108.2] 

109.7 

C2N3C5 

108.8  (108.0)  [109.4] 

n/a 

N  " 

107.0  (107.1)  [106.5] 

n/a 

C-" 

125.5(126.3)  [n/a] 

n/a 

N 

122.2(122.2)  [n/a] 

n/a 

NjC9ii  1 1 

108.5(108.8)  [n/a] 

n/a 

N3C9H12 

109.1  (109.3)  [n/a] 

n/a 

*  See  Figure  2  for  atom  numbering. 

’’ MP2/6-31 1++G**  values  followed  by  the  HF/6-31 1++G**  data  in  parentheses  and  the  HF/DZP 
data  from  Ref.  [7]  in  square  brackets. 

'  Expenmental  values  for  the  (1,3-di-l-adamantylimidazolium)*'  BICeHj);  [4]. 


INTERPRETATION  OF  ELECTRONIC  WAVEFUNCTIONS 


313 


Table  IV.  The  HF/6-31 1++G**  and  MP2/6-31 1++G**  Bader 
atomic  charges  and  AOM-based  covaient  bond  orders  in  C3HSN2. 


Atom* 

Ofluler*’ 

Bond 

p 

*  cov 

c, 

0.707  (1.038) 

C,-N2 

1.108(1  034) 

N2 

-1.254  (-1.620) 

C4-N2 

1.012(0.981) 

0 

0.366  (0.496) 

C4— Cs 

1.609(1.702) 

He 

0.033  (0.067) 

C4  — He 

0.949  (0.984) 

c, 

0.363  (0.598) 

Cs-Nj 

0.886(0.871) 

H,o 

0.079  (0  029) 

Q-H.o 

0.916  (0.943) 

H„ 

0.022  (-0.027) 

Q-H„ 

0.936  (0.959) 

*  See  Figure  1  for  atom  numbering. 

MP2/6-3 1 1  ++G»*  values  followed  by  the  HF/6-3 1 1 ++G**  data 
in  parentheses. 


expected  [22],  the  ionicity  of  bonds  is  grossly  overestimated  at  the  HF  level.  For 
this  reason,  all  the  subsequent  discussion  is  limited  here  to  atomic  and  bond  prop¬ 
erties  derived  from  the  Mp2  density  matrices.  The  covalent  bond  orders  indicate 
the  presence  of  single  bonds,  except  for  those  linking  two  carbon  atoms  in  the 
imidazole  rings,  which  have  bond  orders  close  to  2. 

In  accordance  with  the  predominantly  singly-configurational  character  of  both 
C5HfiN2  and  C5H9N2 ,  the  occupation  numbers  of  the  strongly  occupied  localized 
natural  orbitals  (LNOs)  do  not  deviate  significantly  from  the  limiting  value  of  2 
(Tables  VI  and  VII).  The  LNOs  make  it  possible  to  analyze  the  electronic  structures 
of  the  species  under  study  on  an  orbital-by-orbital  basis.  Discarding  those  describing 
the  core  electrons,  there  are  27  strongly  occupied  localized  natural  orbitals  in  the 
l,3-dimethylimidazoI-2-ylidene  carbene.  The  presence  of  C2,  symmetry  reduces 
this  number  to  only  10  distinct  LNOs  (Table  VI,  Fig.  3).  Taking  into  account  the 


TableV  The  HF/6-31  1++G**  and  MP2/6-3I I++G**  Baderatomic 
charges  and  AOM-based  covalent  bond  orders  in  QH^NJ. 


Atom* 

Bond 

P  ** 

•  cov 

H, 

0.160  (0.144) 

C2-H. 

0.886  (0.904) 

C2 

0995(1.401) 

C2-N3 

1.180(1.117) 

Nj 

-1.226  (-1.580) 

C,  — N, 

0.985  (0.955) 

Cj 

0.418  (0.527) 

Cj-Ce 

1.522(1.626) 

H, 

0  147  (0.121) 

Cj-H, 

0.920  (0.955) 

c. 

0.323  (0.508) 

C,--N, 

0.848  (0  829) 

H„ 

0.081  (0.042) 

C9  Hu 

0.919(0.947) 

H,2 

0.091  (0.055) 

c,-h,2 

0.919(0.948) 

*  See  Figure  2  for  atom  numbering 

"  MP2/6-31 1-E-EG**  values  followed  by  the  HF/6-31 1 ++G**  data  in  paren¬ 
theses. 
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Table  VI.  The  valence  MP2/6-3 1 1 ++G**  strongly  occupied  localized  natural  orbitals  in  CjHjNj. 


Orbital*  Occupancy  Description 


8(9) 

1.963 

10(11) 

1.960 

12(13) 

1.959 

14 

1.965 

15(16) 

1.941 

17 

1.964 

18(19) 

1.971 

20(21) 

1.971 

22 

1.930 

23  (24.  25.  26.  27) 

1.971 

61.0%  ionic  C  —  N  a  bond.  96.8%  localized 
46.9%  ionic  C — N  it  bond.  96.5%  localized 
46.6%  ionic  C  —  N  a  bond.  96.4%  localized 
Purely  covalent  C — C  a  bond.  95.4%  localized 
Tricentrie  C  —  N — C  x  bond.  93.9%  localized 
a  lone  pair.  90  4%  localized  on  C 
1 1  3%  ionic  C  —  Ha  bond.  97.4%  localized 
8.8%  ionic  C  —  Ha  bond.  95.6%  localized 
Purely  covalent  C — C  ir  bond.  85.2%  localized 
4.2%  ionic  C  —  H  ff  bond.  96. 1  %  localized 


*  LNOs  ordered  according  to  increasing  negative  kinetic  energies  (see  Fig.  3)  with  orbitals  related  by 
symmetry  given  in  parentheses. 


local  symmetries,  eight  of  these  orbitals  are  of  the  a  type.  With  the  exception  of 
the  ff  lone  pair,  the  a  LNOs  are  strongly  localized,  with  the  localization  numbers 
(calculated  as  the  sums  of  the  pertinent  atomic  occupancies)  in  excess  of  95%.  The 
imidazole  ring  is  described  by  the  lnos  8, 10, 14,  and  their  symmetry-related  coun¬ 
terparts  ( Fig.  3).  The  bonds  between  the  formally  divalent  (carbenic)  carbon  and 
the  neighboring  nitrogens  ( lnos  8  and  9)  are  particularly  ionic,  resulting  in  a  large 
positive  charge  on  the  carbon  atom  (Table  IV).  The  ionicities  of  the  remaining 
C — N  bonds  (LNOS  10  and  1 1  for  the  intraring  C — N  bonds,  and  LNOS  12  and  13 
for  the  ring-CHi  C — N  bonds)  are  markedly  smaller.  The  C — H  bonds  of  the  bonds 
CH  ( LNOS  1 8  and  1 9 )  and  CHy  groups  ( LNOs  20. 2 1 ,  23-27 )  are  almost  completely 
covalent.  Finally,  the  C — C  bond  of  the  imidazole  ring  ( LNO  1 4 )  is  purely  covalent 
due  to  the  underlying  molecular  symmetry. 


Table  VII.  The  valence  MP2/6-31 1 ++G**  strongly  occupied  localized  natural  orbitals  in  C5H9N2. 


Orbital* 

Occupancy 

Description 

8(9) 

1.962 

51.7%  ionic  C — N  <r  bond.  96.5%  localized 

10(11) 

1.961 

52.1%  ionic  C  —  N  a  bond.  97.0%  localized 

12(1.3) 

1.961 

53.4%  ionic  C  —  N  ir  bond.  97.1%  localized 

14 

1.965 

Purely  covalent  C — Co  bond.  95.0%  localized 

15(16) 

1.931 

Tncentnc  C — N — C  ir  bond.  94.8%  localized 

17 

1.972 

18.8%  ionic  C —  Ha  bond.  96  3%  localized 

18(19) 

1.972 

17.9%  ionic  C—  H  a  bond.  91.2%  localized 

20 

1.920 

Purely  covalent  C — C  ir  bond.  80.7%  localized 

21  (22,  23.  24) 

1.971 

10.1%  ionic  C  —  H  a  bond.  95  9%  localized 

25 (26) 

1.971 

9.1%  ionic  C  —  H  a  bond.  95.7%  localized 

*  I  NOS  ordered  according  to  increasing  negative  kinetic  energies  (sec  Fig.  4)  with  orbitals  related  by 
symmetry  given  in  parentheses. 
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®18  ^  20 

22  23 

Figure  3.  Atomic  occupancies  ( represented  by  areas  of  the  shaded  circles )  of  the  strongly 
occupied  localized  natural  orbitals  (lnos)  of  the  CsHjNj  molecule. 


The  system  of  the  rr-type  orbitals  consists  of  just  two  distinct  lnos.  The  lno 
number  14,  which  pertains  to  the  intraring  C — C  x-bond,  is  partially  delocalized 
over  the  neighboring  nitrogen  atoms.  The  other  symmetry-related  pair  of  the  x- 
type  LNOS  (LNOS  15  and  16)  describes  tricentric  C — N — C  bonds.  Despite  the 
delocalized  character  of  these  lnos,  their  occupancies  on  the  carbenic  carbon  atom 
are  rather  small,  amounting  to  only  12%.  This  implies  only  a  marginal  x-donation 
from  the  nitrogens  to  the  carbon  atom,  especially  when  compared  with  the  massive 
ff-backdonation.  This  <7-backdonation  also  plays  some  role  for  the  carbon  lone  pair 
(LNO  17),  which  is  only  90.4%  localized  (Table  VI,  Fig.  3). 

Protonation  of  C5HgN2  is  a  strongly  exothermic  process.  In  agreement  with  the 
previous  calculations  [7],  we  find  the  imidazol-2-ylidene  system  to  be  a  much 
stronger  base  than  other  carbenes.  We  attribute  this  increased  basicity  to  the  extra 
x-electron  stabilization  enjoyed  by  the  corresponding  carbenium  cation,  for  which 
'here  is  the  following  evidence:  First,  the  excess  electric  charge  of  the  proton  is 
iciently  distributed  over  the  entire  imidazol  moiety  (compare  the  data  in  Tables 
iV  and  V ).  Therefore,  the  increase  (by  0.288 )  in  the  atomic  charge  of  the  carbonic 
carbon  is  much  smaller  than  what  one  would  expect  from  the  loss  of  the  lone  pair 
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electrons  that  are  now  shared  with  the  hydrogen  atom.  The  protonation  almost 
completely  equalizes  the  ionicities  of  the  three  distinct  C — N  bonds  (LNOs  8,  10, 
and  12;  Table  VII),  which  now  span  a  narrow  range  of  51.7-53.4%.  The  newly 
formed  C — H  bond  (described  by  LNO  17)  is  very  similar  in  its  very  moderate 
ionicity  to  the  other  C — H  bonds  (LNOs  18  and  19)  of  the  imidazole  ring. 

Second,  although  the  a-type  LNOs  (including  those  describing  the  C — H  bonds 
of  the  CH3  groups  and  the  C — C  bond  of  the  imidazole  ring)  remain  strongly 
localized  in  the  carbenium  cation,  localization  of  the  xr-type  orbitals  decreases 
markedly  upon  protonation.  In  particular,  the  localization  index  of  the  LNO  per¬ 
taining  to  the  C — C  TT-bond  falls  by  ca.  5%,  mostly  due  to  the  increased  delocalization 
over  the  formerly  carbenic  carbon  atom  (Fig.  4).  The  same  carbon  atom  also 
enjoys  an  increased  vr-donation  from  the  neighboring  nitrogens,  due  to  changes  in 
atomic  occupancies  of  the  tricentric  LNOs  15  and  16  (compare  Figs.  3  and  4). 

When  comparing  two  chemical  systems,  the  increase  (decrease)  in  the  strength 
of  an  analogous  bond,  as  evidenced  by  its  shortening  (lengthening),  can  be  attributed 
to  either  increased  (decreased)  covalent  interactions  or  increased  (decreased)  bond 
ionicity.  Which  mechanism  is  the  major  driving  force  in  a  given  case  can  be  easily 

^  12  '  ^14  ' 

21  25 


Figure  4.  Atomic  occupancies  ( represented  by  areas  of  the  shaded  circles)  of  the  strongly 
occupied  localized  natural  orbitals  (lnos)  of  the  CsH9Nj  cation. 
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decided  by  examining  the  changes  in  the  relevant  covalent  bond  orders.  The  decrease 
(increase)  in  the  bond  strength  accompanied  by  the  decrease  (increase)  in  the 
covalent  bond  order  indicates  the  predominant  decrease  (increase)  in  the  covalent 
interactions,  which  can  often  be  explained  by  delocalization  effects.  On  the  other 
hand,  the  decrease  (increase)  in  the  bond  strength  in  conjunction  with  the  increai-ed 
(decreased)  covalent  bond  order  points  to  the  predominant  decrease  (increase)  in 
the  bond  ionicity. 

Applying  the  aforedescribed  criterion,  one  finds  that  the  significant  shortening 
upon  protonation  of  the  bonds  linking  the  carbenic  carbon  with  the  neighboring 
nitrogens  is  mostly  due  to  increased  conjugation  in  the  carbenium  cation.  The 
accompanying  slight  shortening  of  the  two  other  C — N  bonds  of  the  imidazole  ring 
can  be  attributed  to  the  increased  bond  ionicity.  The  C — C  bond  lengthens  somewhat 
upon  protonation.  This,  together  with  the  corresponding  drop  in  the  covalent  bond 
order,  indicates  decreased  covalent  interactions  as  the  result  of  the  increased  tt- 
electron  delocalization  (see  above). 

What  are  the  origins  of  the  unusual  stability  of  the  l,3-dimethylimidazol-2-ylidene 
carbene?  The  factors  related  to  the  substituents  on  the  imidazole  ring  have  to  be 
spelled  out  first.  Among  them,  the  absence  of  hydrogen  atoms  in  the  1,3-  positions 
is  obviously  important,  as  it  prevents  rapid  isomerization  to  imidazole  through  a 

1.2- proton  shift.  Blocking  of  the  4,5-  positions  appears  to  be  less  critical,  although 

1.3- dimethyl-imidazol-2-ylidene  is  much  less  stable  than  its  1 ,3,4.5-tetramethyl  an¬ 
alog  [4]. 

The  electronic  factors  are  directly  traced  to  the  large  singlet-triplet  energy  gap  of 
C5H8N2  [7].  In  principle,  two  alternative  explanations  of  this  large  splitting  are 
possible  [  8  ] .  First,  in  line  with  the  original  ideas  of  Wanzlick  [  3  ] ,  one  would  expect 
the  TT-donation  to  stabilize  the  singlet  state  through  bonding  with  the  vacant  tt 
orbital  of  the  carbenic  carbon  [23].  Another  possibility  is  that  the  (x-backdonation 
favors  the  singlet  state  by  increasing  the  positive  charge  on  the  carbon  atom  [24]. 
In  light  of  the  analysis  described  above,  the  first  explanation  does  not  appear  credible, 
taking  into  account  the  small  degree  of  xr-donation  present  in  the  carbene  under 
study.  On  the  other  hand,  the  second  explanation  is  fully  compatible  with  the  large 
<T-donation  due  to  the  large  electronegativity  difference  between  the  carbenic  carbon 
and  the  adjacent  nitrogens. 

It  is  proper  at  this  point  to  comment  on  the  relative  importance  of  the  two 
aforementioned  mechanisms  to  the  energetics  of  the  singlet-triplet  gap.  In  principle, 
it  is  possible  (although  not  very  likely)  for  the  xr-donation  to  play,  in  spite  of  its 
small  relative  magnitude,  the  dominant  role  in  determining  the  singlet-triplet  split¬ 
ting.  The  only  way  to  entirely  rule  out  this  possibility  is  to  correlate  the  calculated 
splittings  with  the  extents  of  xr-donation  and  a-backdonation.  Such  correlation, 
which  has  been  recently  attempted  [8],  is  not  without  serious  dangers  of  misinter¬ 
pretation.  First  of  all,  the  use  of  the  Mulliken  atomic  charges,  with  their  well-known 
deficiencies  (see,  for  example.  Ref.  [25]),  may  easily  result  in  misleading  conclu¬ 
sions.  This  is  well  illustrated  by  the  previous  calculations  [  7  ]  which,  on  the  basis 
of  the  Mulliken  population  analysis,  found  the  stabilization  of  the  singlet  state  of 
imidazol-2-ylidene  due  to  the  xr-donation  relatively  insignificant.  At  the  same  lime. 
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the  Mulliken  charge  on  the  carbenic  carbon  atom  was  found  to  be  very  close  to 
zero,  ruling  out  stabilization  due  to  the  a-backdonation  and  leaving  the  unusual 
stability  of  the  imidazol-2-ylidene  carbene  comp’;tely  unexplained. 

Second,  even  when  rigorously  derived  atomic  charges  are  correlated  with  the 
singlet-triplet  energy  gaps,  one  may  expect  the  proportionality  factor  to  be  signif¬ 
icantly  different  from  that  for  an  isolated  carbon  atom.  This  is  so  because  the 
potentials  around  the  carbenic  carbon  atoms  are  no  longer  spherically  symmetrical. 
This  means  that  the  singlet-triplet  splitting  is  expected  to  depend  not  only  on  the 
charge  on  the  pertinent  carbon  atom,  but  also  on  its  higher  multipole  moments, 
which  often  have  quite  large  magnitudes.  For  example,  our  calculations  show  that 
the  carbenic  carbon  atom  in  the  l,3-dimethylimidazol-2-ylidene  molecule  bears 
the  dipole  moment  of  ca.  1 .88  [a.u.] ,  which  corresponds  to  the  mean  displacement 
of  the  atomic  charge  density  equal  to  about  0.15  [A].  Therefore,  any  empirical 
correlation  between  the  splitting  and  the  atomic  charge  must  include  in  an  effective 
manner  variability  of  these  moments  with  the  atomic  charge,  thus  altering  the 
proportionality  factor. 

Finally,  we  would  like  to  bring  the  reader’s  attention  to  a  particular  feature  of 
the  calculated  I,3-dimethylimidazol-2-ylidene  geometry  for  which  we  do  not  have 
a  simple  explanation.  As  revealed  by  the  data  listed  in  Table  II.  the  bond  angle 
NiCsHio  (Fig.  1.  Table  II)  between  the  nitrogen  atom,  the  carbon  of  the  methyl 
group,  and  the  hydrogen  facing  the  lone  pair  of  the  carbenic  carbon  is  much  smaller 
(by  ca.  3°)  than  the  analogous  angles  (NjQHn  and  NjCrHh)  for  the  other  hy¬ 
drogens.  The  difference  in  angles  disappears  almost  completely  upon  protonation. 
We  speculate  that  the  tilting  of  the  C — H  bond  toward  the  carbon  lone  pair  is 
caused  by  attractive  interactions  between  the  C — H  dipole  and  strongly  dipolar 
electron  distribution  around  the  carbenic  carbon. 

Conclusions 

One  can  expect  the  geometries  of  imidazol-2-ylidene  carbenes  and  the  corre¬ 
sponding  imidazolium  cations  to  be  accurately  predicted  at  the  MP2/6-31 1++G** 
level  of  theory.  Rigorous  analysis  of  the  one-electron  density  matrices  calculated 
within  the  same  approximation  affords  a  clear-cut  description  of  the  electronic 
structures  of  these  systems  in  terms  of  localized  natural  orbitals.  Taking  into  account 
the  results  of  such  an  orbital-b> -orbital  analysis,  the  unusual  stability  of  the  imidazol- 
2-ylidene  carbenes  can  be  directly  related  to  the  substantial  ir-backdonation  from 
the  carbenic  carbon  to  the  adjacent  nitrogen  atoms,  with  the  resulting  large  positive 
charge  on  the  carbon  atom  strongly  favoring  the  singlet  state  over  the  triplet  one. 
The  TT-donation  is  expected  to  be  marginal  and  therefore  to  play  only  a  minor  role, 
ruling  out  the  presence  of  significant  stabilization  due  to  >lidic  resonance  structures. 
The  unusually  large  proton  affinity  of  the  imidazol-2-ylidene  carbenes  can  be  readily 
explained  by  the  extra  ir-electron  stabilization  of  the  corresponding  imidazolium 
cations. 
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Abstract 

Techniques  for  computing  the  multicenter  integrals  required  for  molecular  calculations  for  orbitals 
for  which  the  radial  factors  are  given  numerically  are  described.  The  methods  make  extensive  use  of  a 
numencal  algonthm  for  computing  sphencal  Bessel  transforms.  The  feasibility  of  using  these  methods 
is  demonstrated  by  applying  them  to  SCF  calculations  for  the  methane  molecule.  iS-  1993  John  Wiley  & 
Sons,  Inc 


Introduction 

In  atomic  structure  calculations,  both  numerical  or  finite  difference  approximation 
and  finite  basis  expansion  methods  have  proved  to  be  of  great  value  and  have  lead 
to  successful  theoretical  calculations.  The  former  approach  has  been  pioneered  in 
large  part  by  Froese  Fischer  [1],  and  the  application  of  the  latter  in  atomic  physics 
is  associated  to  considerable  extent  with  Dementi  and  Roetti  [  2  ] .  The  situation  is, 
of  course,  considerably  different  in  molecular  physics,  where,  apart  from  some 
calculations  for  diatomic  molecules,  essentially  all  calculations  employ  finite  basis 
expansion  methods,  as  pioneered  by  Roothaan.  Moreover,  practically  all  finite  basis 
expansion  methods  employ  expansions  in  Gaussian  type  orbitals  (GTOs).  The  reason 
for  this  is  obvious;  the  calculation  of  the  necessary  matrix  elements  is  almost  trivial 
for  GTOs  and  is  practically  intractible  for  other  functional  forms. 

Despite  the  fact  that  GTO  expansion  methods  have  been  widely  successful  in 
describing  molecular  properties,  there  still  seems  to  be  interest  in  developing  alter¬ 
native  methods  for  approximating  molecular  wave  functions.  This  stems  in  part 
from  the  fact  that  GTOs  are  not  particularly  suited  to  describing  the  behavior  of  the 
wave  function  close  to  the  nuclei  and  at  large  distances.  This  problem  is  overcome 
in  part  by  superposing  several  GTOs  to  give  a  more  realistic  orbital  (the  method  of 
contraction),  but  this  sacrifices  part  of  the  virtue  of  the  simplicity  of  the  matrix 
element  calculation.  It  is  also  a  difficult  computational  challenge  to  optimize  the 
nonlinear  parameters  in  the  GTOs,  and  this  is  usually  done  only  partially,  using  so- 
called  even-tempered  nonlinear  parameters.  The  interest  in  using  basis  functions 
other  than  GTOs  has  led  to  considerable  effort  to  develop  techniques  for  computing 
matrix  elements  for  other  "-'rms  of  basis  functions,  namely.  Slater  orbitals  (STOs) 
or  more  general  forms  of  exponential  basis  functions  (etos).  Techniques  for  such 
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orbitals  in  the  two-center  case  have  been  developed  some  30  years  ago  by  Barnett 
and  Coulson  [3]  and  Geller  [4].,  A  variety  of  approaches  have  been  reviewed  in 
1981  at  a  conference  devoted  to  these  problems  [5]. 

As  commented  above,  relatively  little  work  has  been  done  on  applying  purely 
numerical  techniques  in  molecular  calculations,  although  some  aspects  of  this  have 
been  reviewed  at  a  conference  in  1988  [6]. 

Since  numerically  determined  wave  functions  offer  more  generality  than  wave 
functions  constructed  as  linear  combinations  of  analytic  basis  functions,  it  remains 
of  interest  to  try  to  incorporate  such  functions  in  molecular  calculations.  In  this 
article  some  progress  in  this  direction  will  be  described  and  results  of  applying  the 
methods  to  the  CH4  molecule  are  reported. 

The  calculations  that  have  been  carried  out  are  SCF  calculations  using  molecular 
orbitals  that  are  linear  combinations  of  atomic  orbitals  that  are  obtained  numerically 
as  atomic  orbitals.  It  would  appear  that  the  use  of  such  orbitals  should  be  optimal, 
in  that  all  the  energy  to  be  obtained  by  orbital  optimization  for  the  isolated  atomic 
systems  is  obtained,  and  resulting  energy  differences  should  represent  chemical 
bonding  energies.  In  other  words,  errors  from  basis  set  incompleteness  should  be 
very  small  in  this  approach.  It  could  also  be  hoped  that,  by  using  more  realistic 
orbitals,  the  required  number  of  orbitals  to  obtain  satisfactory  results  might  be 
substantially  reduced.  The  extra  computational  complexity  of  using  numerical  or¬ 
bitals  could  then  possibly  be  compensated  by  a  reduction  in  the  number  required. 

The  numerical  methods  used  are  based  on  an  effectiye  method  for  computing 
spherical  Bessel  transforms  numerically  [7,8].  Since  these  transforms  are  essentially 
the  Fourier  transforms  of  angular  momentum  eigenfunctions,  and  translation  in 
position  space  corresponds  to  multiplication  by  a  complex  exponential  in  momen¬ 
tum  space,  this  facilitates  the  calculation  of  multicenter  integrals  for  numerically 
determined  orbitals.  This  transform  method  will  be  reyiewed  in  the  next  section, 
and  its  application  to  overlap  integrals,  nuclear  attraction  integrals,  and  two-electron 
integrals  will  be  described  in  the  following  sections.  The  results  of  the  application 
of  the  method  to  CH4  will  be  given,  and  the  limitations  and  outlook  for  other 
applications  will  be  discussed. 

Momentum  space  methods  for  STOs  have  been  studied  extensively  by  Silverstone 
and  coworkers;  these  methods  have  been  reviewed  by  Todd  et  al.  [9].  Similarly, 
techniques  using  so-called  B  functions  which  are  equivalent  to  STOs  have  been 
extensively  studied  by  Steinbom  and  coworkers.  This  work  has  been  recently  re¬ 
viewed  by  Steinbom  et  al.  [10]. 

Numerical  Spherical  Bessel  Transforms 

The  basis  functions  to  be  considered  are  taken  to  be  angular  momentum  functions 
of  the  form 

fin,{r)=fAr)Y,„{r),  (1) 

where  T/mlf )  is  the  usual  spherical  harmonic.  The  Fourier  transform  of  /is  defined 
to  be 


y;™(k)  =  J  e"^"f,„(r)dt. 


(2) 
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The  familiar  expansion  of  a  plane  wave  in  spherical  harmonics 

c'*'-'  =  47r  2  i‘Mkr)Y,„{k)Y,„(i)*  (3) 

can  be  readily  applied  to  show  that 

f,„{k)  =  4H'Mk)YM,  (4) 

whe;  • 

f,{k)  =  r Mkr)f,{r)r^  dr.  (5) 

Jo 

If  new  variables  /c  =  In  /c  and  p  =  In  r  are  introduced,  this  equation  can  be  written 
as 


Me‘)=  r  dp 

J-CC 

J-OO 

The  integral  in  the  right-hand  side  of  Eq.  (6)  can  be  recognized  as  a  convolution 
of  a  function  of  p  and  function  of  p  +  #c.  It  is  therefore  possible  to  compute  it  in 
terms  of  the  inverse  Fourier  transform  of  the  product  of  the  Fourier  transform  of 
each  factor;  this  gives  the  convolution  as  a  function  of — x. 

In  the  calculations  described  here,  the  Fourier  transform  of  is  com¬ 

puted  numerically  using  the  fast  Fourier  transform  method,  and  the  transform  of 
is  obtained  analytically.  The  inverse  transform  of  the  product  is  again 
computed  using  the  fast  Fourier  transform.  Complete  details  of  the  numerical 
method  are  given  in  Ref  [  8  ] ., 

The  expansion  of  an  angular  momentum  function  centered  at  one  point  in  terms 
of  angular  momentum  functions  centered  at  another  point  can  easily  be  effected 
using  Fourier  transforms.  We  write 


fimir  -  a) 


^  J  dk. 


(7) 


Expanding  each  factor  in  the  last  expression  in  spherical  harmonics,  using  Eqs.  ( 3 ) 
and  (4)  gives 


fim{r  -  a) 


2 

LL'MM' 


L  L' 
-M  -M' 


yLM{^)YL'MimLu(r,a), 


(8) 
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where 


V  47r  \0  0  0/ 

X  r  jdka)jLikr)f,(k)k^  dk.  (9) 

vO 

The  functionsyi,t'(r,  a),  which  are  the  Lowdin  alpha-functions  in  the  case  of  STOs 
[11],  can  therefore  be  tabulated  on  the  logarithmic  mesh  by  computing^! it),  mul¬ 
tiplying  hyjL(.ka)  and  computing  the  L’  spherical  Bessel  transform  of  the  product. 

The  overlap  integral  of  two  angular  momentum  functions  can  be  computed  in 
momentum  space  [12]  as 


Ji'mW*gim{r)dkdr 


j  e'^-^’-Vrn,dk)*gi.„(r)dkdr 
=  e-''^'*frnAk)*g',,„ik)dk 

^  8  2  //--/'-/  ^  /(2/+1)(2/'+1)(2L+1) 

L\i  \  47r 

0  o)(,l  -1/ 


The  sum  on  L  is  in  this  case  finite.  Matrix  elements  of  the  kinetic  energy  operator 
are  readily  obtained  by  replacing  k^  by  k*  in  the  integrand. 


Nuclear  Attraction  Integrals 

The  calculation  of  the  nuclear  attraction  energy  requires  the  evaluation  of  integrals 
of  the  form 

f  =  J  f'mir  -  a')*  -  gi„{r  -  a)  dr.  (11) 

These  have  been  computed  by  translating  each  factor  to  a  common  origin  which 
is  chosen  to  optimize  the  convergence  rate  of  the  infinite  sums  [13].  The  Coulomb 
factor  is  expanded  using  the  usual  expansion 

17^  =  ?  5xTT  1 


(12) 
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The  maximum  value  of  L'  in  Eq.  (8)  and  of  X  in  Eq.  (12)  will  be  denoted 

by 

The  angular  factor  in  the  integral  is  expressible  in  terms  of  3  -  y  coefficients. 
The  radial  integral  is  obtained  using  the  trapezoidal  rule  on  the  logarithmic  mesh. 
However,  the  integrand  has  a  discontinuous  derivative  at  r  =  R,  and  a  correction 
has  been  included  to  account  for  this. 

It  can  be  shown  [13]  that  the  convergence  of  the  angular  momentum  sums  is 
geometric  in  the  ratios  of  the  distances  of  the  three  centers  from  the  expansion 
center.  With  the  optimum  choice  of  the  expansion  center  the  largest  possible  value 
of  this  ratio  is  0.618,  the  reciprocal  of  the  golden  mean.  The  disadvantage  of  this 
approach  is  that  the  angular  momentum  sum  is  twofold  infinite  whereas  it  is  singly- 
infinite  if  the  expansion  center  is  one  of  the  three  centers  in  the  integrand.  However, 
with  the  latter  choice,  the  convergence  may  be  only  algebraic,  rather  than  geometric 
if  the  centers  lie  on  an  equilateral  triangle. 


Two-Electron  Integrals 

The  most  time  consuming  part  of  the  scF  calculation  is  to  compute  the  two- 
electron  or  four-center  integrals  of  the  form 

/2  =  JJ  fhwAt  -  a)*g,,„,{r  -  b)  iTl-pi- 

X  -  a')* -  b')  dtdT'.  ( 13) 

These  integrals  are  the  interaction  energy  between  two  charge  density  distributions 
which  are  the  products  of  a  pair  of  orbitals  [14].  If  the  number  of  orbitals  is  N, 
there  are  of  the  order  of  N^/ 2  charge  distributions  and  of  the  order  of  N^/S  two- 
electron  integrals.  Although  the  integral  is  sixfold  in  position  space,  it  is  only  threefold 
in  momentum  space  [  1 5  ] ;  it  is  therefore  more  convenient  to  do  the  integration  in 
momentum  space.  The  technique  that  has  been  employed  is  to  expand  the  product 
of  two  orbitals  in  a  multipole  expansion  of  angular  momentum  functions  centered 
at  the  midpoint  points  c  and  c'  between  the  centers  of  each  factor.  This  expansion 
may  be  rather  slowly  convergent;  however,  if  the  centers  are  separated,  the  product 
will  be  small,  and  less  accuracy  is  required  than  when  the  orbitals  are  on  the  same 
center. 

The  interaction  energy  of  charge  distributions  Fa„{t)  centered  and  c  and  Gym'll) 
centered  at  c'  is 

/2  =  JJ  -  c)  J  Gyyfr'  -  t)  drdr'.  ( 14) 

In  momentum  space  this  becomes 

/2  =  ^  J  ^''‘•<*-''’FA,(k)GA','(k)  p  dk. 


(15) 
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This  integral  is  essentially  the  same  as  the  integral  in  Eq.  ( 10)  so  that 

/(2A+  1)(2A'+  1)(2L+  1) 


Ii  =  32ir  2 


LM 


47r 


\0  0  0/\(r  ff'  -mJ 

XliiwCR)  rjL(/cR)A(/c)Gj,>(k)d/c, 

Jo 


(16) 


where  R  =  c  -  c'. 

The  product  of  angular  momentum  functions  centered  at  a  and  b  can  be  expressed 
in  terms  of  angular  momentum  functions  centered  at  the  midpoint  c  =  (a  -  b)/2 
as 


-  a)*^,,„,(r  -  b)  =  2  FA,(R)rue<b),  (17) 


where  R6^  are  the  spherical  polar  coordinates  of  r  -  c.  However,  it  is  more  sat¬ 
isfactory  [16]  to  express  the  product  as 


y/,„,(r-a)*^/j„j(r-b)=  2  (-1)”" 


X 


(2/, +  1)(2/2+1) 

(2X+  1)(2A  +  1) 


( 

4 

X 

\-w, 

mz 

m) 

\m 

(18) 


where 


W'-)=  2  (2X+ 1)(2A+ l)(2y+ 1)(-1)^^ 

LiL'iLiL'z 

/L.  L.  XWL'.  Li  AX 

\  0  0  0/\  0  0  0/ 

(19) 
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Lz  X 
L'z  A 


fL,L',{r,  d)gi,,,/,(r,  d) 


and  d=  [a  -  b|/2.  The  sum  in  Eq.  ( 19)  is  singly  infinite. 
The  two-electron  integrals  are  therefore  given  by 


f2  =  2(-I)'"'W 

jj'  X-' 


1  4 

Wi  -mz 


^,)s(jm./m'),  (20) 

m)\m\  mz  m ) 
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where 


SUmJ'm')=X  X  7:1^  ^  '^,Va£)rxv(£')rzj./(R) 

jxKcj'x'r,'  L  \tn  .«  <rlW  m'  <r7 

V'  LN 
7'  -M) 


X(-i)(A-A'^^)/2+Ar(2L+ 1)^^  ^ 
X  h{\h.j,  \'A.'f,  L) 


A  A'  L\(A  A'  L 

O  <J* 


(21) 


and 


h{\Aj,  X'A'f,  L)  =  3277  r  jLikR)FMk)GyyAk)  dk.  (22) 
Jo 


The  two-electron  integrals  have  been  calculated  using  the  spherical  Bessel  trans¬ 
forms  of  the  functions  FjXA(r)  in  Eq.  (16).,  The  integrals  were  calculated  using 
Gauss-Laguerre  integration  on  a  mesh  of  Ngl  points.  The  functions  FjXA(k)  for  0 
^  A  ^  Amax  are  calculated  initially,  interpolated  onto  the  Gauss-Laguerre  mesh 
and  stored.  The  maximum  value  of  L't  and  L'2  in  the  sum  in  Eq.  (19)  will  be 
denoted  by  Lmax  •  Since  the  calculation  of  the  transforms  of  the  product  functions 
requires  only  N^/2  steps,  it  is  possible  to  use  a  considerably  refined  mesh  and  a 
reasonably  large  value  of  L  max  for  the  calculations  with  a  relatively  small  expenditure 
of  computer  time.  Since  the  Gauss-Laguerre  mesh  is  relatively  small,  of  the  order 
of  20-40  mesh  points,  the  storage  requirements  were  tolerable  in  the  present  cal¬ 
culations. 

The  advantage  of  the  expansion  (18)  over  ( 17)  is  that  fewer  functions  need  to 
be  stored.  For  example,  in  the  case  li  =  k  =  0,  it  is  ncessary  to  store  A^ax  +  1 
functions  Fjx\{k)  arising  in  Eq.  ( 18)  compared  with  (Amax  +  1  )^  functions  F^Ak) 
that  arise  in  Eq.  ( 1 7 ) .  Similarly,  there  is  a  large  reduction  in  the  number  of  integrals 
of  the  type  defined  in  Eq.  (22)  using  the  formulation  of  Eq.  ( 18). 


Numerical  Accuracy 

The  accuracy  obtained  in  the  calculations  depends  on  a  number  of  parameters 
governing  the  numerical  approximations.  These  are: 

Pmifi)  Amm.  h,  A^jog,  defining  the  logarithmic  meshes  in  r  and  h, 

Ngl,  ^max  defining  the  mesh  for  the  Gauss-Laguerre  integration; 

Ljv,max.  the  maximum  of  L'  and  X  in  the  nuclear  attraction  integrals; 

L2.mtx,  the  maximum  of  L\  and  Li  in  Eq.  ( 19); 

Amax.  the  maximum  of  A  and  A'  in  Eq.  (21 ). 

The  calculations  have  been  made  with  pmin  =  -14,  (Cmm  =  -1 1,  /i  =  0.14  and 
A^iog  =  128.  The  results  change  by  only  a  few  parts  in  lO"*  if  Aiog  is  doubled  and  h 
is  halved. 

Results  of  calculations  for  various  choices  of  the  other  parameters  in  the  minimal 
basis  set  calculation  are  given  in  Table  1.  It  is  seen  that  results  accurate  to  a  few 
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Table  I.  Calculated  energ'  'i  for  the  methane  molecule  in  a.u.  using  the  minimal  basis  for  various 
values  of  the  parameters  determining  the  numerical  procedure.* 


Acl 

^ra*x 

A‘'> 

E 

20 

40.0 

6 

14 

6 

2 

-40.04680 

30 

40.0 

6 

14 

6 

2 

-40.04678 

40 

40.0 

6 

14 

6 

2 

-40.04680 

40 

50.0 

6 

14 

6 

2 

-40.04680 

30 

40.0 

7 

14 

6 

2 

-40.04702 

30 

40.0 

8 

14 

6 

2 

-40.04701 

30 

40.0 

8 

16 

6 

2 

-40.04697 

30 

40.0 

8 

14 

7 

2 

-40.04964 

30 

40.0 

8 

14 

8 

2 

-40.04974 

30 

40.0 

8 

14 

9 

2 

-40.05055 

30 

40.0 

8 

14 

10 

2 

-40.05058 

30 

40.0 

8 

14 

10 

3 

-40.04456 

30 

40.0 

8 

14 

10 

4 

-40.04395 

30 

40.0 

8 

14 

10 

5 

-40.04342 

30 

40.0 

8 

14 

10 

6 

-40.04335 

•  The  parameters  are  defined  in  the  text.  The  C — H  bond  length  is  2.0665  bohr. 


parts  in  10”^  are  obtained  for  reasonably  small  values  of  of  the  order  of  20- 
40.  The  parameter  /cma,  is  the  maximum  value  of  k  in  the  Gauss-Laguerre  mesh 
and  should  be  determined  so  that  the  integrand  in  Eq.  ( 22 )  is  negligible  ax  .  If 
the  orbitals  in  the  integrand  are  5  orbitals,  which  is  the  worst  case,  the  integrand, 
apart  from  the  factor  j^ikK),  decreases  like  k~^  for  large  k  and  should  be  less  than 
10”®  for  Armaa  of  the  order  of  5.  This  should,  however,  be  multiplied  by  Z^ax,  the 
largest  nuclear  charge,  since  this  determines  the  scale  of  the  momentum  variable. 
Therefore,  a  value  of  k^„  of  the  order  of  30  or  40  should  provide  accuracy  of  better 
than  10“®.  The  results  confirm  that  this  is  reasonable. 

It  is  seen  that  accuracy  of  a  few  parts  in  10”®  is  obtained  with  =  8.  Com¬ 
parable  accuracy  is  obtained  for  L2,tnax  =  14. 

It  has  been  found  that  the  sum  on  A  and  A'  for  the  two-electron  integrals  convei^es 
more  slowly  if  either  of  the  separations  c  or  c'  is  zero.  (If  both  separations  are  zero, 
the  sum  is  finite.)  Since  these  integrals  are  also  larger,  it  is  necessary  to  treat  them 
more  carefully.  On  the  other  hand,  the  sum  in  this  case  is  only  singly,  rather  than 
doubly  infinite.  The  calculation  has  therefore  been  arranged  so  that  the  summation 
is  truncated  at  a  larger  value,  A^Ji ,  in  the  case  c  =  0  or  c'  =  0  and  at  a  smaller 
value.  Ami  otherwise.  It  appears  that  accuracy  of  a  few  parts  in  10”®  is  obtained 
for  Ami  =  10  and  Ai?i  =6.  It  is  the  latter  parameter  that  governs  the  most  time 
consuming  part  of  the  two-electron  integral  calculation.  It  can  be  shown  that  the 
number  of  terms  in  the  calculation  behaves  like  A  mix  so  that  in  going  from  Amax 
=  4  to  Amix  =  6,  the  time  requirement  increases  fivefold.  Thus  accuracy  of  the 
order  of  10”^  is  much  more  readily  obtainable  than  lO”**. 
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Results  Including  Polarization  Orbitals 

The  variational  SCF  calculation  can  be  improved  by  going  beyond  the  minimal 
basis  set,  in  this  case  Is,  2s,  and  2p  orbitals  on  C  and  Is  orbitals  on  H,  to  include 
so-called  polarization  orbitals.  These  can  be  either  excited  orbitals  centered  on  each 
atom,  or  “floating  orbitals”  which  may  be  gtos.  Calculations  have  been  carried 
out  using  the  former  approach,  including  up  to  4s,  3p,  and  3d  orbitals  on  C  and 
up  to  2p  on  H.  It  should  be  noted  that  the  orbitals  that  have  been  used  are  not 
exactly  Hartree-Fock  atomic  orbitals;  for  the  C  orbitals  they  are  obtained  from  the 
optimized  potential  model  [17].  which  yield  orbitals  essentially  the  same  as  Hartree- 
Fock  orbitals  for  the  occupied  states  [18],  but  can  also  generate  orthogonal  orbitals 
for  unoccupied  states. 

These  results  have  been  obtained  using  the  numerical  parameters  from  the  last 
line  of  Table  I.  No  further  tests  have  been  made  on  the  effect  of  changing  these 
parameters.  It  can  be  expected,  however,  that  the  effect  on  the  contributions  of  the 
polarization  orbitals  will  be  small  since  these  are  already  small.  Furthermore,  most 
of  the  computational  inaccuracy  arises  from  large  momentum  components  in  the 
orbitals,  and  these  should  be  much  smaller  in  the  more  diffuse  unoccupied  orbitals. 

Results  are  given  in  Table  II  and  compared  with  similar  calculations  using  GTOs 
[19].  The  results  are  rather  disappointing  in  that  the  GTO  calculations  yield  a  lower 
energy  than  the  result  using  optimized  atomic  orbitals  in  each  case.  On  the  other 
hand,  the  results  are  obtained  with  no  orbital  optimization,  whereas  the  results 
using  GTOS  are  obtained  with  searches  over  the  parameters  determining  the  un¬ 
occupied  orbitals. 


Discussion 

The  results  obtained  from  these  calculations  indicate  that,  perhaps  contrary  to 
expectation,  atomic  Hartree-Fock  orbitals  do  not  provide  a  particularly  good  basis 
for  molecular  SCF  calculations.  The  reason  for  this  may  be  that  the  molecule  is 
more  tightly  bound  than  the  constituent  atoms.  The  behavior  of  the  C  2p  orbitals 


Table  11.  Calculated  energies  in  a.u.  for  the  methane 
molecule  using  vanous  basis  sets.* 


Basis 

C 

H 

E 

£gto’’ 

ls,2s.2p 

Is 

-400433 

-40.1282 

+3s.4s.3p 

2s 

-40.1397 

-40.1801 

+3d 

— 

-40.1632 

-40.1982 

+ - 

2p 

-40.1775 

-40.2039 

*  Corresponding  results  for  Gaussian  bases  are  denoted 
Eoio- 
"Ref.  [19], 
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at  large  r  is  governed  by  the  single-particle  energy  eigenvalue  which  is  -0.43 1  a.u., 
whereas  the  CH4  ionization  potential  as  given  by  the  valence  electron  eigenvalue 
in  the  present  calculation  is  -0.548  a.u.  Further  evidence  of  this  is  that  there  are 
very  large  mixings  of  the  polarization  orbitals,  with  amplitudes  comparable  to  those 
of  the  occupied  orbitals. 

On  the  other  hand,  the  results  indicate  that  it  is  quite  feasible  to  carry  out  SCF 
calculations  for  small  molecules  using  these  momentum  space  techniques.  The 
minimum  basis  set  calculation  required  14  s  on  a  CYBER  2000U  and  the  two- 
electron  integrals  required  about  2.5  ms  per  integral.  This  could  also  be  carried  out 
on  a  PC.  The  largest  calculation  reported,  using  35  basis  functions,  required  1600 
s.  The  calculation  of  the  approximately  200,000  two-electron  integrals  required 
1400  s  or  about  7  ms  per  integral.  With  reduced  from  6  to  4,  this  is  reduced 
to  about  2.7  ms  per  integral. 

The  success  of  these  calculations  suggests  that  it  may  be  possiole  to  develop  a 
variational  approach  to  compute  numerically  the  basis  orbitals  entering  the  cal¬ 
culation.  This  could  permit  the  use  of  the  finite  difference  approximation  methods 
that  give  accurate  Hartree-Fock  wave  functions  for  atoms  in  molecular  calculations. 
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Abstract 

Vibration  frequencies  for  2-aminoethanol,  neutral  glycine,  glycolic  acid,  n-propylamine,  n-propanol, 
and  propionic  acid  were  calculated  on  the  ab-imiio  rhf  level  with  a  variety  of  standard  basis  sets  including 
polarization  and  diffuse  functions.  Experimental  frequencies,  if  available,  are  compared  with  these  data 
and  basis  set  suitability  is  discussed  on  the  basis  of  this  companson.  ©  1993  John  Wiiey  &  Sons.  Inc. 

Introduction 

The  calculation  of  vibration  frequencies  in  variational  ab-initio  programs  is  usually 
carried  out  by  diagonalizing  the  Hessian  matrix  of  second  derivatives.  Frequencies 
calculated  this  way  are  consistently  too  large;  in  the  case  of  stretch  vibrations  the 
excess  may  reach  the  order  of  some  hundred  cm’* .  At  first  glance  it  appears  that 
this  deviation  is  caused  by  considering  the  second  derivatives  only,  which  corre¬ 
sponds  to  the  assumption  of  harmonic  oscillator  behavior.  It  is  a  striking  fact, 
however,  that  this  straightforward  looking  consideration  is  wrong:  the  deviation 
from  the  harmonic  oscillator  behavior  can  be  calculated  rather  accurately  by  vari¬ 
ational  ab-initio  methods  at  the  Hartree-Fock  level  as  well  as  at  the  post-Hartree- 
Fock  level  (!].  The  differences  between  experimental  and  calculated  frequencies 
therefore  must  be  due  to  intrinsic  deficiencies  of  the  description  of  the  potential 
energy  surface  near  the  local  minimum. 

Since  calculated  vibration  frequencies  allow  a  direct  link  to  experiments,  the 
correlation  between  calculated  and  experimental  values  has  been  a  topic  of  interest 
for  several  years.  The  main  emphasis  depends  mostly  upon  the  intended  use  of  the 
calculated  values:  either  as  a  help  in  the  assignment  of  normal  modes  to  experi¬ 
mentally  measured  frequencies,  or  as  a  prediction  of  experimentally  inaccessible 
data.  The  first  case  is  best  covered  by  scaling  not  the  frequencies,  but  the  elements 
of  the  Hessian  matrix  with  a  number  of  individual  scale  factors  [2].  In  the  latter 
case  oftentimes  one  global  scale  Lctor  is  used  to  adjust  the  calculated  frequency 
values.  (Some  recent  applications  may  be  found  in  Ref.  [3-6].) 

We  have  investigated  several  compounds  at  the  Hartree-Fock  level  with  a  variety 
of  basis  sets  in  the  last  years  in  the  course  of  a  study  of  amino  acids  and  related 
species.  In  some  cases  considerable  differences  between  the  various  basis  sets  have 
been  encountered,  especially  regard:  le  the  number  and  the  nature  of  stationary 
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points  in  the  potential  energy  surface  [7-10].  The  purpose  of  this  contribution  is 
to  exploit  the  basis  set  dependence  of  the  calculated  frequencies  of  some  of  these 
compounds,  namely  2-aminoethanol,  glycine,  glycolic  acid,  « -propylamine,  pro¬ 
pionic  acid,  and  «-propanol.  All  of  these  combine  one  or  two  of  the  functional 
groups  —  NH2 ,  —  OH,  and  —  COOH  to  organic  molecules  with  four  main  chain 
atoms.  These  molecules  may  be  considered  as  model  compounds  for  larger  systems 
which  are  partly  affected  by  intramolecular  interactions,  e.g.,  biologically  important 
species. 


Results 

For  all  compounds  the  global  minimum  of  the  potential  energy  surface,  or,  in  the 
case  of  n -propylamine  and  n -propanol,  the  one,  which  is  the  global  minimum  with 
most  basis  sets,  is  considered  here.  These  conformers  are  depicted  in  Scheme  1 . 

The  wavenumbers  of  these  conformers,  which  were  calculated  at  the  RHF  level 
[  1 1  ]  for  the  respective  optimized  geometries  with  a  number  of  standard  basis  sets, 
are  shown  in  Figures  1  to  6.  The  following  basis  sets  have  been  used:  3-21G  [12], 
4-21G  [13],  UQIO  [14],  4-31G  [15],  6-31G  [16],  6-31 IG  [17],  and  Dunning’s 
10s,5p/4s  basis  set  [18]  (denoted  dh  in  the  following).  The  6-3 IG  and  the  6-31 IG 
basis  sets  have  also  been  augmented  by  polarization  functions  [19]  and  diffuse 
functions  [20]. 

It  has  to  be  noted  explicitly  that  some  vibration  modes  change  significantly  upon 
basis  set  variation.  All  frequencies,  for  which  the  associated  mode  of  vibration 
remains  essentially  unchanged,  are  connected  in  Figures  1  to  6  with  straight  lines. 
Figure  7  exemplifies  this  using  the  specific  example  of  modes  6  and  7  of  n-propyl- 
amine  with  the  basis  sets  6-31 1+G  and  6-3 IG*:  mode  7  with  the  6-31 1+G  basis 
set  is  identical  with  mode  6  in  the  6-3 IG*  description,  but  mode  6/6-31 1+G  has 
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Figure  2.  Experimental  [28]  and  calculated  wave  numbers  (ab-imuo  RH.-,  harmonic 
approximation)  for  n-propanol  with  various  basis  sets. 
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Figure  3.  Expenmental  [24]  and  calculated  wave  numbers  (ab-imlio  RHF,  harmonic 
approximation)  for  the  global  minimum  in  the  pes  of  glycolic  acid  with  various  basis 

sets. 
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Figure  4.  Calculated  wave  numbers  (ab-iniiio  rhf,  harmonic  approximation)  for  the 
global  minimum  in  the  pes  of  neutral  glycine  with  various  basis  sets. 
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Figure  5.  Calculated  wave  numbers  (ab-mmo  RHF,  harmonic  approximation)  for  the 
global  minimum  in  the  pes  of  propionic  acid  with  various  basis  sets. 
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Figure  6.  Calculated  wave  numbers  (ab-initio  RHF,  harmonic  approximation)  for  the 
global  minimum  in  the  pes  of  2-aminocthanol  with  various  basis  sets. 
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Figure  7.  The  vibration  modes  6  ana  7  of  n-propylamine  with  the  basis  sets  6-31 1 +G 
and  6-3 IG*.  Mode  7/6-31 1+G  is  identical  with  mode  6/6-3 IG*,  but  mode  6/6-31 1+G 
has  no  equivalent  in  the  6-31G*  description  nor  has  mode  7/6-31G*. 


no  equivalent  in  the  6-3  IG*  description  nor  has  mode  7/6-3  IG*.  This  effect  seems 
to  be  specific  for  the  C — C — NH2  fragment,  because  the  same  situation  also  occurs 
in  the  case  of  glycine  and  2-aminoethanol. 

Another  trend,  which  may  be  gathered  from  Figures  2  to  6,  is  the  increase  of  the 
C=0  stretch  vibration  frequency  upon  inclusion  of  polarization  functions  on  the 
atoms  C,  N,  and  O,  and  the  increase  of  the  O — H  stretch  vibration  frequency 
upon  inclusion  of  polarization  functions  on  hydrogen.  Since  there  are  only  minor 
changes  in  the  position  of  the  C — H  and  N — H  stretch  vibrations,  both  of  these 
effects  point  to  the  t/-type  polarization  function  on  oxygen  as  the  sole  reason.  The 
variation  of  the  exponent  of  this  //-type  function  leads  to  energy  minima  for  ex¬ 
ponents  between  0.85  to  0.95  ( Fig.  8 ).  (The  standard  value  proposed  by  Hariharan 
and  Pople  [  1 9  ]  for  this  '•xponent  is  0.8.)  A  slight  increase,  but  no  significant  change 
of  the  O — H  frequency  values  is  correlated  with  an  increase  of  the  exponent. 
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Figure  8.  The  electronic  energy  (6-3IG* )  of  n-propanol  (lop),  propionic  acid  (middle), 
and  glycolic  acid  (bottom)  as  a  function  of  exponent  of  the  rf-type  polarization  function 

on  oxygen. 


Frequencies  calculated  at  the  Mp2  level  were  not  considered  in  this  study,  because 
the  deviation  between  these  and  the  experimental  data  has  no  systematic  trend; 
mp2  frequencies  are  much  closer  to  the  experimental  ones,  but  either  greater  or 
less  than  them. 


Discussion 

Figures  1  to  6  allow  the  distinction  of  three  groups  of  basis  sets:  group  A,  which 
consists  of  the  basis  sets  3-2 IG,  4-2 IG,  and  UQlO;  group  B,  which  contains  the 
split  and  triple  valence  basis  sets  4-3 IG,  6-3  IG,  DH,  and  6-31  IG,  with  and  without 
diffuse  functions;  and  group  C,  which  covers  all  basis  sets  with  polarization  functions. 
This  classification  agrees  with  one  based  on  the  geometry  data  of  all  local  minima 
of  glycolic  acid  and  glycine  [9,21]. 

For  the  purpose  of  judging  the  widely  used  global  scaling  of  calculated  frequency 
values,  the  experimental  frequencies  of  « -propylamine,  w-propanol,  and  glycolic 
acid  [20-24]  arc  compared  with  the  calculated  values  for  all  three  basis  set  groups 
in  Figures  9  to  11.  The  experimental  values  are  drawn  in  these  figures  on  different 
scales  (which  corresponds  to  the  application  of  one  common  scale  factor).  If  the 
lowest  frequencies,  which  are  extremely  difficult  to  detect  experimentally,  are  ignored 
in  the  comparison  of  experimental  and  calculated  values,  a  good  agreement  is  given 
for  the  oxygen  containing  compounds  n-propanol  and  glycolic  acid  only  in  the 
case  of  basis  set  group  B,  whereas  for  n-propylamine  agreement  is  best  with  the 
polarized  basis  sets. 
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Figure  9.  Experimental  [  28  ]  and  scaled  calculated  vibration  frequencies  for  n-propanol. 
The  different  length  of  the  abscissae  corresponds  to  a  uniform  scale  factor. 
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Figure  10.  Experimental  [24]  and  scaled  calculated  vibration  frequencies  for  glycolic 
acid.  The  different  length  of  the  abscissae  corresponds  to  a  uniform  scale  factor. 
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Figure  1 1.  Experimental  [22]  and  scaled  calculated  vibration  frequencies  for  n-propyl- 
amine.  The  different  length  of  the  abscissae  corresponds  to  a  uniform  scale  factor. 


The  effect  of  the  significant  changes,  which  are  to  be  observed  in  Figures  1  to  6 
when  going  from  basis  set  group  B  to  group  C,  is  therefore  opposite  for  oxygen  and 
nitrogen.  For  nitrogen  it  brings  the  calculated  frequencies  in  better  agreement  with 
the  experimental  ones  if  proper  scaling  is  employed.  For  oxygen  it  causes  a  mismatch 
that  increases  when  polarization  functions  are  added  on  hydrogen  atoms.  The  re¬ 
lation  between  bond  lengths  and  calculated  vibration  frequencies  should  be  pointed 
out  specifically.  The  C  =  O  data  for  glycolic  acid,  which  are  displayed  in  Figure 
12,  show  that  there  is  an  almost  perfect  linear  dependence  between  these  quantities, 
when  the  results  for  different  conformers  are  considered  which  were  obtained  with 
the  same  basis  set.  This  corresponds  well  with  the  observation  that  was  made  for 
O — H  in  water  [  25  ] .  In  contrast,  data  of  the  same  conformer  obtained  with  different 
basis  sets  do  not  show  specific  regularities. 

The  good  agreement  between  experimental  and  calculated  frequencies,  which 
were  obtained  with  group  B  basis  sets  for  glycolic  acid  matches  the  observation 
that  the  optimized  geometry  of  the  global  minimum  of  the  potential  energy  surface, 
when  calculated  with  group  B  basis  sets,  is  in  much  better  agreement  with  experi¬ 
mental  data  than  with  group  C  basis  sets.  This  seems  to  be  generalizable  for  organic 
compounds  which  contain  only  oxygen  hetero  atoms  (c.f.  [26]). 

The  conclusion,  which  can  be  drawn  at  least  for  species  similar  to  the  ones  dis¬ 
cussed  here,  is  to  avoid  the  use  of  rf-type  polarization  functions  on  oxygen.  In 
contrast,  basis  sets  of  the  type  used  by  Sato  et  al.  [20]  and  Batista  de  Carvalho  et 
al.  [27],  i.e.,  split  valence  basis  sets  with  additional  ^/-polarization  functions  only 
on  nitrogen,  appear  to  be  quite  reasonable  in  this  context. 
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'•0=0  [A] 


Figure  1 2.  The  correlation  of  C = O  distance  and  frequency  for  all  glycolic  acid  minima, 
calculated  with  the  4-3 IG  basis  set  (left),  where  the  H-bonded  minimum  (IH  in  [9])  can 
be  recognized  as  the  one  outside  the  fitted  curve.  The  global  minimum  ( 0 )  was  calculated 
with  different  standard  basis  sets  ( right ) ,  where  •  marks  double  zeta  basis  sets  and  O  triple 
zeta  basis  sets  with  and  without  diffuse  functions.  All  basis  sets  including  polarization 
functions  are  marked  by  x.  The  experimental  C— O  bond  length,  obtained  from  microwave 
data,  is  1.210  A  [29]. 
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Abstract 

Koga  and  Thakkar's  reoptimized  (9s5p)  and  (12s7p)  Gaussian  basis  sets  for  the  atoms  Li  to  Ne  are 
contracted  to  [4s2p]  and  [7s4p],  respectively,  and  their  (4s)  and  (6s)  sets  for  H  and  He  are  contracted  to 
[2s]  and  [4s],  respectively.  The  basis  sets  are  tested  by  performing  self-consistent-field  (SCF)  geometry 
optimizations  on  LiH.  BCH2,  BjH^,  CH4,  NH3,  H2O.  and  HF  The  equilibnum  geometries  of  hydrogen 
peroxide  and  hydrazine  are  determined  at  both  the  scF  and  fourth-order  many-body  perturbation  theory 
level.  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

Most  contemporary  quantum  chemical  calculations  use  basis  sets  of  Gaussian- 
type  functions  (GTF).  There  are  many  good  reviews  and  compendia  [1-8]  of  GTF 
basis  sets.  Dunning’s  double  zeta  [4s2p/2s]  contractions  [9]  of  Huzinaga’s  varia- 
tionally  optimized  (9s5p/4s)  GTF  sets  [10]  are  among  the  most  popular  basis  sets 
for  hydrogen  and  the  first  row  atoms  from  B  through  F.  Huzinaga’s  basis  sets  for 
H  to  Ne  were  variationally  reoptimized  and  larger  sets  for  Li  to  Ne  were  constructed 
[11]  and  contracted  [12]  by  van  Duijneveldt;  these  have  also  been  widely  used. 

Recently,  Koga  and  Thakkar  [  1 3]  obtained  further  variational  improvements  in 
(9s5p)  and  ( 1 2s7p)  gtf  sets  for  Li  to  Ne,  and  {4s)  and  (6s)  sets  for  H  and  He.  Their 
(9s5p)  basis  sets  [  1 3]  for  Li  to  Ne  are  in  complete  agreement  with  the  very  recent 
work  of  Schafer  et  al.  [14],  who  Independently  reoptimized  Huzinaga’s  (9s5p)  sets. 
Koga  and  Thakkar’s  basis  sets  for  Li  to  Ne  yield  atomic  energies  lower  than  those 
of  van  Duijneveldt  [1 1]  by  amounts  ranging  from  6.1  X  lO'*  to  1.5  X  10'”  £h- 
These  energy  improvements  are  rather  small  in  the  context  of  molecular  calculations. 
However,  the  reoptimized  exponents  of  the  tight  GTF  differ  by  as  much  as  25% 
from  the  previous  ones  [11],  and  this  may  be  of  some  significance  for  properties 
other  than  the  energy.  We  report  double  and  quadruple  zeta  contractions  of  Koga 

International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  27,  343-354  (1993) 

©  1993  John  Wiley  &  Sons,  Inc.  CCC  0020-7608/93/010343-12 


344 


THAKKAR  ET  AL. 


Table  I.  Atomic  energies,  with  signs  reversed,  from  various  GTE  basis  sets 


HD“ 

vd'’ 

BO' 

BF 

b2' 

(9s5p)'’ 

Li  ('P) 

7.3647293 

7.3647293 

7.3647293 

7.3647293 

Be  (’P) 

14.509918 

14.510791 

14.510812 

14.510815 

B(2P) 

24.526415 

24.52687 

24.526304 

24.527471 

24.527527 

24.527546 

C('P) 

37.684508 

37.68519 

37.684687 

37.685561 

37.685665 

37.685700 

N(*S) 

54  394392 

54.39535 

54.395093 

54.395655 

54.395829 

54.395885 

0(’P) 

74  798837 

74  80040 

74  800333 

74.800635 

74.800912 

74.800995 

F^P) 

99.393300 

99.39557 

99.395769 

99.395937 

99.396285 

99.396402 

Ne  CS) 

128.52711 

128.52752 

128.52793 

128.52811 

128.52827 

*  HD  is  Dunning's  (4s2p)  contraction  (9)  of  Huzinaga's  (9s5p)  set  ( 10). 

■’  VD  is  (9s5p)/(4s2p]  set  of  van  Duijncvcidt  (11.12). 

'  bO.  Bl,  and  b2,  respectivelj.  are  [4s2p]  contractions  of  Koga  and  Thakkar's  (9s5p)  set  [13]  using  0. 
1,  and  2  repeated  pnmitives 

(9s5p)  is  the  uncontracted  set  from  Ref  (13). 


and  Thakkar’s  basis  sets  [13]  for  use  in  molecular  calculations.  The  contraction 
procedure  is  described  in  the  next  section,  and  the  basis  sets  are  tested  in  the  third 
section.  Hartree  atomic  units  are  used  except  where  explicitly  stated  otherwise. 

Contraction  of  the  Basis  Sets 

Basis  sets  of  contracted  Gaussian  type  functions  (CGTFs)  can  be  of  either  the 
segmented  [9]  or  generalized  [  1 5]  type.  In  the  elegant  general  contraction  scheme 
of  Raffenetti  [  1 5].  the  CG  ffs  can  be  constructed  from  atomic  calculations  in  the 
parent  uncontracted  GTF  basis  set  without  any  further  complications.  However,  a 
“primitive"  GTF  will  generally  appear  in  many  cgtfs.  and  this  will  lead  to  many 


Table  II.  Atomic  energies,  with  signs  reversed,  from  various  GTF  basis  sets. 


CO* 

Cl* 

(12s7p)'’ 

NHF' 

Li  ('P) 

7.3650363 

7.3650363 

7.3650363 

7.3650697 

Be(’P) 

14.511439 

14.511443 

14  51 1444 

14.511502 

B('P) 

24.528939 

24.528953 

24.528953 

24.529061 

C('P) 

37.688397 

37.688422 

37.688423 

37.688619 

N{’S) 

54.400561 

54.400602 

54.400603 

54.400934 

0(’P) 

74.808783 

74.808842 

74.808846 

74.809398 

F(^P) 

99.408404 

99.408490 

99.408492 

99.409349 

Ne  CS) 

128.54573 

128.54584 

128.54584 

128.54710 

*  cO  and  cl,  respectively,  are  [7s4p)  contractions  of  Koga  and  Thakkar's 
( 1 2s7p)  set  [  1 3]  using  0  and  I  repeated  primitives. 

'’(I2s7p)  is  the  uncontracted  set  from  Ref.  [13] 

'  NHF  is  the  Hartree-Fock  limit  from  Ref  [21 J. 
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Table  Ill.  Exponents  («)  and  contraction  coefficients  (c)  of  the 
(4s)/[2s]  CGTF  basis  sets  for  H  and  He.* 


H 

He 

(t 

c 

« 

c 

13.011 

0.033484 

38.355 

0  040183 

1.9623 

0.234719 

5.7689 

0.261395 

0.44454 

0.813774 

1.2399 

0.793038 

0.12195 

1.000000 

0.29758 

1.000000 

’  Dashed  lines  separate  the  contracted  functions  A  scale  factor 
of  1.2  should  be  used  for  H  in  molecular  calculations  by  multi¬ 
plying  the  «  by  1.44. 


unnecessar>  integral  evaluations  unless  the  integral  program  specifically  takes  this 
into  account.  Unfortunately,  many  popular  integral  programs  do  not  have  the 
capability  to  use  generalized  contractions  efficiently. 

The  alternative  is  segmented  contraction  [9]  in  which  primitive  GTFs.  with  the 
possible  exception  of  one  or  two  crucial  ones,  appear  in  only  one  CGTF.  Ideally, 
one  should  variationally  optimize  the  contraction  coefficients,  the  exponents,  and 
the  grouping  pattern.  We  are  not  aware  of  any  such  work.  Segmented  CGTF  basis 
sets  with  both  exponents  and  contraction  coefficients  variationally  optimized  for  a 
predetermined  contraction  pattern  can  be  found  in  Refs.  [14.16].  for  example. 
More  common  are  segmented  basis  sets  in  which  the  contraction  pattern  (splitting) 


Table  IV.  Exponents  («)  and  contraclion  coefficients  (c)  of 
the  (6s)/(4sj  CGTF  basis  sets  for  H  and  He.* 


H  He 


a 

c 

c 

82.921 

12.452 

2.8330 

0.022940 

0 175491 

0  864756 

234.06 

35.174 

7.9911 

0.024379 

0.184036 

0.857623 

0.80001 

1.000000 

2.2124 

1.000000 

0.25859 

1  000000 

0.66707 

1.000000 

0.089969 

1.000000 

0.20895 

1.000000 

•  Dashed  lines  separate  the  contracted  funaions.  No  scaling 
of  the  H  basis  set  is  recommended  for  molecular  calculations. 


346 


THAKKAR  ET  AL 


Table  V  Exponents  («)  and  contraction  coefficients  (c)  of  the  (9s5p)/(4s2p]  cgtf  basis  sets  for 

Li.  Be.  B.  and  C* 


Li 

Be 

B 

C 

« 

< 

Cl 

c 

a 

4 

o 

c 

s  1,4990(4  3) 

0  001042 

2  9.330(43) 

0.001046 

4  7108(43) 

0.001133 

6  7799(43) 

0  001222 

2.2491(42) 

0  008024 

4.4002(42) 

0.008066 

7.0674(42) 

0.008740 

1  0172(43) 

0.009430 

5.1184(41) 

0040590 

1.0015(42) 

0041001 

1.6088(42) 

0.044488 

2  .3157(42) 

0.048023 

1.4453(41) 

0  150484 

2.8.304(41) 

0.154551 

4  5518(41) 

0.1685.30 

6  5547(41) 

0.182197 

4  6303 

0  .393278 

9.1146 

0.416999 

1.4724(41) 

0.458573 

2  1253(41) 

0.496064 

1  5900 

0.5294.39 

3  1694 

0.499759 

5.1823 

0440665 

7  5339 

0  385105 

1  5900 

0  217740 

3  1694 

0.316393 

5.182.3 

0.409712 

7  5339 

0.471311 

5.6547(-l) 

0  812720 

1  14.39 

0  721799 

1  9068 

0  632099 

2  8031 

0.571296 

7.34l0(-2) 

1  000000 

1  8648(-l) 

1  000000 

3.3302(-l) 

1.000000 

52151(-1) 

1.000000 

2  8042(-2) 

1  OOOIKX) 

6  59l2(-2) 

1.000000 

I0434(-1) 

1.000000 

1.5957(-1) 

1.000000 

p  .3.2665 

0  012169 

7  4498 

0.014399 

1.2053(41) 

0.017293 

1  87.34(41) 

0.018170 

65II9(-I) 

0  067015 

1.5796 

0  084345 

2  6120 

0  105484 

4  1362 

0.112687 

1  696.3(-l) 

0.2958.30 

4  .3604(-l) 

0.332582 

7  4689(-l) 

0  365660 

1  2004 

0  376170 

5  5778(-2) 

0  745417 

l.44.36(-l) 

0.701899 

2  3873(-l) 

0.66.3256 

3  8346(-l) 

0.648667 

2  0500(-2) 

1  000000 

5()l95(-2) 

1.000000 

7  7218(-2) 

1.000000 

1  2129(-1) 

1  OOOOOO 

*  Dashed  lines  separate  the  contracted  functions  /((±n)  means  .4  x  10'". 


is  optimized  by  trial  and  error,  and  the  contraction  coefficients  are  either  taken 
from  atomic  calculations  in  the  parent  set  as  in  Ref.  [9],  or  variationally  optimized 
as  in  Ref.  [  1 7]. 

In  this  work,  we  examined  up  to  1 1  contraction  patterns  for  each  basis  set,  and 
the  use  of  0,  1,  and  2  primitive  GTFs  appearing  in  two  cgtfs.  For  each  pattern, 
the  contraction  coefficients  were  variationally  optimized  to  minimize  the  spin-re¬ 
stricted  Roothaan-Hartree-Fock  (RHF)  atomic  energy  [  1 8].  The  contraction  pattern 
leading  to  the  lowest  energy  was  chosen  with  no  altempi  to  impose  a  uniform 
pattern  for  all  first-row  atoms.  All  atomic  self-consistent-field  (scf)  calculations 
were  done  with  the  atom  program  [19]  modified  to  allow  the  optimization  of 
contraction  coefficients  by  the  conjugate  direction  algorithm  of  Powell  [20]. 

The  lightest  atoms  pose  a  few  special  problems.  Hydrogen  and  helium  have  only 
one  occupied  atomic  orbital  (AO),  and  hence  contraction  imposes  no  penalty  on 
the  atomic  rhf  energy.  Thus  the  contraction  coefficients  are  simply  taken  from 
uncontracted  calculations  and  renormalized.  The  contraction  pattern  is  chosen  so 
that  the  tightest  gtfs  are  grouped  in  one  CGTF  and  all  the  other  primitive  GTFs  are 
left  unconiracted.  This  procedure  is.  of  course,  equivalent  to  a  generalized  con¬ 
traction  scheme  for  H  and  He. 

Lithium  and  beryllium  have  only  s-type  AOs  occupied  in  their  ground  states  but 
a  basis  set  without  p-type  GTF  would  be  useless  for  meaningful  molecular  calcu¬ 
lations.  This  problem  is  solved  for  Be  by  contracting  wave  functions  [  1 3]  for  the 
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Table  VI.  Exponents  (a)  and  contraction  coefficients  (c)  of  the  (9s5p)/[4s2p]  cgtf  basis  sets  for 

N,  O,  F,  and  Ne.* 


N 

0 

F 

Ne 

a 

c 

a 

c 

a 

c 

a 

C 

s  9.1556(+3) 

0001327 

1.1852(+4) 

0001445 

1.4846(+4) 

0.001915 

1.8135(4-4) 

0.001966 

l.3736(+3) 

0.010240 

1.7782(+3) 

0.01 1 147 

2.2274(+3) 

0.014770 

2.7210(4-3) 

0015165 

3.1273(+2) 

0.052146 

4  0486(+2) 

0  056764 

5.0714(4-2) 

0.075268 

6  1951(4-2) 

0.OV251 

8.8547(+l) 

0  197926 

1  1466(+2) 

0.215473 

1  4364(4-2) 

0.285328 

1.7548(4-2) 

0  292825 

2.8756(+l) 

0.538041 

3  7279(+l) 

0.584655 

4  6740(4-1) 

0.710267 

5  7133(4-1) 

0.701811 

l.0246(+l) 

0.320789 

1.3334(+I) 

0  246860 

4.6740(4-1) 

0.032103 

5.7133(-U) 

0.045089 

1.0246(+I) 

0.524498 

l.3334(+l) 

0.569590 

1.6766(4-1) 

0.634574 

2.0538(4-1) 

0.633958 

3.8443 

0.517846 

5  0385 

0.471791 

6  3699 

0  37901 1 

7.8363 

0.368695 

7.4650(-l) 

1.000000 

1.0136 

1.000000 

1  3164 

1.000000 

1.6548 

1.000000 

2.2475{-l) 

1.000000 

3.0250(-l) 

1.000000 

3.9009(-l) 

1.000000 

4.8755(-l) 

1.000000 

p  2.6667{+l) 

0.018796 

3  4493(+l) 

0.020236 

4  3907(4-1) 

0.021053 

5.475?'4-l) 

0.021555 

5.9557 

0.117689 

7.7562 

0.126799 

9  9298 

0.132049 

1.2440(4-1) 

0.135355 

1.7440 

0.383034 

2.2820 

0.394680 

2  9313 

0.400866 

3.6819 

0.-.04588 

5  5629(-l) 

0638961 

7.169l(-l) 

0.624014 

9  1433(-1) 

0.615513 

1.1444 

0.6101  II 

1.73I5(-1) 

1.000000 

2  I46l(-I) 

1000000 

2  6783(-l) 

1  000000 

T 

00 

© 

1  000000 

*  Dashed  lines  separate  the  contracted  functions.  A(±n)  means  A  X  10’". 


Is^2s2p  (’P)  State.  The  problem  is  a  bit  more  subtle  for  Li  because  the  excited  ls^2p 
(^P)  state  has  no  occupied  valence  s-type  AO.  In  previous  work,  such  as  Ref.  [  1  ], 
the  s-GTFs  were  taken  from  the  ground  ls^2s  (^S)  state  and  the  p-GTFs  from  the  ^P 
state.  We  follow  a  similar  method.  The  exponents  of  the  s-  and  p-GTF  were  taken 
from  the  and  ^P  states,  respectively,  and  the  contraction  coefficients  and  pattern 
of  the  resulting  basis  set  were  optimized  for  the  ^P  state. 

Results  and  Discussion 


Atomic  Energies 

Consider  the  (9s5p)/[4s2p]  double  zeta  sets  first.  Table  I  compares  atomic  rhf 
energies  predicted  by  the  Huzinaga-Dunning  (HD)  basis  sets  [9,10],  the  van  Duij- 
neveldt  (  vd)  basis  sets  [11,12],  and  our  bO,  B  1  and  b2  basis  sets  which  are  contractions 
of  the  Koga-Thakkar  sets  [13]  allowing  0,  1,  and  2  primitive  GTFs  to  occur  in  two 
different  CGTFs.  Despite  the  fact  that  the  uncontracted  basis  sets  of  Koga  and  Thak- 
kar  always  predict  lower  energies  than  the  uncontracted  sets  of  Huzinaga  [10]  and 
van  Duijneveldt  [11],  the  bO  sets  do  not  always  yield  the  lowest  atomic  energy.  The 
HD  set  for  B  and  the  vd  sets  for  B,  C,  N,  O,  and  F  predict  lower  energies  than  bO. 
Perhaps  this  is  because  the  lower  energy  in  the  uncontracted  sets  of  Ref.  [13]  is 
achieved  by  more  efficient  sharing  of  the  primitive  GTFs  between  aos.  The  Bl  sets 
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Table  VII.  Exponents  (i»)  and  contraction  coefficients  (c)  of  the  (12s7p)/[7s4p]  cgtf  basis  sets 

for  Li,  Be,  B.  and  C.‘ 


Li 

Be 

B 

C 

a 

c 

cx 

c 

a 

c 

(X 

c 

s  7  2056(+3) 

0  001377 

1  4726(+4) 

0.000649 

2.383 1(+4) 

0.000677 

3  5088(+4) 

0.000670 

l.0797(+3) 

0  010690 

2  2064(+3) 

0.005039 

3  5706(+3) 

0.005258 

5.2574(+3) 

0.005202 

2.4575(+2) 

0  055804 

5  0211(+2) 

0.026338 

8  1263(+2) 

0.027495 

1.1965(+3) 

0.027207 

6.959I(  +  1) 

0  229711 

1  4216{+2) 

0.108941 

2.3012(+2) 

0  113855 

3.3884(+2) 

0.112759 

2.2677(  +  l) 

0  774239 

4.6333(+l) 

0  372164 

7.5026(+l) 

0.390462 

1.1048(+2) 

0.387704 

8.1342 

3.1007 

0.399797 

0.638965 

1.665.3(+1) 

0.584765 

2.700 1  (+1) 

0  561959 

3.9785(+l) 

0.565654 

1  6653(+l) 

0.175059 

2.7001(+1) 

0.190582 

3  9785(+l) 

0.188092 

3.1007 

1  2284 

0  136977 

0  880219 

6  3909 

0.847351 

1  0405(+l) 

0.833209 

1.5366(+1) 

0.835353 

2.5611 

1.000000 

4  2062 

1.000000 

6.2406 

1.000000 

4  9392(-l) 

1  000000 

1  0431 

1.000000 

I  7351 

1  000000 

2,5893 

1.000000 

9  305I(-2) 

1  000000 

2.638l(-l) 

1.000000 

4.7134(-1) 

1.000000 

7.4105(-1) 

1.000000 

4.5938{-2) 

1.000000 

1.1902(-1) 

1.000000 

1.90I6(-1) 

1.000000 

2.9334(-l) 

1.000000 

2.1487(-2) 

1  000000 

4  9993(-2) 

1.000000 

7  4118(-2) 

1000000 

I  1187(-1) 

1.000000 

p  1  1280(  +  I) 

0  009365 

2  5176(+1) 

0.008491 

3  9999(  +  l) 

0  008421 

6  1636(+1) 

0.008244 

2  5963 

0  059879 

5.8681 

0.058808 

9  3406 

0  059974 

1  4444(+l) 

0.060288 

7  4764(-l) 

0  226884 

1.7584 

0.238068 

2.8358 

0.252929 

4.4412 

0.258479 

2.4245(-l) 

0  807339 

5.9700(-1) 

0.793234 

9.9115(-1) 

0.776731 

1  5716 

0.770226 

9.0648(-2) 

1.000000 

2.2804(-l) 

1.000000 

3.760C(-1) 

1  000000 

5  9804(-I) 

1  000000 

3.7362(-2) 

1.000000 

9.2477(-2) 

1  000000 

1.4581(-1) 

1.000000 

2.3019(-I) 

1.000000 

1.6037(-2) 

1.000000 

3  7652(-2) 

1. 000000 

5.6094(-2) 

1.000000 

8.6596(-2) 

1  000000 

*  Dashed  lines  separate  the  contracted  functions  A(±n)  means  A  X  10*". 


with  one  duplicated  primitive  give  consistently  lower  energies  than  the  HD,  VD, 
and  bO  sets.  The  Bl  energies  are  higher  than  the  uncontracted  ones  by  amount" 
ranging  between  0.02  and  0.47  mhartrees  The  b2  energies  are  lower  still.  We  think 
the  B 1  sets  are  a  good  compromise  and  choose  them  to  be  our  double  zeta  sets. 

Consider  the  “quadruple”  zeta  sets  next.  Table  II  compares  atomic  RHF  energies 
predicted  by  the  cO  and  Cl  basis  sets,  which  are  our  [7s4p]  contractions  of  the 
Koga-Thakkar  sets  [  1 3]  allowing  0  and  1  primitive  GTE  to  occur  in  two  different 
CGTFs,  with  the  uncontracted  (12s7p)  set  [13]  and  the  numerical  Hartree-Fock 
limits  [2 1  ].  The  cl  energies  are  within  1  ^thartree  of  the  uncontracted  ones,  whereas 
the  cO  energies  can  be  as  much  as  0. 1 1  mhartrees  above  the  uncontracted  ones. 
The  c  1  sets  are  our  chosen  quadruple  zeta  sets. 
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Table  VIII.  Exponents  (<»)  and  contraction  coefficients  (c)  of  the  (I2s7p)/[7s4p]  CGTF  basis  sets 

for  N.  O,  F.  and  Ne  “ 


N 

0 

F 

Nc 

« 

t 

rt 

c 

a 

c 

a 

c 

s  4.8335(+4) 

0.001342 

6.36551+4) 

0.000659 

8.09551+4) 

0.001338 

1.00241+5) 

0.000652 

7.242 1(+3) 

O.OI04I9 

9.53761+3) 

0.005115 

1.21301+4) 

0.010395 

1.50201+4) 

0  005062 

1.6482(+3) 

0.054507 

2.17071+3) 

0  026762 

2.76061+3) 

0.054388 

3.41841+3) 

0.026486 

4.6676(+2) 

0.226001 

6.14721+2) 

O.1 11012 

7.81771+2) 

0.225664 

9.68061+2) 

0.109907 

I.5220(+2) 

0.778370 

2.00451+2) 

0.382771 

2.54931+2) 

0  778747 

3.15691+2) 

0.379532 

5.4828(+I) 

0.370936 

7.22301+1) 

0.572099 

9.18801+1) 

0.372275 

1.13791+2) 

0  576290 

2.1209(+1) 

0.665560 

7.22301+1) 

0.183965 

3.56121+1) 

0.664141 

1.13791+2) 

0  181356 

2.1209(+1) 

0  106526 

2.79721+1) 

0.838955 

3.56121+1) 

0.111996 

4.41341  +  1) 

0  841240 

8.6404 

0  906538 

1.14231+1) 

1.000000 

1.45681+1) 

0.901547 

1.80791  +  1) 

1.000000 

3.5990 

1.000000 

4.7730 

1.000000 

6.1013 

1.000000 

7.5847 

1.000000 

1.0622 

1.000000 

1.4400 

1.000000 

1.8691 

1. 000000 

2.3498 

1.000000 

4.1529(-l) 

1.000000 

5.59841-1) 

1.000000 

7.23101-1) 

1.000000 

9.05311-1) 

1.000000 

I.5609(-l) 

1.000000 

2.08071-1) 

1.000000 

2.66491-1) 

1  000000 

3.31481-1) 

1  000000 

p  8.7205(+l) 

0.008156 

1.11541+2) 

0.008325 

1.40991+2) 

0  008401 

1.74991+2) 

0  008433 

2.0478(+I) 

0  060642 

2.62061+1) 

0.062300 

3315l(  +  l) 

0.063288 

4.11801  +  1) 

0.063903 

6.3463 

0.262389 

8.1556 

0.269698 

1.03561+1) 

0.274073 

1.29051+1) 

0.276881 

2.2624 

0.765702 

2.9196 

0.757895 

37182 

0  753122 

4.b434 

0.750009 

8.6I9I(-1) 

1.000000 

1.1047 

1.000000 

1.4022 

1.000000 

1.7483 

1.000000 

3.2978(-l) 

1. 000000 

4.13681-1) 

1.000000 

5.19321-1) 

1.000000 

6.43471-1) 

1.000000 

I.2205(-I) 

1.000000 

1.48101-1) 

1.000000 

1.82251-1) 

1.000000 

2.22941-1) 

1. 000000 

•  Dashed  lines  separate  the  contracted  functions.  /i(±n)  means  /I  X  10'" 


Molecular  Tests 

The  double  and  quadruple  zeta  basis  sets  for  H  and  He  are  listed  in  Tables  111 
and  IV.  The  double  zeta  (DZ)  sets  for  the  first  row  atoms  are  given  in  Tables  V  and 
VI  whereas  the  quadruple  zeta  (QZ)  sets  are  in  Tables  VII  and  VIII.  Note  that  the 
contraction  pattern  is  not  uniform  across  the  first  row.  Following  Dunning  [9],  we 
recommend  that  in  molecular  calculations  the  double  zeta  set  for  H  be  scaled  by 
factor  of  1.2,  that  is,  the  exponents  for  H  in  Table  III  be  multiplied  by  (1.2)*  = 
1 .44.  The  quadruple  zeta  set  for  H  is  sufficiently  flexible  that  we  recommend  it  not 
be  scaled. 
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Tabi  e  IX.  Exponents  of  polanzation  functions;  p-CJTP  for  H. 
sphencal  harmonic  (tivc-membered)  d-GTF  for  first-row  atoms 


Pa 

Pb 

Pc 

2P 

H 

1  0 

1.0 

0.727 

(1.05.0  35) 

Li 

0.18 

0.18 

Be 

0.32 

0.32 

B 

0.70 

0.388 

0.343 

(0.187.  0.822) 

C 

0  75 

0.600 

0.550 

(0.288,  1.335) 

N 

0  80 

0.864 

0.817 

(0.412.  1.986) 

0 

0.85 

1.154 

I.I8S 

(0.535.  2.704) 

F 

0.90 

1.496 

1.640 

(0  682.  3  559) 

Ne 

1.888 

2.202 

(0.852.  4  550) 

Polarization  functions  are  needed  in  molecular  calculations.  Dunning  and  Hay 

[1]  recommended  exponents  for  a  p-GTF  in  H.  and  d-GTF  in  B  to  F.  and  d-GTF 
exponents  for  Li  and  Be  can  be  obtained  from  a  formula  of  Ahlrichs  and  Taylor 

[2] .  This  set  of  polarization  functions  is  listed  as  Pa  in  Table  IX.  A  different  set  of 
d-GTF  for  B  to  Ne  [3],  and  the  same  polarization  functions  for  H,  Li,  and  Be  are 
listed  in  Table  IX  as  Pb.  Dunning's  [22]  correlation  consistent  polarization  functions, 
listed  as  Pc.  are  quite  close  to  the  Pb  set.  Any  one  of  Pa,  Ph.  and  Pc  could  be  used 
with  the  DZ  sets.  The  QZ  sets  can  be  supplemented  by  a  pair  of  p-GTF  for  H,  and 
two  sets  of  d-GTF  for  B  to  Ne  [3]  as  listed  in  Table  IX  under  2P.  or  by  the  (2d  If) 
or  (3d2flg)  correlation  consistent  sets  of  Dunning  [22]. 

As  a  test,  self-consistent-field  (SCF)  geometry  optimizations  were  performed  with 
Gaussian  90  [23]  for  seven  molecules:  LiH.  HF,  BeH^  in  D_.j,  symmetry,  CH4  in 
T,/ symmetry,  NH3  in  C3,  symmetry,  H2O  in  Ci,  symmetry,  and  BiHf,  in  D21,  sym¬ 
metry.  The  basis  sets  used  are  the  Huzmaga-Dunning  (Hd)  [9,10]  double  zeta  sets 
and  our  DZ  and  QZ  sets,  with  and  without  polarization  functions.  There  are  no  HD 
double  zeta  sets  for  Li  and  Be,  and  the  Dunning-Hay  (9s5p)/[3s2p]  split  valence 
sets  [1]  were  used  instead. 

In  all  cases,  the  molecular  SCF  energies  (not  tabulated)  at  the  equilibrium  ge¬ 
ometry.  both  experimental  and  predicted,  obeyed  the  following  ordering;  HD  >  DZ  > 
QZ  >  HD  +  Pa  >  DZ-\-  Pa  >  QZ  +  Ph>  Q2.  +  2P.  Table  X  compares  the  resulting 
geometries  and  dipole  moments  with  experiment  [24-29],  Our  DZ  sets  give  essen¬ 
tially  the  same  results  as  the  standard  HD  sets,  although  in  most  cases  the  DZ  results 
are  slightly  closer  to  the  largest  basis  results  than  the  HD  ones.  As  expected,  polar¬ 
ization  functions  affect  dipole  moments  and  bond  angles  much  more  than  bond 
lengths.  In  most  cases,  the  QZ  +  2P  basis  set  predicts  dipole  moments  noticeably 
closer  to  the  experimental  ones  than  either  the  hd  +  Pa  or  DZ  -t-  Pa  sets. 

Finally,  as  nontrivial  tests  of  these  basis  sets,  we  consider  the  equilibrium  ge¬ 
ometries  of  hydrogen  peroxide  (HiOi)  and  hydrazine  (N2H4)  assuming  C2  symmetry. 
Both  require  the  determination  of  dihedral  angles  that  are  very  sensitive  to  details 
of  the  basis  set,  as  Table  XI  shows  clearly.  Geometry  optimizations  were  also  earned 
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Table  X.  SCE  equilibrium  geometries  and  dipole  moments  n  for  HF,  LiH,  CH4,  BeH2,  H2O,  NHj, 

and  B2H6.* 


HF 

LiH 

CH, 

'■(CH) 

BeHj 

r(BeH) 

r(HF) 

m(HF) 

r(LiH) 

MLiH) 

HD 

09196 

2183 

1.6118 

5.998 

1.0834 

1.3359 

DZ 

09218 

2191 

1.63.12 

6.034 

1.0829 

1.3366 

oz 

0.9174 

2171 

1  6061 

6.035 

1.0810 

1.3306 

HD  +  Pa 

0.90.12 

2.027 

1.6236 

5.949 

1  0846 

1.3357 

DZ  E  Pa 

0  90.17 

2.010 

1.6265 

5.984 

1.0844 

1.3358 

oz  +  Ph 

0.8985 

2  068 

1.6061 

6.011 

1.0831 

1.3308 

QZ  +  2p 

0.9002 

1.927 

— 

— 

1.0819 

— 

E\pt 

0.9168 

1  801 

1.5957 

5.829 

1.085 

— 

H2O 

NH3 

r(OH) 

ZHOH 

r(NH) 

Z.HNH 

MNH3) 

HD 

0  9514 

11.2.51 

2.5.10 

0.9944 

116  28 

1..1p8 

DZ 

0.9519 

112.49 

2.526 

116.26 

1  3ti7 

OZ 

0,9502 

112.10 

2.553 

0.9935 

115.15 

1  429 

HD  +  Pa 

0.9419 

106  71 

1177 

I.OOII 

108  18 

1.809 

uZ  +  Pa 

0.9418 

106.72 

2.174 

1.0008 

108  0^5 

1,803 

oz  +  Ph 

0.9410 

106  91 

2  246 

0.9987 

108  PS 

1.743 

02  +  2p 

0.9414 

106.18 

2.010 

0.9990 

10‘  SS 

1.612 

Expt 

0.9575 

104.51 

1.847 

1.0124 

K'p  P* 

1.47 

B.H, 

r(BB) 

r(BH,) 

r(BH*' 

Z.H,BH, 

HD 

1.8301 

1.1826 

I..1402 

123.00 

DZ 

1.8209 

1.1831 

1.3358 

122  62 

02 

1.8001 

1.1802 

1.32:3 

122.10 

HD  +  Pa 

•'.7944 

1.1865 

I..12S‘t 

122.48 

DZ  +  Pa 

1.7901 

1.1866 

122.28 

OZ  +  Ph 

I.- 309 

1.1839 

1 1:09 

OZ  +  2P 

l.7'71 

1.1833 

1.3  ISO 

121.99 

Expt 

1.743 

1.184 

1 3:4 

121.5 

•  Bond  lengths  (r)  are  in  A,  bond  angles  in  degrees,  and  dipole  moments  in  debyes  The  bndging  and 
terminal  hydrogens  in  B2H(,  are  denoted  H,,and  H,.  respectively  Evperimcntal  data  are  from  Refs.  [24- 
29). 


out  using  fourth-order  many-body  perturbation  theory  with  triple  substitutions 
neglected,  that  is,  SDQ-MP4  [301. 

First,  consider  hydrogen  peroxide.  Inclusion  of  polarization  functions  reduces 
the  dihedral  angle  by  30-45°!  A  previous  complete  MP4/HD  +  Pa  calculation 
[31]  gave  r(00)  =1.471  A,  r{OH)  =  0.970  A,  /LHOO  =-  99.3°,  and  ,3  =  1 16.3°  in 
reasonable  agreement  with  our  SDQ  -  mf4/dz  +  Pa  results.  An  MP2/6-3 1  lG(3d,2p) 
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Table  XI.  scf  and  sdq  -  mp4  geometnes  for  H2O2  and  N2H<.* 


H2O2 

r(00) 

r(OH) 

Z.HOO 

SCF/HD 

1.4438 

0.9564 

102.77 

144.3 

SCF/DZ 

1.4461 

0.9571 

102.72 

142.9 

SCF/QZ 

1.4396 

0.9542 

102.06 

163.1 

SCF/HD  +  Pa 

1.3922 

0.9467 

102.73 

112.5 

SCF/DZ  +  Pa 

1.3931 

0.9466 

102.69 

112.1 

SCF/QZ  +  Pb 

1.3888 

0.9431 

102.74 

118.2 

SCF/QZ  +  2P 

1.3966 

0.9431 

102.68 

111.2 

SDQ  -  MP4/DZ  + 

Pa 

1.4615 

0.9688 

99.74 

114.2 

SDQ  -  MP4/QZ  + 

2P 

1.4652 

0.9630 

99.77 

113.8 

Expt [33] 

1.4645 

0.965 

99.4 

111.8 

N2H4 

/■(NN) 

/■(NH.) 

r(NHb) 

Z.H.NHb 

AH.NN  zHbNN 

«H.NNH,) 

SCF/HD 

1.4064 

0.9952 

0.9984 

115.13 

113.01  116.67 

92.1 

SCF/DZ 

1.4044 

0.9947 

0.9978 

115.14 

113.08  116.72 

92.0 

SCF/QZ 

1.4099 

0.9933 

0.9967 

114.15 

112.07  115.77 

92.1 

SCF/HD  +  Pa 

1.4117 

0.9989 

1.0015 

108.94 

108.53  112.67 

89.8 

SCF/DZ  +  Pa 

1.4107 

0.9986 

1.0013 

108.87 

108.48  112.64 

89.7 

SCF/QZ  +  Pb 

1.4111 

0.9963 

0.9993 

109.25 

108.71  112.73 

90.2 

SCF/QZ  +  IP 

1.4166 

0.9957 

0.9984 

108.67 

108.24  112.17 

90.1 

SDQ  -  MP4/DZ  +  Pa 

1.4419 

1.0152 

1.0185 

106.99 

106.51  111.37 

89.6 

SDQ  -  MP4/QZ  +  2P 

1.4476 

1.0091 

1.0119 

107.44 

106.58  111.05 

90.9 

Expt  [35] 

1.449 

1.021 

1.021 

(106.6) 

106  112 

91 

*  Bond  lengths  are  in  A  and  angles  in  degrees.  B  is  a  dihedral  angle.  The  hydrogens  on  each  N  in 
hydrazine  are  nonequivalent  and  arc  denoted  by  H,  and  Ht,,  respectively. 


calculation  [32]  led  to  r(00)  =  1.450  A,  r(OH)  =  0.962  A,  2-HOO  =  99.5°,  and 
j3  =  111.1°.  Good  agreement  with  an  experimental  geometry  of  Koput  [33]  is  ob¬ 
tained  by  our  sdq  -  mp4/qz  +  2P  calculation:  The  bond  distances  agree  within 
0.002  A,  the  Z.HOO  to  0.4°,  and  the  dihedral  angle  to  2°. 

Next  consider  hydrazine.  All  calculations  show  that  the  two  hydrogens,  labelled 
Ha  and  Hb,  on  each  nitrogen  are  nonequivalent.  A  cisd(Q),  configuration  interaction 
with  all  single  and  double  substitutions  approximately  corrected  for  unlinked  clus¬ 
ters,  calculation  [34]  in  a  6  -  3lG*  basis  led  to  r(NN)  =  1.448  A,  r{NHa)  =  1.018 
A,  r(NHb)  =1.021  A,  zLHaNHb  =  106.64°,  AHaNN  =  105.94°,  2.HbNN  =  1 10.82°, 
and  /3(HaNNHb)  =  90.05°.  An  experimental  equilibrium  structure  is  not  known 
but  a  thermally  averaged  Vg  structure  has  been  determined  by  Kohata  et  al.  [35]. 
They  were  unable  to  match  the  nonequivalent  NH  bonds  with  the  corresponding 
angles  and  therefore  reported  an  average  r(NH).  Moreover,  they  fixed  2.HNH  at 
106.6°  from  the  asD{Q)  calculation  [34].  Our  sdq  -  mp4/qz  +  IP  calculation 
may  be  the  best  equilibrium  geometry  available  to  date. 
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We  hope  the  basis  sets  presented  in  this  paper  will  be  generally  useful  to  the 
quantum  chemistry  community. 
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Abstract 

The  kinetic  energy  component  is  shown  to  be  a  well-defined  quantity  in  the  divide-and-conquer  method. 
This  kinetic  energy  converges  to  the  conventional  Kohn-Sham  kinetic  energy  when  the  numbers  of  the 
“bufier  atoms”  for  subsystems  increase.  Numencal  results  from  s.tmple  calculations  confirm  that  the 
kinetic  energy  component  converges  to  the  Kohn-Sham  value  as  the  basis  set  increases,  fe  1993  John 
Wiley  &  Sons.  Inc 


Introduction 

Accurate  computation  of  the  kinetic  energy  is  essential  to  ah  initio  calculations. 
In  the  density-functional  formalism  the  common  practice  is  to  represent  the  kii.etic 
energy  in  terms  of  the  Kohn-Sham  orbitals.  This  representation  recovers  most  of 
the  kinetic  energy.  The  residue  of  the  kinetic  energy,  conventionally  denoted  by 
is  included  in  the  exchange-correlation  component  £„  of  the  Kohn-Sham 
energy.  The  result  is  in  principle  exact  [1-4].  The  introduction  of  the  auxiliary 
Kohn-Sham  orbitals  increases  the  size  dependence  of  the  density-functional  com¬ 
putations  while  alleviating  the  problem  of  the  unavailability  of  the  explicit  form  of 
the  kinetic  energy  density  functional.  The  Kohn-Sham  kinetic  energy  is  a  functional 
of  the  density,  as  is  the  exchange-correlation  energy  and  so  the  total  energy.  All 
ground-state  properties  can  be  calculated  accurately  as  long  as  the  density  is  com¬ 
puted  accurately. 

In  the  conventional  Kohn-Sham  scheme  the  global  Kohn-Sham  orbitals  are 
expanded  in  terms  of  a  proper  basis  set.  The  coefficients  of  the  expansion  are  obtained 
via  a  Rayleigh-Ritz  vanational  principle.  The  divide-and-conquer  method  solves 
the  Kohn-Sham  equation  differently  [5].  It  uses  the  density  as  its  basic  variational 
variable.  No  global  molecular  orbitals  are  involved.  This  reduces  the  size  dependence 
of  the  computation  from  N  ^  for  the  conventional  Kohn-Sham  method  to  N  ( or 
at  most  N^),  where  N  is  the  number  of  electrons  in  the  system.  The  kinetic  energy 
was  not  calculated  separately.  Instead  it  was  computed  together  with  a  potential 
term  as  a  whole.  Since  the  global  Kohn-Sham  orbitals  are  not  used  in  the  divide- 
and-conquer  method  the  kinetic  energy  has  to  be  redefined.  The  main  objective  of 
this  article  is  to  define  the  kinetic  eneigy  within  the  divide-and-conquer  formulation 
starting  from  a  recent  ah  inilio  construction  of  the  div'de-and-conquer  method 
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[6],  and  to  show  that  this  definition  conforms  to  the  conventional  Kohn-Sham 
definition. 


Theory 

The  Kohn-Sham  equation  is  the  operational  equation  in  density-functional 
computations.  For  an  electronic  system  it  reads 


with 


-  +  vCr) U7)  +  v,A7) 


i'i  =  ci\pi 


(1) 


p(7)  =  Z  «/l0/l^ 

/ 

(2) 

(3) 

II 

(4) 

vCf)  is  the  external  potential  acting  on  the  electronic  system;  and  ci  arc  the 
Kohn-Sham  orbitals  and  eigenvalues,  respectively.  The  total  energy  is 

/;-a  +  0[p)+  2^'  (5) 


where 


^  =  Z  n,c, 

I 


(6) 


Olp] 


j^r, 


I 


0,(7)-  u.,. (7) 


t/7  +  lCxi[f>] 


(7) 


The  last  term  in  Eq.  (S)  is  the  nuclear  interaction  energy. 

The  self-consistent  solution  of  Eqs.  (1)  to  (4)  involves  the  expansion  of  the 
0/(7)  as  a  linear  combination  of  basis  functions,  usually  atomic  orbitals,  or  other 
functions  such  as  Slater-type  or  Gaussian-type  functions  centered  on  atoms.  There 
exists  incfTicicncy  here.  When  the  density  or  total  energy  is  computed  not  every 
term  in  the  linear-combination-of-basis-function  summation  of  the  Kohn-Sham 
orbitals  contributes  because  of  the  exponential  decay  of  the  basis  functions.  Suppose 
that  one  uses  a  basis  set  with  functions  centered  at  one  point  one  would  expect 
very  bad  result  unless  very  big  basis  set  is  used.  One-center  functions  represent  the 
region  near  the  center  well,  and  the  region  far  away  from  the  center  poorly.  One 
way  to  improve  this  bad  rcprcscnlalion  is  of  course  to  use  the  basis  set  mentioned 
above,  that  is  the  one  with  functions  from  all  atoms  in  the  molecule.  Another  way 
to  achieve  this  improvement  is  to  divide  a  large  system  into  several  regions  such 
that  each  region  has  its  own  basis  set  with  functions  centered  only  within  or  near 
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the  region.  And  a  Kohn-Sham  equation  is  solved  for  each  region.  This  way  each 
region  has  been  represented  reasonably  accurately  by  a  set  of  equations  and  its 
associated  basis  set.  The  global  solution  is  obtained  by  assembling  solutions  from 
all  subsystems.  Weighting  functions  can  be  used  for  this  purpose.  The  weighting 
functions  are  designed  in  such  a  way  that  they  are  big  within  the  regions  they  are 
associated  with  and  small  away  from  the  regions.  This  is  exactly  the  philosophy 
behind  the  recently  developed  divide-and-conquer  method  [5,6].  The  advantage 
of  this  divide-and-conquer  approach  is  that  the  auxiliary  Kohn-Sham  orbitals  can 
be  completely  eliminated  and  hence  the  size  dependence  of  the  computations  is 
reduced,  without  any  toss  of  accuracy. 

Consider  a  molecular  system.  Each  subsystem  a  can  be  thought  of  as  consisting 
of  several  atoms.  Let  [p"(7)}  be  the  weighting  functions  mentioned  above,  pre¬ 
viously  called  partition  functions.  The  {;?"( r)}  satisfies  the  relations:  2„  /?"(7)  = 

1  and  p“(7)  ^  0.  Let  {<^7(7)}  be  the  basis  set  associated  with  the  subsystem  a. 
The  Kohn-Sham  equation  ( 1 )  can  be  solved  using  this  basis  set.  Denote  the  solution 
as  { ^f„(7),  } .  The  density  generated  is 

P"(7)=  2n,J^“(7)|2  (8) 

For  closed  shell  =  2.  The  global  density  can  be  composed  as  following 

P(7)  =  2  P"(7)p“(7)  =22  2fg(ey  -  cl)p"(7)\\Pl{7)\^  (9) 

a  <» 

where =  1/[1  +  exp(-/3x)]  is  the  fermi  function.  The  substitution  of  = 

2  with  a  fermi  function  comes  from  a  simple  technical  consideration  which  will 
manifest  itself  in  the  following  normalization  relation: 

N=  fp(7)^/7  =  2  2  2/e(eF-<)<KlF"(7)IK>  ( 10) 

«  I. 

cp  is  the  parameter  to  be  used  to  normalize  the  electron  density  as  given  by  the 
above  equation.  The  fermi  function  makes  this  normalization  process  possible, 
x)  approaches  a  step  function  as  0  becomes  bigger. 

To  compose  5  a  local  form  of  it  has  to  be  used 

<?“(7)=  2  (11) 

where  //  is  the  Kohn-Sham  Hamiltonian.  Another  form  of  this  local  quantity  is 
<S“(7)  =  2  ( 1  la) 

Subtle  differences  between  these  two  forms  exist.  They  are  the  same  if  the  basis  set 
used  is  complete.  The  global  <?  is 

=  2  Jp"(7)<?"(7)rf7  =  2  2  lM^>--Oc?XK\P‘'rr)\K}  (12) 
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or 

«?  =  2  2  f  P“(7)^,t"(7Wr.(7)  dr  ( 12a) 

a  Ig, 


Again  n,^  has  been  replaced  by  the  fermi  function.  Equation  ( 12)  is  the  formula 
used  in  the  previous  publications  [5-9]  and  will  be  used  here. 

Suppose  that  the  limit  -*■  oo  is  taken.  The  divide-and-conquer  formulation  as 
described  by  Eqs.  ( 8 )  to  ( 1 2 )  reduces  to  the  conventional  Kohn-Sham  formulation 
if  the  basis  set  { ( r ) }  covers  the  same  function  space  for  all  a.  The  divide-and- 
conquer  method  results  from  the  discriminate  truncations  of  complete  basis  sets 
in  different  subsystems  [6]. 

The  self-consistent  solution  of  the  divide-and-conquer  formulas  ( 1 ),( 3 )  through 
(5),  (9),  (10),  (12)  does  not  require  the  construction  of  the  global  Kohn-Sham 
orbitals.  Complete  Hamiltonian  and  overlap  matrices  are  noi  needed  here. 

A  local  kinetic  energy  can  be  defined  similar  to  the  definition  of  <?“(  r)  in  Eq. 

(11) 

f"(7)  =  2  (13) 

*a/a 

The  kinetic  energy  in  the  div'de-and-conquer  method  is 


xr)T‘‘{y)d7 


=  2  2  i:McF-erj(K\T\KXK\p'‘(^r)\n) 


An  definition  of  r“(  r )  analogous  to  the  definition  of  <?  "( r)  in  Eq.  (11a)  will  result 
in  the  total  kinetic  energy  definition 


7'  =  2  2  2/e(ef-0  f  dr  (14a) 

a  la 


These  two  definition  of  the  kinetic  energy  become  the  same  if  a  complete  basis  set 
is  used.  The  kinetic  energy  from  both  Eq.  ( 14)  and  ( 14a)  becomes  the  conventional 
Kohn-Sham  kinetic  energy  when  P  -*  ao  and  {<t>?(.r)}  covers  the  same  function 
space  as  the  Kohn-Sham  basis  set  does  for  all  a. 

Equation  (14)  is  used  in  the  following  computations  of  kinetic  energy  because 
of  its  numerical  simplicity.  Equation  ( 14)  is  compatible  with  Eq.  (12)  not  with 
Eq.(12a). 


Results  and  Discussion 

In  the  present  calculations  the  Xa  approximation  with  a  =  0.7  for  the  exchange- 
correlation  energy  functional  ExAp]  has  been  used.  The  smiidard  techniques  for 
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the  multicenter  three-dimensional  numerical  integrations  are  taken  from  the  lit¬ 
erature  [7,10].  The  partition  function  {/?“(  r)}  is  built  from  the  spherical  atomic 
densities:  p"(  r)  =  [po(17  -  ihag  [po(.\'r  -  75/,!)]^  The  basis  set 

consists  of  the  projected  and  optimized  atomic  orbitals  [7],  For  the  divide-and- 
conquer  calculation  two  ways  are  used  to  enlarge  the  subsystem  basis  sets'  to  add 
more  atomic  orbitals  from  atoms  within  the  subsystems  and  to  include  atomic 
orbitals  contributed  by  the  neighboring  atoms,  the  so-caded  “buffer  atoms”  [7], 
The  density-functional  programs  developed  in  Yang's  Laboratory,  modified  as  ap¬ 
propriate.  is  used  [5,7]. 

Sample  calculations  are  carried  out  for  benzene  and  a  tetrapeptide.  The  benzene 
geometry  used  is  D(,h  symmetry  with  dec  =  1.397  A  and  den  =  1.084  A.  The 
tetrapeptide  geometry  is  taken  from  Ref.  [7].  In  the  divide-and-conquer  calculation 
benzene  is  divided  into  six  subsystems,  each  consisting  of  a  CH  piece.  The  tetra¬ 
peptide  is  divided  into  13  pieces,  each  consisting  of  a  heavy  backbone  atom  and 
the  side  atoms  attached  to  it.  Numerical  results  are  reported  in  Table  1  and 
Table  II. 


Table  I.  Total  energy  and  kinetic  energy  component  from  both  the  Kohn- 
Sham  method  and  tne  divide-and-conquer  method  for  benzene,  nm.  .Ini’n'.  . 
stands  for  the  numcncal  atomic  basis  set  used-  m  »-typc  and  n  p-type  atomic 
orbitals  for  C  atom  and  m'  s-type  and  ;i'  p-iype  atomic  orbitals  for  H  atom  hn 
stands  for  that  mb  nearest  neighbors  have  been  used  as  “buffer  atoms  "  Atomic 
unit  IS  used. 


Basis 

C7H 

- 

E 

T 

o 

o 

II 

o 

II 

o 

o 

H  =  150 

21/10 

bO 

230080 

230.068 

224.303 

224.432 

bl 

228.268 

228.281 

">59 

225  253 

b2 

228.228 

228.256 

225.618 

225.627 

b3 

228  3.36 

228.3.36 

225.217 

225.217 

KS 

228  338 

225. 

221 

32/21 

bO 

229  067 

22^  121 

2.31  982 

2.32.0.30 

bl 

228  972 

228.976 

229  010 

229.027 

b2 

228.838 

228.840 

229  2.30 

229.254 

b3 

228.930 

228.930 

229.154 

229.154 

KS 

228.930 

229  21  1 

43l/3.'> 

bO 

229.5^8 

230.451 

2.30  5.3.3 

231  5,50 

bl 

229  026 

229.028 

228.328 

228.310 

b2 

228.911 

228.911 

229.106 

229.109 

b3 

228.994 

228.994 

229.041 

229.04.3 

KS 

228.994 

229  097 
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Table  H.  Total  energy  and  kinetic  energy  for  tetrapeptide.  0  = 
I SG.  And  atomic  unit  is  used,  fiuim  is  from  Harris-type  calculation 
and  i'scr  is  the  self-consistent  result.  See  Table  I  for  the  notations 
of  basis  set. 


Basis 

NCO/H 

~i’scF 

T 

21/10 

bO 

900.162 

899.484 

888.420 

bl 

896.205 

895.620 

894  240 

b2 

896.627 

896.105 

894.121 

b3 

896.656 

896.128 

893.773 

b4 

890.658 

896.129 

893.791 

b5 

896.658 

896.131 

893.759 

KS 

896.658 

896.129 

893.772 

Given  in  Table  I  are  the  total  electronic  energy  and  the  kinetic  energy  component 
from  both  the  divide-and-conquer  method  and  the  conventional  Kohn-Sham 
method  for  benzene.  The  results  show  that  both  the  total  energy  and  the  kinetic 
energy  component  converge  to  the  Kohn-Sham  (KS)  results  when  more  buffer 
atoms  are  included.  Similar  lesults  were  reported  for  the  total  energy  by  Yang  et 
al.  ( 5,7-9  ] .  Particularly  the  divide-and-conquer  method  gives  the  right  kinetic  energy 
component.  This  is  true  for  different  level  of  basis  sets.  The  same  conclusion  is 
reached  from  calculations  for  the  tetrapeptide  as  shown  in  Table  II.  To  see  the  error 
introduced  by  the  fermi  function  substitution  two  different  0  values  have  been  used 
for  the  benzene  calculations.  As  can  be  seen  from  Table  1  that  the  value  plays  a 
very  minor  role  here.  This  means  that  the  introduction  of  the  fermi  function  brings 
in  a  negligible  error  as  far  as  is  sufficiently  large  (see  also  ref  [5]).  Also  given  in 
Table  II  is  the  energy  from  Harris  type  calculations  [11],  The  same  convergence 
pattern  is  found  in  Enam5  as  in  Escf-  This  may  be  very  important  in  large  molecule 
calculation  since  the  Harris  type  calculation  is  much  easier  to  carry  out. 

The  conclusion  is  that  the  divide-and-conquer  method  can  give  the  correct  energy 
components  as  well  as  the  total  energy  though  their  convergence  may  be  slower 
than  that  of  the  total  energy. 
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Abstract 

An  approximate  numencal  method  of  solving  the  Generalized  Master  Equation  for  a  many-body 
problem  is  presented,  with  examples  of  its  application.  This  method  involves  the  construction  from  the 
full  Hamiltonian  (of  the  system  plus  the  “bath”)  of  a  set  of  unitary  Langevin  equations  that  combine 
deterministic  microcanonical,  stochastic  canonical  (heat  bath),  and  stochastic  nonthermal  dynamics  in 
a  single  time-integration  scheme.  If  implefnented  in  a  representation  that  captures  the  essential  physics 
and  repeatedly  run  from  a  given  initial  condition,  this  method  evaluates  stochastic  representatives  from 
the  actual  fiber  bundle  of  system  worldlines  that  flow  from  the  initial  condition  and,  hence,  iiumerically 
evaluates  the  path  integral.  ©  1993  John  Wiley  &  Sons,  Inc, 

1.  Introduction 

In  a  many-body  problem,  selecting  the  “right”  generalized  (functional)  coordi¬ 
nates  for  the  description  of  the  system  is  very  important.  A  good  choice  can  make 
a  problem  relatively  straightforward,  can  teach  us  about  the  key  physics  of  the 
problem,  and  can  even  make  our  approximations  and  omissions  less  important.  A 
bad  choice  can  confuse  the  physics  of  the  problem  or  lead  to  inconsistent  or  un¬ 
physical  results  even  while  it  appears  to  be  working  perfectly.  More  often  than  not, 
the  choice  of  representation  is  connected  to  the  method  of  solution  so  that  one 
more  or  less  determines  the  other,  although  there  are  some  representations  that 
admit  more  than  one  method  of  solution. 

Unfortunately,  there  is  no  prescription  for  selecting  the  best  functional  represen¬ 
tation  tor  a  problem — it  remains  more  art  than  science.  In  addition,  there  are  well- 
known  approaches  based  on  representations  that  poorly  describe  the  problem  but 
that  compensate  for  this  by  being  easy  to  evaluate  and  integrate.  These  methods 
can  be  counted  on  to  at  least  yield  an  answer  at  the  end  of  a  calculation  (although 
there  is  no  guarantee  that  the  answer  is  even  approximately  correct).  Given  the 
grant-funded  ecology  of  scientific  research,  there  is  thus  a  temptation  and  a  tendency 
to  repeatedly  study  a  given  problem  to  ever-higher  order  with  a  poorly  convergent 
representation  rather  than  invest  the  time  (and  take  the  risk)  of  studying  the  problem 
carefully  and  selecting  a  representation  that  may  (or  may  not)  be  numerically  or 
analytically  difficult  but  that  embraces  the  actual  physics  well  and  converges  in 
application  rapidly. 
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Some  problems,  of  course,  do  yield  to  a  heavy-handed  approach  at  modest  cost 
and  frequently  with  “standard”  codes  so  that  even  programming  time  is  minimized. 
For  these  problems,  art  has  become  science  (or  even  engineering)  and  we  do  not 
consider  them  further.  Other  problems,  however,  do  not,  and  these  “complex” 
problems  are  almost  by  definition  among  the  most  interesting  problems  available 
for  study.  Examples  of  complex  problems  are  studies  of  phase  transitions  and  critical 
phenomena  for  various  systems,  relaxation  phenomena,  reaction  dynamics,  and 
turbulence;  generally,  almost  any  system  that  exhibits  macroscopically  nonlinear 
behavior  ( static  or  dynamic,  in  open  or  closed  systems )  is  microscopically  complex, 
as  some  sort  of  fundamental  reorganization  of  the  microscopically  linear  system 
into  self-consistent  structures  is  associated  with  the  macroscopic  nonlinearity.  These 
structures  are  rarely  easily  understood  in  terms  of  simple,  linearized  models  of  their 
microscopic  components  or  modes. 

A  specific  example  of  the  importance  of  the  cor  ct  generalized  coordinates  is 
found  by  studying  the  role  of  vortices  or  defect  structures  in  general  in  Kosterlitz- 
Thouless  transitions  [1]  such  as  occur  in  the  2d /ID  (planar,  or  xy-model)  Hei¬ 
senberg  ferromagnet.  This  kind  of  phase  transition  is  characterized  by  the  unbinding 
of  defect  pairs  (spin  vortices  of  opposite  twist)  that  are  perturbatively  disconnected 
from  linearized  (spin  wave)  descriptions.  These  defects  are  not  just  simple  linear 
combinations  of  spin-wave  modes — they  have  an  identity,  a  structure,  and  an  ef¬ 
fective  interaction  all  their  own.  It  is  difficult,  if  not  impossible,  to  understand  the 
planar  ferromagnet  without  first  recognizing  that  these  defects  are  the  relevant 
structures  and  implementing  their  coordinatization  in  the  method  used. 

For  researchers  in  many  fields,  then,  it  is  useful  to  study  as  many  examples  of 
successful  and  unsuccessful  descriptions  of  complex  problems  as  possible,  so  that 
one’s  “bag  of  tricks”  may  contain  the  trick  that  one  needs  when  one  needs  it  and 
so  that  one  can  avoid  methods  unlikely  to  yield  a  correct  answer.  It  is  also  worthwhile 
to  understand  the  rationale  underlying  the  descriptions  that  do  work  so  that  we 
may  find  new  ones  and  even  eventually  reduce  the  art  of  discovering  such  a  de¬ 
scription  to  a  science  of  implementing  a  prescription. 

In  this  spirit,  therefore,  we  present  a  uniform  approach,  based  on  numerically 
solving  stochastic,  unitary,  integrodifferential  equations  that  has  worked  well  for 
us  in  certain  many-body  problems  in  the  past.  The  equations  solved  are  a  Langevin 
realization  of  the  Generalized  Master  Equation  ( GME )  *  or  path  integral  [  3  ]  for  the 
problem,  and  the  approach  itself  can  be  applied  to  a  wide  range  of  systems  and 
phenomena.  As  such,  it  combines  methods  of  microcanonical  “molecular  dynamics” 
(although  we  have  not  applied  it  to  molecular  problems)  with  methods  of  importance 
sampling  heat  bath  Monte  Carlo  based  on  Glauber  canonical  thermalization  “dy¬ 
namics”  [4.5]. 

To  solve  problems  in  quantum  optics  [6,7],  magnetism  [8],  and  even  lattice 
gauge  theory  [9],  we 


•  Foi  a  discussion  of  the  Generalized  Master  Equation  in  the  context  of  quantum  optics  and  Langevin 
equations,  see,  e.g..  [2]. 
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1 .  Select  a  system  description  that  is  simple  enough  that  it  can  be  worked  with 
numerically  and  complex  enough  that  it  can  describe  all  evaluated  moments 
and  quantities  accessible  by  the  actual  system.  The  system  description  can  be 
fully  quantum-correlated  ( entangled ),  but  if  it  is.  there  are  distinct  limits  on  the 
size  of  the  systems  that  can  be  studied.  In  magnetic  problems  involving  spin 
1/2.  e.g.,  a  fully  entangled  N  spin  description  requires  ~0(2'')  real  number 
coordinates  for  the  system  alone.  On  the  other  hand,  if  one  assumes  a  semi- 
classical  spin-spin  interaction,  the  spins  do  not  entangle  (so  that  the  system  can 
be  in  a  simple  product  state)  a  system  description  scales  like  ~0(A^). 

The  fully  entangled  description  is  quantum  mechanically  “exact”  but  only 
small  systems  can  be  studied,  so  that  many-body  effects,  especially  on  macro¬ 
scopic  scales,  may  be  represented  rather  poorly.  On  the  other  hand,  the  simpler 
description  can  be  scaled  to  rather  large  lattices  ( we  reach  N  =  2  '‘'-2  routinely, 
and  2 “-2^'  are  accessible  on  parallel  supercomputers)  and  thus  exhibit  a  full 
range  of  macroscopically  nonlinear  behaviors,  but  may  or  may  not  represent 
the  local  quantum  theory  very  accurately.  In  between,  there  are  various  ways 
of  constructing  clustered  or  coarse-grained  representations  with  a  limited  range 
of  entanglement  that  should  improve  the  quantum  description  but  still  allow 
statistically  meaningful  and  scale-extensible  macroscopic  behavior  to  be  studied, 
although  this  has  yet  to  be  proven  (by  us,  at  least). 

Experience  suggests  that  in  many  cases  the  semiclassical  or  classical  coordi- 
natization  captures  most  of  the  essential  quantum  physics  and  describes  exper¬ 
iment  rather  well  because  of  the  short  correlation  times  of  the  entangled  states 
in  systems  at  a  finite  temperature. 

2.  With  a  numerically  tractable  representation  in  hand,  we  then  solve  for  its  explicit 
coherent  time  evolution  by  deriving  and  inter/ating  ( numerically )  appropriate 
coupled  differential  equations  from  the  system  Hamiltonian. 

3.  Simultaneously  (while  the  system  undergoes  its  natural  conservative  and  coherent 
microscopic  motions)  simulate  its  incoherent  time  evolution  by  treating  its  in¬ 
coherent  bath  interaction  as  a  random  senes  of  delta-correlated  unitary  transition 
processes.  Each  transition  carries  the  system  from  one  coherent,  time-reversible 
system  worldline  to  another  (generally  quite  near  by ).  In  the  process,  information 
is  irreversibly  lost  (explicitly,  to  the  bath)  and  energy  may  or  may  not  be  con¬ 
served. 

The  essential  feature  of  our  approach  is  that  we  separate  out  the  coherent  and 
incoherent  components  of  the  motion  of  the  system  and  treat  the  intrinsically 
unitanty-preserving  coherent  time  evolution  “exactly”  (analytically  or  numer¬ 
ically)  and  force  the  incoherent  components  of  the  motion  to  be  unitary  tran¬ 
sitions  driven  by  stochastic  interactions  with  the  rest  of  the  universe  in  such  a 
way  that  the  damping/decay  rate  equations  are  satisfied  on  the  average.  The 
various  mechanisms  and  fields  that  contribute  directly  to  the  incoherent  time 
evolution  we  call  the  “bath,”  with  the  understanding  that  it  is  we  who  decide 
which  part  of  nature  makes  up  the  “system”  and  which  part  the  bath. 

In  the  GME,  once  one  decides  on  the  system,  the  bath  is  everything  else  (all 
the  coordinates  traced  over  in  the  projection  to  system-only  coordinates).  In 
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our  approach,  the  same  thing  is  very  nearly  true,  except  that  we  absorb  the 
unitary  contributions  of  the  nonsystem  coordinates  to  system  motion  into  the 
reversible  deterministic  evolution  and  treat  only  the  nonunitary  (damping)  con¬ 
tributions  as  being  the  stochastic  bath  (which  may  or  may  not  be  a  thermal  bath 
in  the  classic  sense). 

4.  Finally,  accumulate  or  record  in  each  time  step  “measurements”  performed  on 
the  system.  As  the  full  microscopic  slate  of  the  system  is  always  available,  arbitrary 
moments  and  quantities  that  are  generalized  functions  of  the  system  coordinates 
can  be  studied. 

5.  Depending  on  what  one  is  studying  in  a  particular  problem,  the  forward  time 
evolution  of  the  system  from  a  given  initial  state  can  be  repeatedly  solved.  The 
set  of  trajectories  thus  accumulated  constitute  a  sampling  of  the  actual  fiber 
bundle  of  trajectories  from  which  the  path  integral  of  the  problem  gets  its  weight. 
The  moments  of  this  bundle  of  trajectories  ( for  a  nonequilibrium  system)  are 
also  of  general  interest. 

The  time  evolution  thus  generated  has  several  remarkable  features;  It  allows  for 
or  includes  both  microcanonical,  canonical,  and  pure  stochastic  time  flow.  Both 
pure  molecular  dynamics  and  pure  (static)  importance  sampling  Monte  Carlo  are 
thus  completely  embedded  subsets  of  the  general  scheme  and  can  be  implemented 
by  simply  setting  certain  rate  parameters  to  zero  or  unity.  This  method  is,  however, 
much  more  general  and  can  be  applied  to  far  more  complex  and  realistic  problems 
in  addition  to  the  already  vast  body  of  problems  accessible  to  the  two  embedded 
methods. 

It  can  describe  (thermal)  equilibrium  problems,  relaxation  models,  and  general 
nonequilibrium  problems.  Open  (damped,  driven)  systems  can  thus  be  micro¬ 
scopically  treated.  The  hysteresis  cycle  of  absorptive  optical  bistability  obtained  in 
[6]  and  described  below  is  the  trace  of  a  first-order  dynamical  phase  transition  in 
an  open,  damped,  driven  system. 

The  microdynamic  method  is  non-Markovian.  When  a  many-body  system  makes 
a  “quantum  jump”  from  one  worldline  to  a  nearby  worldline  (when  the  optical 
state  of  a  single  atom  of  the  system  changes,  e.g.),  most  of  the  information  required 
to  describe  the  original  worldline  is  left  intact.  In  fact,  this  method  can  be  viewed 
(from  the  point  of  view  of  information  theory)  as  solving  for  the  finite  difference 
dynamics  of  entropy  flow  into  or  out  of  the  system  from  the  bath.  In  practice,  the 
behavior  of  the  system  is  distinguishably  connected  to  its  past-time  history  for  times 
much  longer  than  the  time  scale  of  microscopic  motion  or  local  stochastic  relaxation, 
especially  when  strongly  nonlinear  or  critical  behavior  is  macroscopically  observed, 
i.e.,  structures  in  the  system  can  have  extremely  long  lifetimes,  many  times  longer 
than  the  “memory  time”  of  a  single  component  in  the  structure.  Obvious  examples 
of  this  are  critical  slowing  down  in  general  and  metastable  configurations  “frozen 
in”  to  a  system  during  a  rapid  quencii  from  high-temperature  initial  conditions 
through  a  second-order  critical  point. 

In  Figures  4  and  5,  we  offer  numerical  evidence  that  including  system  dynamics 
interleaved  with  the  heat  bath  thermalization  process  can  alter  the  longitudinal 
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relaxation  rates  when  the  spin-spin  relaxation  rates  exceed  the  rates  of  the  spin- 
bath  interaction  (i.e.,  when  the  spin  waves  have  a  long  thermal  coherence  time). 
This  is  not  unreasonable,  but  is  completely  invisible  in  any  Monte  Carlo  approach 
that  does  not  include  microdynamics  explicitly.  On  the  other  hand,  when  the  ther- 
malization  process  is  fast  with  respect  to  the  spin-spin  relaxation  rate,  then  the 
time  scales  associated  with  Glauber  dynamics  are  essentially  real  time  and  dynamical 
scaling  laws  obtained  with  this  tool  are  valid. 

A  final  appealing  feature  of  this  approach  is  that  it  is  based  on  implementing,  as 
faithfully  as  possible,  a  numerical  realization  of  what  we  believe  to  actually  be 
occurring  in  the  system  in  nature.  It  is  more  than  “just  a  simulation” — it  is  a 
numerical  solution  to  the  (admittedly  stochastic)  equations  of  motion.  As  a  con 
sequence,  we  expect  that  the  method  will  respect  detailed  balance  and  broken  er- 
godicity  [10]  when  appropriate  (as  when  the  system  undergoes  a  phase  transition'' 
but  will  also  respect  the  kind  of  broken  ergodicity  that  supersedes  detailed  balance 
when  appropriate.  Not  all  simulation  methods  respect  broken  ergodicity  even  when 
it  occurs  in  the  actual  physical  model,  so  this  is  an  important  consideration  [11],. 

It  IS  not  possible  in  a  short  paper  to  present  more  than  a  brief  outline  of  some 
of  the  results  successfully  obtained  in  this  approach.  This  list  is  constantly  changing, 
as  well,  as  we  are  currently  actively  involved  in  implementing  it  to  a  variety  of 
problems.  We  urge  interested  readers  to  obtain  the  original  papers  or  to  contact  us 
for  more  detailed  discussions. 


II.  Optics 

W'e  deduced  this  method  in  work  leading  to  its  original  application  in  studies  of 
the  nonlinear  quantum  optics  of  “two  level  atoms”  in  a  tuned  cavity  and  stimulated 
with  a  coherent,  resonant  optical  field  (see  [6]  and  the  copious  references  therein). 
Phenomena  exhibited  by  such  a  collection  include  photon  echoes,  superradiance, 
and  absorptive  optical  bistability  (a  dynamical  first-order  phase  transition).  At  the 
time  of  our  work,  there  were  semiheuristic  descriptions  of  photon  echoes,  several 
competing  explanations  for  superradiance  (or  superfluorescence )  at  various  levels 
of  quantum  entanglement,  and  a  mean-field-based  understanding  of  absorptive 
optical  bistability  at  the  level  of  nonlinear  equations  describing  the  equilibrium. 
None  of  the  methods  used  to  understand  these  specific  problems  could  also  describe 
more  than  one  of  the  other  two  in  the  same  coordinatization. 

We  realized  that  if  we  treated  the  ensemble  of  two-level  atoms  as  being  semi- 
classically  coupled  via  the  total  cavity  field  (after  extracting  the  atom-field  entangling 
that  leads  to  the  spontaneous  emission  rates  in  QED),  that  free  quantum  time 
evolution  from  a  simple  product  state  would  leave  the  atoms  in  a  simple  product 
state.  If  we  treated  the  spontaneous  emission  process  physically  as  a  stochastic, 
delta-correlated  transition  from  an  arbitrary  superposition  state  to  the  ground  state, 
we  could  obtain  unitary  Langevin  equations  that  precisely  reproduce  the  single¬ 
atom  expectation  values  in  the  appropriate  limit  but  that  fully  allow  for  classically 
correlated  (collective)  field  emission  and  absorption.  It  was  natural  to  add  other 
generic-phase  interrupting  processes  (such  as  elastic  or  inelastic  collisions)  as  an 
additional  unitarity  preserving  stochastic  transitio:  term  in  the  time-evolution  loop. 


368 


BROWN  AND  CIFTAN 


Figure  1 .  The  modulatioa  of  the  driving  external  field  by  Rabi  oscillations  in  a  collection 
of  N  =  225  two-level,  resonant  atoms  enclosed  in  a  “cavity.”  As  the  atoms  are  driven 
through  the  Rabi  cycle,  they  coherently  absorb  and  re-emit  radiation  in  phase  with  the 
driving  field.  At  the  same  time,  however,  incoherent  processes  destroy  the  collective  phase 
of  the  atoms  so  that  eventually  they  absorb  as  much  as  they  decay  and  the  cavity  becomes 

transparent. 


The  resulting  system  of  3A^  +  1  coupled  linear  ordinary  differential  equations 
for  the  system  of  atoms  plus  the  field  was  readily  integrable  in  time.  The  stochastic 
elements  preserved  the  unitarity  of  the  description  and,  hence,  the  post-transition 
integrability.  Accurate  numerical  results  were  easily  obtained  for  N  ~  10^-10^ 
even  on  modest  computers. 

This  method  proved  capable  of  describing  photon  echoes,  self-initiated  super¬ 
radiance,  and  absorptive  optical  bistability  within  a  single  approach,  by  simply 
altering  the  appropriate  parameters  of  the  model  in  much  the  same  way  that  they 
would  be  varied  in  the  different  experiments  required  to  measure  them  in  a  single 
collection  of  atoms.  In  Figure  1  we  present  the  modulation  of  a  transmitted  radiation 
pulse  by  Rabi  oscillations  in  resonant  two-level  atoms.^  This  shows  that  the  atoms 
in  our  numerical  example  correctly  exhibit  damped,  driven  motion  that  can  be 
obtained  analytically  for  this  simple  case. 

In  Figure  2  we  present  a  typical  superradiance  curve  produced  by  the  numerical 
model,  with  an  inset  closeup  of  the  initiating  spontaneous  emission  event.  This 
picture  is  entirely  consistent  with  the  observations  of  MacGillivray  and  Feld  [13], 
who  also  proposed  that  the  full  apparatus  of  quantum  entangled  atomic  states  was 
not  really  necessary  to  quantitatively  or  qualitatively  describe  superradiance  ex- 


’  See  a  discussion  of  Rabi  oscillations,  together  with  figures,  in,  e.g.,  [12]. 
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Figure  2.  The  collective,  superradiant  emission  o(  N  =  225  two-level  atoms  prepared  at 
time  t  =  0  in  tl.e  completely  inverted  state.  The  system  is  quiescent  for  a  delay  time  after 
inversion,  at  whicn  point  a  single  spontaneous  decay  photon  emission  with  overlap  in  the 
cavity  mode  initiates  the  growth  of  the  superradiant  pulse  (see  inset).  The  pulse  itself 
develops  the  characteristic  hyperbolic  secap'  shape.  The  delay  time  itself  has  been  physically 
observed  (see  MacGi'Mvray  and  Feld  113)). 


penments.  Note  that  this  is  considered  a  nonlinear  macroscopic  phenomenon  that 
we  are  obtaining  from  a  strictly  linear  microscopic  model,  exactly  as  it  obtains  in 
nature. 

Finally,  in  Figure  3,  we  present  a  photon  echo  as  produced  by  the  numerical 
model.  This  phenomenon,  while  well  understood  on  an  heuristic  level,  is  remarkably 
difficult  to  get  from  a  deterministic  or  semideterministic  algebraic  model.  It  verifies 
that  the  atoms  are  being  driven  precisely  as  we  expect  them  to  be  on  a  fully  mi¬ 
croscopic  basis.  There  are  several  interesting  features  of  this  curve  that  are  more 
fully  discussed  in  [6]. 

The  A^-atom  optical  Bloch  equations  (our  numerical  model)  also  gave,  for  the 
first  time,  a  complete  microscopic  picture  of  what  was  happening  inside  the  system 
as  it  was  driven  through  a  complete  hysteresis  cycle  in  absorptive  optical  bistability. 
The  macroscopic  nonlinearity  was  thus  illuminated  as  a  simple,  intuitively  under¬ 
standable  microscopic  behavior  re -ulting  from  the  collective  interaction  of  the  atoms 
with  the  driving  mode  and  the  cavity.  Also,  the  approach  does  not  yield  the  non¬ 
physical  middle  branch  that  results  from  the  cubic  mean  field  equations  for  the 
equilibrium — it  allows,  instead,  a  direct  probe  of  the  size-extensivity  of  the  stability 
of  the  fluctuation-induced  phase  transition  at  the  branch  points  of  the  state  curve. 
A  full  tim.e  trace  of  the  hysteresis  cycle  is  given  in  [6],  as  well  as  the  resulting 
hysteresis  curve  itself. 
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Figure  3,  The  same  collcelion  of  A  =  225  atoms  being  driven  through  a  echo" 

experiment.  The  atoms  are  initially  in  the  ground  state.  They  arc  then  briefly  exposed  to 
a  “ir/2"-pulse  that  coherently  polariacs  them.  The  atoms  start  to  radiate  away  the  energy 
that  they  absorbed,  but  this  process  is  cut  ofTby  their  inhomogeneous  dephasing.  After  a 
delay  r,  they  .re  exposed  to  an  applied  “]r*’'pulsc  (hat  inverts  their  moments  with  respect 
to  the  polarization  plane.  After  an  additional  delay  r,  the  atoms  rephase,  producing  the 
clearly  visible  collective  emission  pulse  in  the  cavity  field. 


Ill.  Magnetism 

Our  original  studies  of  the  optics  problem  were  conducted  at  zero  temperature. 
This  is  because  it  is  very  difficult  to  include  temperature  and  time  evolution  in  an 
open  many-body  problem.  From  some  of  our  initial  studies  [7],  however,  it  became 
clear  that  we  needed  to  be  able  to  treat  finite  temperature  bath  interactions,  as  they 
could  lead  to  a  fundamentally  new  kind  of  organization  of  the  optical  states.  Since 
the  role  of  temperature  in  many-body  physics  is  nowhere  as  well  understood  as  it 
is  in  magnetism,  and  since  in  any  event  spin  dynamics  in  magnetism  is  closely 
allied  to  the  pseudospin  dynamics  that  we  studied  in  optics,  we  turned  to  magnetism 
to  learn  how  to  implement  canonical  heat  baths  in  our  optical  method. 

The  natural  extension  of  the  method  that  we  used  to  treat  spontaneous  emission 
and  collisions  in  the  optics  problem  was  to  induce  uriitarity-prcscrving  transitions 
from  arbitrary  superposition  statc.s  to  new  superpositions  states  selected  from  the 
canonical  distribution  of  available  states  at  the  given  temperature.  We  therefore 
implemented  this  in  our  code  after  converting  it  into  a  form  suitable  for  studying 
magnetism.  As  we  studied  Monte  Carlo  methods  for  the  first  time  (to  learn  how 
to  accumulate  appropriate  averages  so  that  we  could  check  our  results),  we  learned 
that  this  is  the  Glauber  or  heat  bath  thermalization  method  [4,5]  that  we  had 
reinvented  with  a  somewhat  different  motivation. 
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In  ordinao  Monte  Carlo,  the  time  scale  of  Glauber  thermalization  dynamics  is 
arbitrary — “Monte  Carlo  time”  is  not  necessarily  related  to  real  time,  although  it 
is  frequently  treated  as  if  it  were  in  ':tudies  of  dynamic  scaling.  In  the  unitary 
Langevin  method  that  we  describe  herein.  Monte  Carlo  time  is  directly  and  uniquely 
related  to  real  time  by  the  presence  of  the  relative  time  scales  of  the  various  physical 
processes  being  directly  integrated  over  in  the  deterministic  evolution.  It  is  not 
necessary  to  confine  oneself  to  studies  of  models  where  the  thermalization  time  is 
much  smaller  than  the  spin  -spin  dynamics  time  scale  or  vice  versa.  Instead,  one 
can  study  both  of  these  possibilities  as  well  as  the  possibility  that  spin-wave  relaxation 
and  Glauber  thermalization  proceed  on  commensurate  time  scales.  In  this  case, 
the  combination  of  microcanonical  spin-wave  relaxation  and  *■  e  canonical  thermal 
relaxation  rates  can.  at  least  in  principle,  alter  the  dynamic  susceptibilities  of  the 
system  being  studied. 

Using  our  method,  we  directly  study  the  spin  autocorrelation  function 
(S(0)-  S(t))  (where  the  average  is  over  multiple  time  trajectories  and  all  lattices 
sites  on  an  8  X  8  X  8  simple  cubic  Heisenberg  ferromagnet  with  periodic  boundary 
conditions).  The  spin  autocorrelation  has  two  components;  The  “longitudinal” 
component  measures  how  rapidly  the  spin  component  parallel  to  the  bulk  mag¬ 
netization  vector  at  t  =  0  changes;  the  “transverse”  component  measures  how 
rapidly  the  short-range  structure  in  the  components  perpendicular  to  the  initial 
magnetization  vector  decays  in  time.  The  longitudinal  component,  for  a  system  in 
equilibrium,  is  expected  to  undergo  a  random  walk  on  the  surface  of  the  (order 
parameter)  sphere  with  an  angular  step  size  that  scales  like  Vit.  The  transverse 
autocorrelation  relaxation  time  is  expected  to  diverge  at  the  critical  point  and  a 
dynamical  critical  exponent  is  associated  with  this. 

We  find  that  the  full  inclusion  of  dynamics  dr"s  alter  the  transverse  relaxation 
rates  of  the  system  under  certain,  readily  understandable  circumstances.  We  also 
find  that  it  does  noi  alter  the  longitudinal  relaxation  or  in  any  way  affect  detailed 
balance  or  the  averages  of  thermodynamic  (static)  quantities. 

In  the  case  of  classical  Heisenberg  spins  interacting  via  a  nearest-neighbor  inter¬ 
action.  the  classical  free  motion  is  for  each  spin  to  instantaneously  precess  around 
the  instantaneous  “field”  produced  by  its  nearest  neighbors  at  a  rate  determined 
by  the  field  strength.  This  is  simply  described  by  a  set  of  coupled  ordinary  differential 
equations  that  can  be  explicitly  ( numerically )  integrated  very  accurately.  The  time 
required  to  produce  a  single  2w  precession  around  the  largest  internal  field  possible 
can  be  larger  than,  smaller  than,  or  comparable  to  the  time  required  to  make  a 
Monte  Carlo  time  step  ( i.e.,  apply  the  heat  bath  algorithm  to  each  spin  in  the  system 
one  time).  This  can  be  adjusted  in  the  code  by  interspersing  several  smaller  inte¬ 
gration  time  steps  inside  the  thermalization  loop  or  by  performing  several  ther- 
malization  loops  per  integration  time  step. 

Figure  4  is  a  comparison  of  a  typical  ordinary  heat  bath  Monte  Carlo  autocor¬ 
relation  ( no  dynamics  at  all )  with  the  autocorrelation  produced  when  dynamics 
arc  included  but  with  spin-spin  relaxation  rates  that  are  comparable  to  or  slower 
than  the  thermal  Monte  Carlo  relaxation  rates;  i.e..  each  spin  can  precess  only  a 
small  distance  around  its  local  site  field  before  being  “thermalized,”  even  in  the 
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Figure  4,  The  decay  of  total  autocorrelation  in  an  8  X  8  X  8  simple  cubic  lattice  of 
Heisenberg  spins  at  their  cntical  temperature.  In  one  case,  dynamical  spin-spin  relaxation 
occurs,  but  at  a  rate  the  same  than  or  slower  than  the  thermal  relaxation  rate.  In  the  other 
case,  thermal  relaxation  only  is  present.  The  two  relaxation  rates  are  indistinguishable. 


strongest  local  fields  that  can  exist  in  the  system.  The  thermalizing  bath  interactions 
(Monte  Carlo  steps)  destroy  the  local-phase  angles  of  the  spins  with  respect  to  their 
local  instantaneous  field.  This  results  in  a  characteristic  fast  decay  of  the  transverse 
component  of  its  spin  autocorrelation  to  the  residual  slowly  decaying  longitudinal 
component.  Clearly,  the  transverse  relaxation  rates  are  equal  for  both  situations 
(with  or  without  the  dynamics)  or  nearly  so.  The  same  is  true  for  the  longitudinal 
rates.  The  scatter  observed  is  due  to  relatively  small  number  ( 100)  of  short  runs 
being  averaged  and  should  not  be  interpreted  as  a  structure  of  any  sort. 

On  the  other  hand,  in  Figure  5,  we  set  the  time  scales  so  that  each  spin  could 
precess  through  angles  comparable  to  2r  due  to  the  Hamiltonian  spin-spin  inter¬ 
action  ( spin  wave  motion )  during  the  intervals  between  thermalizing  bath  inter¬ 
actions.  This  spin  motion  has  the  effect  of  destroying  transverse  correlations  in¬ 
dependent  of  the  bath.  As  a  consequence,  transverse  relaxation  is  greatly  accelerated 
on  the  time  scale  of  the  Monte  Carlo  process  (f  counts  Monte  Carlo  “sweeps”  or 
time  steps ) .  Indeed,  one  would  expect  that  even  if  no  random  processes  whatsoever 
were  present,  the  spin-spin  interaction  would  generally  suffice  to  destroy  transverse 
correlation  in  the  system  via  wavelike  dispersion  processes.  Longitudinal  correlation, 
however,  is  a  constant  of  the  motion  for  the  Heisenberg  Hamiltonian  and  only  the 
bath  breaks  this  symmetry,  inducing  gradual  longitudinal  relaxation. 

Clearly,  this  is  a  very  simple  case  that  is  readily  understandable — after  the  sim¬ 
ulation  reveals  the  phenomenon.  On  the  other  hand,  it  also  shows  how  far  more 
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Figure  5.  The  same  as  in  Figure  4,  but  now  the  spin-spin  relaxation  for  the  “fastest” 
modes  proceeds  ~10x  faster  than  the  thermal  relaxation  rate.  This  clearly  ’.ffects  the 
transverse  decay  rate,  as  spin-spin  relaxation  alone  would  destroy  transverse  autocorrelation. 
The  decay  of  longitudinal  autocorrelation,  however,  is  still  dominated  by  the  Monte  Carlo 
( thermal )  relaxation  process,  as  this  kind  of  autocorrelation  is  nearly  conserved  by  Ham¬ 
iltonian  motion. 


complex  behavior  can  be  simulated  for  an  open  system  that  cannot  be  so  easily 
understood.  It  also  provides,  at  last,  a  formal  process  for  validating  the  notion  of 
“Monte  Carlo  time”  and  its  relevance  to  dynamic  scaling  laws.  In  systems  where 
the  thermal  spin-wave  coherence  time  is  short,  Monte  Carlo  time  as  used  in  heat 
bath  simulations  should  yield  meaningful  dynamical  exponents.  In  systems  where 
the  thermal  relaxation  piocess  is  slow  with  respect  to  the  time  constants  of  the 
internal  motion,  however,  this  is  not  guaranteed  to  be  the  case. 

The  continuing  study  of  both  dynamics  and  static  critical  properties  in  open  and 
closed  systems  is  the  focus  of  our  current  research  efforts  [8].  In  addition,  we  are 
involved  on  an  advisory  level  with  the  extensions  of  this  method  in  lattice  gauge 
calculations  being  conducted  here  [9].  It  is  our  belief  that  this  method  will  eventually 
prove  to  be  a  powerful  tool  in  the  arsenal  of  the  many-body  theorist. 


IV.  Conclusions 

We  have  presented  a  description  of  a  method  that,  supplemented  by  a  perusal 
of  our  published  and  unpublished  work,  should  suffice  to  enable  interested  re¬ 
searchers  to  implement  a  generically  similar  scheme  in  a  wide  range  of  dynamics 
or  static  many-body  problems.  We  have  also  presented  evidence  that  this  scheme 
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leads  to  physically  reasonable  and  intuitively  understandable  results  in  a  very 
natural  way. 

To  conclude,  we  want  to  present  the  logic  of  this  approach  in  more  complex 
(and  complete)  representations,  including  both  the  full  iV-body  representation  and 
coherent  states.  The  former  leads  to  a  problem  that  is  numerically  intractable  as  it 
stands  except  for  very  small  N;  the  latter  has  functional  extensions  that  may  make 
it  very  useful  in  this  sort  of  approach. 

In  the  specific  examples  discussed  above,  the  spins  or  atoms  were  assumed  to  be 
in  a  simple  product  state — no  quantum  entangling  of  the  spin-spin  interaction  was 
allowed. 

Note  that  in  magnetism  this  makes  a  quantum  spin  model  into  a  semiclassical 
spin  model  equivalent  to  the  classical  model  (S=  cc).  The  spins  are  still  semiclassical 
in  that  quantum  transitions  are  still  allowed;  this  can  lead  to  some  interesting  and 
unique  new  physics  that  we  are  still  studying. 

In  optics  this  means  that  we  are  not  using  the  fully  entangled  Dicke  states  [14] 
or  atomic  coherent  states  [  1 5  ]  ( the  spin-state  equivalents  of  the  coherent  states  of 
the  radiation  field  [16] )  but  a  representation  intermediate  between  these  entangled 
representations  and  the  neoclassical  picture  of  Jaynes  and  Cummings  [17],  This 
representation  treats  the  single  atom-field  entangling  in  QED  (extracting  the  actual 
spontaneous  emission  rate  and  Lamb  shift)  but  leaves  the  atom-atom  interaction 
disentangled.  This  works  remarkably  well  for  most  problems. 

If,  however,  one  is  faced  with  a  problem  where  many-body  entangling  is  a  nec¬ 
essary  evil  with  significant  physical  consequences,  all  is  not  lost.  The  method  can 
still  be  applied,  although  one  will  have  to  work  much  harder  and  strain  one’s  com¬ 
putational  resources. 

The  essential  elements  to  applying  this  method  to  entangled  problems  are 

1 .  Identifying  the  best  representation,  i.e.,  the  smallest  and  most  tractable  repre¬ 
sentation  that  still  contains  the  essential  features  of  the  model. 

2.  Deriving  microreversible  equations  of  motion  for  the  coordinates  o'"  this  de¬ 
scription. 

3.  Deriving,  deducing,  or  even  guessing  the  appropnate  unitai^  transitions  induced 
by  the  “bath”  (thermal  or  otherwise). 

4.  Implementing  the  equations  of  motion  and  stochastic  bath  interactions  in  a 
single  piece  of  code  with  appropriate  statistical  hooks  and  accumulators. 

By  using  clustering  and  scaling  arguments  in  the  first  step,  it  may  be  possible  to 
embed  this  algorithm  at  the  heart  of  a  real-space  real-time  numerical  renormalization 
scheme  applicable  to  entangled  open  systems.  This  would  open  a  new  range  of 
extremely  interesting  problems  up  to  detailed  study  and  is  one  of  the  chief  goals  of 
our  current  research. 

We  are  also  very  interested  in  synthesizing  the  results  from  these  physically  mo¬ 
tivated  Langevin  equation  calculations  with  the  concepts  and  results  obtained  with 
the  functional  integral  method  [18].  The  microdynamic  method,  when  applied 
carefully,  may  give  us  insight  into  the  correct  form  of  the  “mean  field”  results  (or 
a  related  Landau  theory)  that  can  be  corrected  for  fluctuations  by  functional  integral 
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methods.  We  have  obtained  [19]  the  real-space,  real-time,  dynamical  evolution 
equations  in  the  functional  integral  approach  i  using  the  coherent  state  representa¬ 
tion  [16]). 


Acknowledgment 

We  wish  to  acknowledge  the  support  the  Army  Research  Office,  which  sponsored 
this  research. 


Bibliography 

[  I J  J.  M.  Kosterlitz  and  D.  J.  Thouless.  J.  Ph>s.  C6,  1 181  ( 197.t) 

[2]  G.  S.  Agarwal.  Quantum  Optus.  Springer  Tracts  in  Modem  Physics.  Vol.  70  {Springer-Verlag, 
New  York.  1974). 

[3]  R.  P  Feynman  and  A.  R.  Hibbs.  Quantum  Sfethanus  and  Path  Integrals  (McGraw-Hill,  New 
York,  1964). 

[4]  R.  J.  Glauber.  Jour.  Math.  Phys.  4.  294  ( 1963). 

[5]  K.  Binder  and  D  W  Hecrmann.  Monte  Carlo  Simulation  m  Stalistual  Plnsics.  Springer  Senes 
in  Solid-State  Sciences  80  (Springer-Verlag.  New  York.  1988). 

[6]  R.  G.  Brown  and  M.  Ciftan.  Phy.  Rev.  A40,  3080  ( 1989). 

[7]  R.  G.  Brown  and  M.  Ciftan.  in  Condensed  Matter  Theories  3.  J.  S  Arponen.  R.  G.  Bishop,  and 
M.  Manninen.  Eds.  (Plenum,  New  York,  1987),  p.  339. 

[8]  R.  G  Brown  and  M.  Ciftan.  unpublished  (work  m  progress  on  the  semiclassical/classical  Heisenberg 
model). 

[9J  B.  Millie.'  and  A.  Trayanov.  Phys.  Rev.  Lett.  68,  3387  ( 1992). 

( 10]  R.  O.  Palmer.  Adv.  Phys.  32,  669  ( 1982). 

(11]  See.  e.g..  u.c  discussion  in  O  G.  Mountsen.  Computer  Studies  of  Phase  Transitions  and  Critual 
Phenomeiu.  Sponger- Verlag  Senes  in  Computational  Physics  ( Spnnger-Verlag.  New  York.  1 984 ) 
p.  7  and  ff. 

[12]  R.  Loudon.  The  Quantum  Theorx  of  Light  (Clarendon.  Oxford.  1983) 

[13]  J.  C.  MacGillivray  and  M  S.  Feld.  Phys  Rev.  AI4,  1 169  ( 1976). 

1 14]  R.  H.  Dicke,  Phys.  Rev.  93, 99  ( 1954). 

[15]  F.  T.  Arecchi,  E.  Couriens.  R  Gilmore,  and  H.  Thomas.  Phys.  Rev.  A6.  221 1  ( 1972). 

[16]  R.  Glauber.  Phys  Rev.  131, 2766  ( 1963). 

[17]  E.  T.  Jaynes  and  F.  W.  Cummings,  Proc,  IEEE  51, 89  ( 1963). 

[  18]  J.  W.  Negele  and  H.  Orland.  Quantum  Many-Pamcle  Sisrcm.s  (Addison- Wesley,  New  York,  1988). 

[  19]  W.  Poetz  and  M.  Ciftan.  unpublished  ( 1993). 


Received  July  1,  1993 


Determinantal  Inequalities  among  (r") 

P.  CSAVINSZKY 

Department  of  Physics  and  Astronomy.  Vniversity  of  Maine.  Orono.  .Maine  04469-5709 


Abstract 

This  paper  considers  two  types  of  2  X  2.  3  x  3.  and  4X4  determinantal  inequalities.  The  elements 
of  one  type  are  (r"),  while  the  elements  of  the  other  type  are  c(r").  where  c  =  3  +  n.  The  2  X  2,  3  X 

3,  and  4X4  determinantal  inequalities  involve  n  =  -1. 0.  1.  n  =  -  1, 0.  2.  3. 4.  and  n  =  -1, 0.  1.  2,  3, 

4.  5.  The  two  types  of  2  X  2,  3  X  3.  and  4X4  inequalities  arc  used  to  obtain  lower  bound  estimates  of 
(r),  (»•’),  and  for  the  noble  gas  atoms  He,  Ne.  Ar.  Kr.  and  Xe.  The  values  of  these  quantities 
obtained  from  the  inequalities  are  compared  with  the  quantum  mechanical  values  of  Boyd,  who  calculated 
them  with  the  near  Hartree-Fock  analytical  wave  functions  of  Clementi  and  Roetti.  ©  1993  John  Wiley 
&  Sons.  Inc. 


Introduction 

The  establishment  of  inequalities  involving  (r").  where  r  is  the  distance  from 
the  atomic  nucleus,  poses  the  question:  To  what  accuracy  can  one  calculate  an 
expectation  value  of  r"  in  terms  of  other  expectation  values  of  powers  of  r?  Several 
N  X  A  determinantal  inequalities  involving  (r")  were  given  by  Gadre  and  Matcha 
[1].  The  2X2  inequalities  involve  <r")  with  n  =  -1. 0.  1.  The  3X3  inequalities 
involve  <r")  with  «  =  -1,  0,  1,  2,  3.  One  of  the  2  X  2  and  one  of  the  3  X  3 
inequalities  involve  c<r"),  where  c  =  3  +  n. 

In  the  present  work,  two  4X4  determinantal  inequalities  are  established.  One 
of  these  involves  (r"),  while  the  other  one  involves  c^r”}  with  c  -  2  +  n  and 
n=  -1,0,  1,2,  3,  4,  5. 

The  motivation  for  establishing  N  X  A  determinantal  inequalities  is  simple.  The 
2X2  case,  for  instance,  permits  one  to  obtain  a  lower  bound  estimate  of  (  r)  in 
terms  of  (r’’).  Similarly,  the  3  X  3  case  permits  one  to  obtain  a  lower  bound 
estimate  of  <r^)  in  terms  of  {r),  and  (r‘).  In  a  similar  manner,  in  the  4 

X  4  case,  one  can  obtain  a  lower  bound  estimate  of  (r^)  in  terms  of  (f), 
<r^),  (r^),  and  (r*).  In  the  present  work,  (r),  (r^),  and  (r^)  are  calculated 
from  two  types  of  determinantal  inequalities  for  the  noble  gas  atoms  He,  Ne,  Ar, 
Kr.  and  Xe.  In  these  calculations,  the  (r")  values  of  Boyd  [2]  are  used  who  obtained 
them  from  the  near  Hartree-Fock  (hf)  analytical  wave  functions  of  Clementi  and 
Roetti  [3].* 


*  It  should  be  mentioned  here  that  a  large  body  of  literature  exists  on  (per  electron)  momentum 
expectation  values  (p").  Some  of  this  literature  deals  with  inequalities  involving  expectation  values  of 
powers  of  p,  while  another  part  of  the  literature  deals  with  inequalities  involving  both  (p")  and  (r"). 
None  of  the  relevant  publications  will  be  referred  to  in  the  present  paper. 
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In  what  follows,  atomic  units  will  be  used.  (The  unit  of  length  is  the  bohr,  the 
unit  of  energy  is  the  hartree.) 


Determinantal  Inequalities 

A  type  of  determinantal  inequalities  was  obtained  by  Gadre  and  Matcha  [  1  ] , 
who  made  use  of  the  Gram  inequality,  namely  [4]  of 


|J /x(:c)/.(:f)^/A-  ^0  (X./i=  1,2,... 

,N). 

(1) 

Defining/;(jr)  by  [1]’^ 

/„(x)  =  [4trr='’-'p(r)]'/^  (n=  1,2,,., 

■  ,N), 

(2) 

where  p(r)  is  the  position  electron  density,  and  noting  that 

p(r)  =  (l/47r)r-^Z)(r), 

(3) 

where  D{r)  is  the  position  radial  electron  density,  normalized  to  unity  [2],  one 
obtains  with 


r"D{r)  dr 


(4) 


the  2  X  2,  3  X  3,  and  4X4  inequalities 


1 


1 

<'■> 


sO. 


1 

<'■> 


J  <'■> 

(O  <r^> 


^0, 


and 


<r->  1  <r>  <r^> 

1  <r>  <r^>  <r^> 

<r>  <r^>  <r^>  <r^> 

<r^>  <r^>  <r^>  <r’> 


^0.. 


(5) 


(6) 


(7) 


Putting  the  2  X  2,  3  X  3,  and  4X4  determinants  in  Eqs.  (5)-(7)  equal  to  zero, 
the  calculated  lower  bound  values  of  <r).  (r^),  and  (/■’),  for  the  noble  gas  atoms 
He,  Ne,  Ar,  Kr,  and  Xe,  are  displayed  in  the  second  columns  of  Tables  I,  II, 
and  III. 

Another  type  of  determinantal  inequalities  was  obtained  by  Gadre  and  Matcha 
[  1 )  by  making  use  of  a  theorem  by  Polya  and  Szegb  [  5  ] ,  namely. 


•  There  IS  a  mispnnt  in  Ref.  [I ).  The  quantity /„  there  is  given  as /„  =  (47rp(r)^"  I.  2.  .1. 

.).  whereas  the  correct  expression  is  that  displayed  in  Eq.  (2)  of  the  present  paper. 
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Table  1.  Calculated  lower  bound  (LB)  and  quantum 
mechanical  (qm)  values  of  {r)  (in  a.u.)  and  their  ratios* 


Atom 

<f)LB 

<f)oM 

He 

0.59 

0.93 

0.64 

Ne 

0.32 

0.79 

0.41 

Ar 

0.26 

0.89 

0.29 

Kr 

0.20 

0.73 

0.27 

Xe 

0.17 

0.72 

0.24 

*  The  calculated  values  of  (r)  were  obtained  from 
the  2  X  2  determinant  in  Eq.  (5)  by  putting  it  equal 
to  zero.  The  quantum  mechanical  values  of  (r)  were 
taken  from  Ret.  [2]. 


I 


(ax  +  a^+  1)  dm  s;  0 


(X,M=  1,2,  ...,iV).  (8) 


Defining/(x)  by  47rp(r)  [1],*  and  setting  a,  =  1/2,  =  3/2;  a,  =  1/2,  = 

3/2,  Oi  =  5/2;  and  rzi  =  1/2,  a2  =  3/2,  ay  =  5/2,  and  04  =  1  jl,  one  obtains  the 
2  X  2,  3  X  3,  and  4X4  inequalities 


2<r-'> 

3 


2<r-'>  3  4<r> 

3  4<r>  5<rZ> 

4<r>  5<r2>  6<r3> 


^0, 


(9) 

(10) 


*  The  factor  is  missing  in  Ref.  (IJ.  The  introduction  of  this  factor  is,  however,  necessary  if  one 
wants  to  reproduce  Eq.  (6)  of  Ref.  (IJ. 


T«ble  II.  Calculated  lower  bound  (lb)  and  quantum 
mechanical  (qm)  values  of  (r’)  (in  a.u.)  and  their 
ratios.* 


Atom 

He 

1.72 

1.94 

0.89 

Ne 

1.29 

1.44 

Ar 

2.54 

3.11 

0.82 

Kr 

1.82 

2.44 

0.74 

Xe 

2.03 

2.82 

0.72 

*  The  calculated  values  of  <r’)  were  obtained  from 
the  3  X  3  determinant  in  Eq.  (6)  by  putting  it  equal 
to  zero.  The  quantum  mechanical  values  of  (r’)  were 
taken  from  Ref.  [2]. 
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Table  III.  Calculated  lower  bound  (lb)  and  quantum 
mechanical  (qm)  values  of  (r*)  (in  a.u.)  and  their 
ratios.* 


Atom 

<r’>LB 

<r’>QM 

He 

7.02 

9.20 

0.76 

Ne 

5.93 

6  24 

0.95 

Ar 

19.87 

24.42 

0.81 

Kr 

20.91 

22.27 

0.94 

Xe 

29.67 

31.75 

0.94 

*  The  calculated  values  of  (r^)  were  obtained  from 
the  4  X  4  determinant  in  Eq.  (7;  by  putting  it  equal 
to  zero.  The  quantum  mechanical  values  of  (r*)  were 
taken  from  Ref.  [2]. 


and 


|2<r->  3  4<r>  5<r^> 

'  3  4<r>  5<r">  6<r^> 

I  4<r>  5<r2>  6<r^>  7<r^> 

I  5<r2>  6<r’)  S(r^) 


^0. 


(11) 


Putting  the  2  X  2, 3  X  3,  and  4X4  determinants  in  Eqs.  (9)-(  1 1 )  equal  to  zero, 
the  calculated  lower  bounr  values  of  <  r ) ,  (  ) ,  and  (  r ’ ) ,  for  the  noble  gas  atoms 
He,  Ne,  Ar,  Kr,  and  Xe,  are  displayed  in  the  second  columns  of  Tables  IV,  V, 
and  VI. 


Table  IV.  Calculated  lower  bound  (lb)  and  quantum 
mechanical  (QM)  values  of  <r>  (in  a.u.)  and  their  ratios.* 


Atom 

(Olb 

(Oqm 

<r>LB/<''>QM 

He 

0.67 

0.93 

0.72 

Ne 

0.36 

0.79 

0.46 

Ar 

0.29 

0.89 

0.33 

Kr 

0.22 

0.73 

0.31 

Xe 

0.19 

0.72 

0.26 

*  The  calculated  values  of  {r>  were  obtained  from 
the  2  X  2  determinant  in  Eq.  (9)  by  putting  it  equal 
to  zero.  The  quantum  mechanical  values  of  (r)  were 
taken  from  Ref.  [2]. 
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Table  V.  Calculated  lower  bound  (lb)  and  quantum 
mechanical  (QM)  values  of  (r’)  (in  a  u.)  and  their 
ratios.* 


Atom 

{r^)oM 

He 

1.79 

1.94 

0.92 

Ne 

1.25 

1.44 

0.86 

Ar 

2.64 

3.11 

0.85 

Kr 

1.90 

2.44 

0.78 

Xe 

2.13 

2.82 

0.75 

•  The  calculated  values  of  (r^)  were  obtained  from 
the  3  X  3  determinant  in  Eq.  (10)  by  putting  it  equal 
to  zero.  The  quantum  mechanical  values  of  {r’)  were 
taken  from  Ref.  (2). 


Finally,  it  is  noted  that  Eq.  (9),  in  the  form  of  (8/9)<r)<r  ' )  >  1,  has  already 
been  obtained  by  Gadre  [6].* 


•  There  is  a  misprint  in  Ref  (6].  The  nght-hand  side  of  the  inequality  [Eq.  ( 1 )]  is  wntten  as 

'  -  (jfifF )'  f  I'  " 

whereas  the  correct  expression  is 


The  inequalities  in  Ref  [6]  have  been  denved  on  the  assumption  that  the  electronic  charge  density  p{r'i 
IS  monotonically  decreasing  with  r.  There  is  no  general  proof  of  this,  but  there  is  numencal  evidence  by 
Sperber  [7]  and  by  Simas  et  al.  [14]  that  hf  charge  densities  are  monotonically  decreasing  functions  of 
r  111  Ref  [14]  this  has  been  established  for  the  neutral  atoms  H  through  U  by  making  use  of  the  near 
HF  analytical  wave  functions  of  dementi  and  Roetti  [  3  ]  and  those  of  MacLean  and  MacLean  [15]. 


Table  VI.  Calculated  lower  bound  (lb)  and  quantum 
mechanical  (QM)  values  of  (r*)  (in  a.u.)  and  their 
ratios.* 


Atom 

<r’)oM 

('■’)lb/(''’)qm 

He 

9.01 

9.20 

0.98 

Ne 

6.01 

6.24 

0.96 

Ar 

23.28 

24.42 

0.95 

Kr 

21.10 

22.27 

0.95 

Xe 

30.01 

31.75 

0.95 

*  The  calculated  values  of  <r’)  were  obtained  from 
the  4  X  4  determinant  in  Eq.  ( 1 1 )  by  putting  it  equal 
to  zero.  The  quantum  mechanical  values  of  (r’)  were 
taken  from  Ref  [2], 
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Discussion 

It  is  seen  from  the  second  column  of  Table  1  that  the  value  of  <  r)i,B  decreases 
as  one  goes  from  He  to  Xe.  The  second  column  of  Table  II  does  not  show  a  regular 
behavior  in  the  values  of  <r^)LB-  The  second  column  of  Table  III  shows  that,  with 
the  exception  of  Ne,  the  value  of  ( )lb  increases  as  one  goes  from  He  to  Xe. 
Inspection  of  the  (r)QM  (QM  means  quantum  mechanical)  values  in  the  third 
column  of  Table  I  shows  that  the  decreasing  trend  in  this  quantity  is  broken  with 
Ar.  One  can  see  from  the  third  column  of  Table  11  that  there  is  no  systematic  trend 
in  the  values  of  as  one  goes  from  He  to  Xe.  A  look  at  the  values  of  <r^)QM 

in  the  third  column  of  Table  III  reveals  no  systematic  trend  in  this  quantity  either. 
The  fourth  columns  of  Tables  I  and  II  reveal  that,  for  all  noble  gas  atoms.  (f)LB/ 
('')qm  and  (r-'')i.B/<r-’)QM  is  smaller  than  unity.  This  is  also  the  case  for 
as  can  be  seen  from  the  fourth  column  of  Table  III. 

Moving  now  to  Table  IV,  one  sees  that  the  values  of  (/■)lb,  in  the  second  column 
of  Table  IV,  exhibit  a  decreasing  trend.  The  (r)LB/(t‘)QM  values,  in  the  fourth 
column  of  Table  IV,  show  a  decreasing  trend  too.  A  glance  at  the  values  in 

the  second  column  of  Table  V  shows  an  irregular  behavior.  Inspection  of  the  (r’)LB/ 
('■^)qm  values  in  the  fourth  column  of  Table  V  shows  a  decreasing  behavior  as 
one  moves  from  He  to  Xe.  One  can  see  from  the  second  column  of  Table  VI  that 
there  is  no  systematic  trend  in  the  values  of  (r’>LB  as  one  moves  from  He  to  Xe. 
It  is  interesting  to  note  that  the  values  of  (/•^>lb/<''’)qm,  in  the  fourth  column  of 
Table  VI,  show  a  near  constancy. 

One  more  point  should  be  mentioned  in  connection  with  Tables  I-VI.  In  Ref. 
[2]  the  quantities  and  <r^>QM  are  displayed  to  an  ac¬ 

curacy  of  five  numbers  following  the  decimal  point.  The  quantity  (r'’)QM  is  displayed 
to  an  accuracy  of  four  numbers  following  the  decimal  point,  while  for  (r*)QM  the 
accuracy  drops  to  three  numbers  following  the  decimal  point.  For  this  reason,  the 
present  calculations  were  carried  out  to  an  accuracy  of  three  numbers  following 
the  decimal  point  and  the  results  shown  in  Tables  I-IV  are  rounded  to  two  numbers 
following  the  decimal  point. 

Conclusion 

It  does  not  seem  promising  to  enter  into  speculation  about  the  reason  for  the 
trends  exhibited  in  Tables  I-VI  by  <r>LB,  <''^>lb,  and  <r^>LB.  About  the  only  safe 
statement  is  that  the  trends  are  connected  with  the  increasing  number  of  electrons 
distributed  in  an  increasing  number  of  shells,  as  one  moves  from  He  to  Xe. 

Another  question  may  be  asked:  Which  2  X  2,  3  X  3,  and  4X4  determinantal 
inequalities  give  “better”  bounds?  Are  the  inequalities  in  Eqs.  ( 5 )-( 7 )  better  than 
those  in  Eqs.  (9)-(  1 1 ).  or  is  it  the  other  way  around?  One  may  attempt  to  answer 
this  question  by  calculating  the  quantities 

column  4  of  Table  I 
column  4  of  Table  IV  ’ 
column  4  of  Table  II 


column  4  of  Table  V  ’ 


(12) 
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Table  VII.  Companson  of  the  two  sets  of  2  X  2,  3  X  3.  and  4X4  determinantal  inequalities  [Eqs 

(5H7)and(9Hll)] 


Atom 

Colum".  4  of  Table  I 
Column  4  of  Table  IV 

Column  4  of  Table  II 

Column  4  of  Table  V 

Column  4  of  Table  III 

Column  4  of  Table  VI 

He 

0.89 

0.97 

0.78 

Ne 

0.89 

1.05 

0.99 

Ar 

0.88 

0.96 

0.85 

Kr 

0.87 

0.95 

0.99 

Xe 

0.92 

0.96 

0.99 

and 


column  4  of  Table  III 

-  ( 131 

column  4  of  Table  VI 

for  He,  Ne,  Ar,  Kj,  and  Xe.  These  quantities  are  tabulated  in  Table  VII. 

Inspection  of  Table  VII  reveals  that,  for  all  noble  gas  atoms,  the  values  in  the 
second  column  of  the  table  are  remarkably  constant,  whereas  those  in  the  third 
and  fourth  columns  show  deviations  from  constancy.  It  is  also  seen  from  Table 
VII  that,  in  most  cases,  the  vatues  displayed  in  the  second,  third,  and  fourth  columns 
a's  smaller  than  unity.  Looking  at  these  columns  of  Table  VII,  it  appears  safe  to 
say  that  both  sets  of  determinantal  inequalities  are  equally  “good.”  One  might  add 
that  the  second  set  [Eqs.  (9)-(  1 1 )]  is  “better”  for  (r)iB/(f)QM  than  the  first  set 
[Eqs.  (5)-(7)],  while  no  such  clear-cut  behavior  is  exhibited  by 
and  <r^>LB/<^^>QM- 

It  should  be  mentioned  here  for  the  sake  of  completeness  that  some  inequalities 
involving  (r”)  have  been  established  for  two-electron  systems.  The  papers  by  Tsap- 
line  [8],  Blau  et  al.  [9],  and  Galvez  [10]  deal  with  this  subject.*  It  should  also  be 
mentioned  that  Weinhold  [11]  used  determinantal  inequalities  for  obtaining  bounds 
to  various  quantum  mechanical  quantities  such  as  energy  eigenvalues,  scattering 
phase  shifts  and  several  other  quantities. 

In  closing,  two  recent  papers  (not  dealing  with  two-electron  atoms)  should  also 
be  mentioned.  The  first  paper  is  by  Angulo  and  Dehesa  [12],  who  obtained  in¬ 
equalities  for  The  second  paper  is  by  Galvez  and  Porras  [13],  who 

obtained  the  inequalities  ^  (2Z//c)(r^“')  (Z  is  the  atomic  number)  with 

k  =  1,2,3.  Galvez  and  Porras  [13]  have  also  obtained  more  complicated  inequalities 
that  involve  (on  the  left-hand  side  of  the  inequality),  and  ), 

(r*),  and  k  (on  the  right-hand  side  of  the  inequality). 


*  References  [8-10)  gave  general  A-electron  inequalities.  In  Refs.  [8)  and  [10]  they  were  applied  to 
one-electron  and  two-electron  systems 
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Abstract 

Relativistic  quantum  defect  orbital  (RQDO)  calculations,  with  and  without  explicit  account  for  core¬ 
valence  correlation,  have  been  performed  on  several  electron  transitions  in  the  zinc  and  cadmium  is¬ 
oelectronic  sequences,  which  are  of  interest  m  astrophysics  and  fusion  plasma  research  A  comparative 
study  with  other  theoretical  results  and  experimental  measurements  has  also  been  carried  out.  ©  1993 
John  Wiley  &  Sons.  Inc. 


Introduction 

Spectroscopic  investigations  on  the  structure  of  highly  ionized  atoms  have  un¬ 
dergone  significant  developments  in  recent  years.  The  experimental  progress  is 
largely  due  to  the  introduction  of  several  powerful  light  sources,  such  as  laser- 
produced  plasmas,  Tokamak  fusion  reactors,  and  beams  of  fast,  excited  ions.  The 
results  obtained  with  these  laboratory  devices  complement  the  data  from  astro- 
physical  observations  of  the  solar  corona  and  solar  flares.  Experimental  and  theo¬ 
retical  data  for  highly  ionized  atoms  are  needed  for  the  diagnostics  of  astrophysical 
and  laboratory  plasmas.  For  more  than  40  years  it  has  been  possible  to  record  the 
spectrum  of  the  sun  in  the  uv  and  x-ray  regions  by  means  of  spacecraft-based 
observations.  These  observations  have  revealed  a  large  number  of  radiative  tran¬ 
sitions,  not  previously  observed  in  the  laboratory,  thereby  stimulating  much  activity 
in  theoretical  and  experimental  atomic  spectroscopy. 

Highly  ionized  atoms  appear  in  magnetically  con'"  led  high-temperature  fusion 
plasmas,  such  as  those  created  with  Tokamaks,  stelle.*  .ors.  magnetic  mirrors,  etc., 
as  well  as  in  inertially  confined  plasmas,  produced  with  powerful  lasers  of  particle 
beams.  The  presence  of  heavy,  ionized  atoms  results  in  energy  losses  and  plasma 
cooling.  However,  there  are  also  positive  aspects,  since  the  radiation  emitted  from 
the  impurities  provides  detailed  information  about  the  physical  processes  in  these 
plasmas  [  1  ] . 
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386 


LAVIN  ET  AL 


To  rrovide  the  atomic  data  necessary  for  plasma  diagnostics,  such  as  tran¬ 
sition  probabilities  and  oscillator  strengths,  is  still  a  challenge  for  experimen¬ 
talists  and  theoreticians.  Nevertheless,  cosmic  and  Tokamaks-oriented  spec¬ 
troscopy  can  also  supply  valuable  tests  of  atomic  model  calculations  [2].  The 
most  reliable  results  for  both  energy  levels  and  transition  probabilities  are 
obtained  with  the  multiconiiguration  Dirac-Fock  method  [3].  However,  this 
approach  becomes  prohibitively  time-consuming  if  the  number  of  transitions 
to  be  studied  is  very  large.  According  to  Curtis  [4],  the  spectroscopic  classi¬ 
fication  of  the  relevant  lines  exceeds  the  general  capability  of  ab  initio  methods, 
and  sometimes  requires  the  application  of  semiempirical  met  .ods.  This  con¬ 
forms  with  the  earlier  remarks  by  Hafner  and  Schwarz  [  5  ]  in  their  application 
of  a  relativistic  pseudopotential  approach  to  the  calculation  of  a  large  number 
of  atomic  transition  probabilities.  Very  recently,  Laughlin  [6]  has  pointed  out, 
after  comparing  lifetimes  obtained  with  a  numerical  Coulomb  approximation 
and  a  model  potential  with  accurate  experimental  measurements  for  some 
alkali-like  systems,  that  high  accuracy  may  be  achieved  with  relatively 
simple  computational  procedures,  as  long  as  they  are  appropriate  to  the 
problem. 

The  quantum  defect  orbital  (rqdo)  method  [7]  and  its  relativistic  (rqdo)  for¬ 
mulation  [8]  have  proven  to  be  both  simple  and  reliable  procedures  to  predict 
oscillator  strengths.  Moreover,  being  simple  and  analytically  solvable  models,  these 
methods  may  give  a  deeper  insight  into  the  physical  nature  of  the  problems  than 
sophisticated  numerical  techniques  do.  A  special  feature  of  the  qdo  and  rqdo 
methods  is  that  the  computational  effort  does  not  increase  as  the  atomic  system 
becomes  heavier. 

The  principal  resonance  lines  of  the  Zn  and  Cd  isoelectronic  sequences  are  im¬ 
portant  spectral  features  for  the  diagnostics  and  modeling  of  high-temperature  mag¬ 
netically  confined  plasmas.  These  ions  contain  two  valence  electrons  outside  core 
electrons  in  tightly-bound  closed  shells,  and  their  spectra  remain  simple,  even  for 
very  highly  ionized  species.  The  spectral  transitions  most  commonly  observed  in 
fusion-type  plasmas  are  transitions  to  the  ground  state  from  low-lying  excited  states 
having  the  same  principal  quantum  number  [9].  In  the  zinc  and  cadmium  se¬ 
quences,  these  transitions  are  of  the  types  4s'-4s4p  and  5s’-5s5p.  respectively, 
which  give  prominent  spectral  lines.  Some  other  transitions,  such  as  4s4p-4s4d  and 
5s5p-5s5d  are  also  of  practical  interest.  Unfortunately,  experimental  data  for  these 
transitions  exist  only  for  the  lower  sequence  members,  and  are  mainly  restricted 
to  the  resonance  lines. 

In  this  work,  we  have  calculated  oscillator  strengths  for  the  above  transitions 
in  ions  belonging  to  the  Zn  and  Cd  isoelectronic  sequences  with  both  the  QDO 
and  RQDO  procedures,  with  and  without  explicit  inclusion  of  core-valence  cor¬ 
relation  through  two  forms  of  the  core-polarization  corrected  transition  operator, 
as  described  in  the  next  section.  The  relativistic  calculations  have  been  performed 
in  the  intermediate  coupling  scheme.  A  comparative  study  of  the  results  obtained 
with  other  theoretical  and  experimental  (where  available) /-values  will  also  be 
carried  out. 
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Method  of  Calculation 


The  relativistic  quantum  defect  orbitals  corresponding  to  a  state  characterized 
by  its  experimental  energy  E'  are  the  analytical  solutions  of  the  quasirelativistic 
second-order  Dirac-like  equation  [8] 


where. 


d'  A(A+1) 
dr 


'>7* 

nt 


'I'r 


Rl) 


2^,rd^ri} 


(1) 


A  =  n  -  M  +  1  -  6'  +  c . 

=  Z„rt(  1  +  «-£■') . 

(Z'nc,)-  ^  (1  +a-E'/2) 

2{n-d’f  (!+«-£')-■ 


(2) 

(3) 

(4) 


where  « is  the  relativistic  principal  quantum  number,  n  and  /  are  the  principal  and 
orbital  angular  momentum  quantum  numbers.  6'  is  the  relati\  islic  quantum  defect. 
c  is  an  integer  chosen  to  ensure  the  normalizability  of  the  wavefunction  and  its 
correct  nodal  structure.  is  the  nuclear  charge  acting  on  the  valence  electrons 
at  large  radial  distance,  and  o  is  the  fine  structure  constant.  Atomic  units  are  used 
throughout.  To  determine  the  wavefunctions.  5’  is  first  obtained  empirically  from 
Eq.  (4).  Then  the  model  potential  parameter  A  is  obtained  from  Eq.  (2)  and  sub¬ 
stituted  into  Eq.  ( 1 )  which  can  be  solved  analytically.  Since  the  effective  Hamiltonian 
in  Eq.  ( 1 )  includes  a  screening  term,  the  quantum  defect  orbitals  are  approximately 
valid  in  the  core  region  of  space. 

The  non-relativist ic  QDO  equation  is  obtained  b^  setting  «  equal  to  zero.  Then 
E'  becomes  the  experimental  term  energy,  averaged  over  its  fine  structure  com¬ 
ponents. 

Both  the  relativistic  and  non-relativistic  quantum  defect  orbitals  lead  to  closed- 
form  analytical  expressions  for  the  transition  integrals.  This  allows  u*;  to  calculate 
transition  probabilities  and  oscillator  strengths  by  simple  algebra  and  with  little 
computational  effort. 

The  radial  part  of  the  transition  integrals  involved  in  the  computation  of  transition 
probabilities  and  oscillator  strengths  is  defined  as 

R,rdr  (5) 

Jo 


where  the  simplest  form  of  the  transition  operator  is  Q(r)  =  r.  in  its  dipole-length 
formulation. 

One  aspect  of  the  computational  procedure  which  has  to  be  investigated  is  how 
to  correct  the  transition  operator  for  the  neglect  of  explicit  core-valence  correlation 
in  the  quantum  defects  orbitals.  It  is  well  known  that  core-valence  correlation  sig¬ 
nificantly  contributes  to  certain  transition  matrix  elements  The  model  potential 
parameter  given  by  Eq.  (2)  implicitly  accounts  for  core-valence  correlation,  also 
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known  as  core- valence  polarization.  This  partly  avoids  the  addition  of  a  core-po¬ 
larization  term  to  the  model  potential,  depending  on  free  parameters,  as  done  in 
some  relevant  procedures  [10.1 1].  An  additional  and  much  simpler  explicit  intro¬ 
duction  of  core-valence  polarization  can  be  achieved  by  w'riting  the  dipole-length 
transition  operator  in  the  first-order  corrected  form  originally  proposed  by  Hameed 
et  al.  [12] 

Q(r)  =  T[1  -  aJrY(r)]  (6) 

where  is  the  electric  dipole  polarizability  of  the  atomic  core,  and  /( r)  is  a  cut¬ 
off  function  which  tends  to  unity  for  r  -►  x  and  presents  divergence  for  ,•  -*•  0.  It 
is  difficult  to  derive  a  general  expression  for/(r).  In  previous  works  [13]  we  have 
used  that  proposed  by  Caves  and  Dalgarno  [14]; 

/(r)=  1 -exp{-r/r,)'  (7) 

where  r,  is  a  suitable  chosen  cut-off  parameter,  which  we  have  always  taken  to  be 
approximately  equal  to  twice  the  core  mean  radious. 

Very  recently  [15]  a  new  form  for/(r)  has  been  successfully  tested  within  the 
QDO/RQDO  context: 

/(r)  =  [l -exp(-r/r,)]^  (8) 

This  expression  offers  the  great  advantage,  unlike  Eq.  (6).  of  retaining  total  an- 
alyticity  in  the  QDO/RQDO  transition  integrals. 

In  the  present  calculations  the  above  three  forms  of  Q(r),  r,  and  those  given  by 
Eq.  (6 )  combined  with  Eqs.  ( 7 )  and  (8 ).  respectively,  have  been  employed  through¬ 
out.  The  core  polarizabilities  have  been  taken  from  Refs.  [16]  and  [17].  A  much 
greater  consistency  has  been  obtained  with  the  choice  of  Eq.  ( 8 )  over  the  choice  of 
Eq.  ( 7 )  for  the  cut-off  function,  in  the  sense  that  the  dependence  of  the  resulting 
oscillator  strengths  on  the  value  of  is  much  smaller  with  the  former  than  with 
the  latter.  Hence,  m  the  next  section  we  shall  only  present  the  RQDO  oscillator 
strengths  calculated  with  Q(r)  =  rand  with  Q(r)  given  by  Eqs.  (6)  and  (8). 

For  obtaining  the  quantum  defects  as  well  as  the  transition  energies,  the  empirical 
energy  data  that  we  consider  as  most  reliable  have  been  employed  [  1 8-25  ] . 

Results  and  Analysis 

The  relativistic  quantum  defect  orbital  (RQDO)  oscillator  strengths,  calculated 
within  the  intermediate  coupling  scheme,  for  the  zinc  and  cadmium  isoelectronic 
sequences,  are  displayed  in  Tables  I  to  IV.  Two  sets  of  RQDO /values  are  shown, 
those  computed  with  the  standard  dipole-length  operator,  Q(r)  =  r,  and  those 
where  core-valence  correlation  has  been  explicitly  introduced  by  using  the  core¬ 
polarization  transition  operator  given  by  Eqs.  (6)  and  (8).  We  have  studied  as 
man\  atomic  systems  as  those  for  which  empirical  energy  data  were  available. 
Other  oscillator  strengths,  from  theoretical  calculations  and  experimental  mea¬ 
surements,  found  in  the  literature,  have  been  included  in  the  tables  for  a  comparative 
analysis. 
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Table  1.  Oscillator  strengths  for  the  4s^  'S„-4s4p  *P,  transition  in  zinc-like  ions. 


RQDO* 

RQDO*’ 

RHF' 

RRPA** 

mcrrpa' 

HF* 

MCHF* 

MCHF*' 

Exp. 

Znl 

1.96 

1.86 

1.97 

2.02 

1.77 

1.56 

1.50  ±0.05' 
1.46  ±0.8’ 
1.1810.14* 
1.43  ±0.15' 
1.45  ±0.16" 

Gall 

2.63 

2.38 

1.97 

1.98 

2.58 

2.01 

1.76 

1.39  ±0.17" 
1.85  ±0.15” 

Gelll 

2.32 

2.04 

1.98 

1.98 

2.56 

2.00 

1.74 

1.38  ±0.22" 
1.85  ±0.2” 

AsIV 

2.06 

1.73 

1.94 

2.46 

1.94 

1.71 

1.12±0.11" 
1.56  ±0.23” 

SeV 

1.86 

1.53 

2.36 

1.89 

1.68 

0.96  ±0.18" 
1.3010.13” 

BrVI 

1.70 

1.48 

1.84 

1.83 

2.27 

1.84 

1.64 

0.8210.12” 
0.82  ±  0.07' 

KrVlI 

1.57 

1.3f 

1.78 

2.19 

1.79 

1.61 

0.81  ±0.09‘ 
0.94  ±  0.04’ 

RbVIll 

1.46 

1.26 

1.72 

1.73 

SrIX 

1.39 

1.25 

1.67 

YX 

1.39 

1.24 

1.63 

ZrXI 

1.33 

1.21 

1.59 

1.61 

NbXII 

1.28 

1.16 

1.55 

1.56 

MoXlII 

1.24 

1.13 

1.51 

1.54 

1.53 

1.86 

1.57 

1.43 

TcXIV 

1.17 

l.ll 

1.48 

RuXV 

1.12 

1.03 

1.44 

RhXVI 

1.07 

1.01 

1.41 

PdXVII 

1.02 

0.97 

1.38 

AgXVIII 

0.98 

0.93 

1.36 

1.37 

CdXIX 

0.95 

0.90 

1.33 

1.35 

InXX 

0.91 

0.88 

1.30 

SnXXI 

0.89 

0.85 

1.28 

1  68 

1  39 

1.29 

SbXXIl 

0.86 

0.83 

1.26 

TeXXIII 

0.83 

0.81 

1.24 

1.70 

1.37 

1.28 

IXXIV 

0.81 

0.79 

1.22 

1.24 

XeXXV 

0.79 

0.78 

1.25 

1.24 

1.23 

•rqdo. 

RQDO,  with  polarization. 

'  Biemont  et  al.  (1989)  [30). 

Shorer  and  Dalgamo  (1977)  [28]. 

'Cheng  and  Huang (1992)  [31]. 

Froese  Fischer  and  Hansen  ( 1 978)  [26]. 

'  Landman  and  Novick  (1964)  [41] 

'  Lurio  et  al.  (1964)  [42]. 

Baumann  and  Smith  (1970)  [43]. 

'  Andersen  and  Sorensen  (1973)  [33]. 

"*  Abjean  and  Johannin-Gilles  t'975)  [44). 

"Sorensen  (1973)  [34j. 

"  Andersen  et  al.  (1979)  beam-foil,  corrected  for  cascading  [35]. 
^  Bahr  et  al.  ( 1 982)  beam-foil,  corrected  for  cascading  [36]. 

"  Pinnington  et  al.  (1977)  [37]. 

'  Knystautas  and  Drouin  (1977)  [38]. 

‘Druetta  and  Buchet  (1976)  [39]. 

'  Irwin  et  al.  [40]. 
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Table  II.  Oscillator  strengths  for  the  4s4p  'Pi-4s4<J  'Dj  transition  in  zinc-like  ions. 


Ion 

RQDO* 

RQDO^' 

MCHF* 

MCDHF'’ 

MCDHF' 

RHF^ 

MP* 

SeV 

1.33 

1.20 

1.66 

1.65 

BrVI 

1.35 

1.21 

1.60 

1.60 

KrVII 

1.33 

1.16 

1.54 

1.53 

1.59 

RbVIII 

1.27 

1.11 

1.66 

1.68 

SrIX 

1.23 

1.08 

1.59 

1.62 

YX 

1.23 

1. 10 

1.53 

1.63 

ZrXI 

1.19 

1.08 

1.47 

1.49 

NbXII 

1.14 

1.03 

1.41 

1.94 

MoXlII 

1.09 

1.00 

1.21 

1.19 

1.26 

1.36 

1.38 

TcXIV 

1.02 

0.95 

1.31 

RuXV 

0.96 

0.90 

1.27 

RhXVI 

0.91 

0.85 

1.23 

PdXVII 

0.86 

081 

1.19 

AgXVIII 

0.81 

0.77 

1.15 

CdXIX 

0.77 

0  73 

1.12 

InXX 

0.73 

0.70 

1.09 

SnXXI 

0.70 

0.67 

0.93 

0.9 

0.98 

1.06 

SbXXII 

0.66 

0.64 

1.03 

TeXXIII 

0.63 

0.61 

1.02 

IXXIV 

0.60 

0.58 

XeXXV 

0.57 

0  55 

0.83 

0.8 

0.88 

0.95 

*RQtX). 

•’  RQDO,  with  polarization. 

'  Froese  Fischer  and  Hansen  (1979)  [27]. 

Froese  Fischer  and  Hansen,  intermediate  coupling  with  polarization  (1979)  [27]. 

'  Froese  Fischer  and  Hansen,  intermediate  coupling  with  polarization  and  approximate  correction 
relativistic  (1979)  [27]. 

'Biemont  et  al.  (1989)  [30]. 

•  Victor  and  Taylor  (1983)  [29]. 


Prior  to  analyzing  the  results  presented  for  each  individual  isoclectronic  sequence, 
a  few  general  comments  on  the  main  features  of  the  comparative  theoretical  and 
experimental  procedures  are  in  order. 

The  main  concerns  of  the  authors  whose  theoretical  results  for  the  zinc-like  ions 
are  included  in  Tables  I  and  II,  can  be  summarized  as  follows.  Most  of  them  have 
taken  into  account  intrashell  correlation  within  the  valence  shell  (or  outer  corre¬ 
lation,  in  the  terminology  of  Froese  Fischer  and  Hansen  [26,27])  [26-31],  and 
some  of  them  emphasize  the  importance  of  relativistic  effects,  particularly  for  the 
heavier  and  more  highly  ionized  atoms.  Shorer  and  Dalgamo  [28],  Victor  and 
Taylor  [29],  Biemont  et  al.  [30]  and  Cheng  and  Huang  [31]  follow  theoretical 
procedures  into  which  relativistic  effects  are  directly  incorporated,  whereas  Froese 
Fischer  and  Hansen  only  estimate  these  effects  by  using  observed  energies  and 
allowing  for  spin-orbit  interaction  [26],  or  by  using  multiconfiguration  Dirac- 
Hartree-Fock  energies  and  an  intermediate  coupling  scheme  [27]  in  their  multi- 
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Table  III  Oscillator  strengths  for  the  5s^  'S<,-5snp  'P(  (n  =  5. 6)  and  5s5p  'Pi-5snd  'Dj  transitions  in 

cadmium. 


MCRHF' 

cirhf' 

Ion 

Transition 

RQDO* 

RQDO'' 

Cl' 

RHFIC^ 

4-  CP 

-<  CP 

Exp. 

Cdl 

5s^  'S„-5s5p  ‘P| 

1.32 

1.19 

1.85 

2.13 

1.36 

1.35 

1.42±0.04» 
1.12  +  0.08" 
1.30  ±0.1' 

Inll 

1.61 

1.44 

1.64 

2.73 

1.58 

1.57 

1.26  +  0.20' 
1.43  ±0.09" 

SnIII 

1.62 

1.46 

2.7o 

1  61 

1.60 

1.30  ±0.20' 
1.36' 

1.47  ±0.15'" 
1.50  ±0.1" 

SblV 

1.57 

1.40 

2.71 

1  62 

1.62 

0.88  ±0.. 5' 

TeV 

1.54 

1.39 

2.66 

1  63 

1.57 

0  88  ±0.15' 
1.39  ±0.13"’ 
1.31  ±0.1° 

IVI 

1.48 

1.33 

1.49 

2.60 

I.6I 

1.57 

1.10  ±0.15° 

XeVlI 

1  51 

1.36 

1.48 

2  54 

I  60 

1.57 

1.1  ±01" 

1.22  ±0.07' 
1.45  ±0.10° 

CsVIII 

1.49 

1.35 

2.56 

1.70 

1.60 

BalX 

1.45 

1.33 

2.5 1 

1  70 

1.70 

Cdl 

5s^  'S^-5s6p'Pi 

0.124 

0.106 

XeVII 

0.009 

0.014 

CsVIII 

0.015 

0.020 

BalX 

0.022 

0.027 

Cdl 

5s5p'P|-5s5d'D2 

0.692 

0.672 

XeVII 

0.872 

0.808 

CsVIII 

0.85 

0.791 

BalX 

0.82 

0.764 

■RODO. 

’’  RQDO.  with  polarization. 

'Hibbert(1982)(471. 

Migdalek  and  Basils  (1986)  146] 

'Migdalek  and  Bojara  (1988)  (48) 

•  Luno  and  Novick  ( 1964)  [42]. 

'■  Baumann  and  Smith  (1970)  [4.3]. 

’  Andersen  and  Sorensen  (1973)  [33] 

'  Andersen  et  al.  ( 1 973)  [5 1  ]. 

^  Ansbacher  et  al.  (1986).  beam-foil,  corrected  for  cascading  [52]. 

^  Pinnington  et  al  (1985)  beam-foil,  corrected  for  cascading  [53]. 
"  Pinnington  et  al.  (1987b)  beam-foil,  corrected  for  cascading  [54]. 
°  Pinnington  et  al.  ( 1 987a)  corrected  for  cascading  [55]. 

"O’Neill  etal.(1979)  [24]. 

*•  Pinnington  ( 1976)  [56]. 

'  Kemahan  et  al.  (1980)  [57]. 


392 


LAViN  ET  AL. 


Table  IV  Oscillator  strengths  for  the  5s^  'So-5snp  'P,,  5s5p  'P|-5snp  'So,  5s5p  'P|-5snd  'D^  transitions 

in  cadmium. 


RADO* 

RODO*’ 

Semicmp.' 

SCHF^ 

Exp. 

5s^  'S„-5s5p  ‘P| 

1.32 

1  19 

2.05 

1.95 

1.42  +0.04' 
1.12  ±0.08' 

1  30  ±0.1* 

5s^  'S„-5s6p  'Pi 

0.124 

0.106 

0.124 

0  33 

5s^  'S„-5s7p  'P, 

0.375  (-1) 

0.31  (-1) 

0.309  (-1) 

5s^  'S„-5s8p  'P, 

0.166  (-1) 

0.136  (-1) 

0.125(-1) 

5s=  'S<.-5s9p  'P, 

0.885  (-2) 

0.72  (-2) 

0.632  (-2) 

5s‘  'S„-5sl0p  'P| 

0.523  (-2) 

0.423  (-2) 

5s^  'S„-5sl  Ip  'Pi 

0.341  (-2) 

0.275  (-2) 

5s5p  'Pi-5s6s  'S„ 

0.179 

0.176 

0.173 

0.014 

5s5p  'Pi-5s7s  'S„ 

0.926  (-2) 

0.883  (-2) 

0.108  (-1) 

5s5p  'Pi-5s8s  'S„ 

0.275  (-2) 

0.259  (-2) 

0.34  (-2) 

.“isSp  'Pi-5s9s  'S„ 

0.124  (-2) 

0.1 17  (-2) 

0.158  (-2) 

5s5p  'Pi-5sl0s  'S„ 

0.665  (-3) 

0.621  (-3) 

0.86  (-3) 

5s5p  'Pi-5sl  Is  'S„ 

0.401  (-3) 

0.374  (-3) 

0.524  (-3) 

5s5p  'Pi-5sl2s  'S„ 

0  277  (-3) 

0  259  (-3) 

5s5p  'Pi-5s5d  'Dj 

0.692 

0.672 

0.778 

5s5p  'Pi-5s6d  'D: 

0.973  (-1) 

0.922  (-1) 

0.951  (-1) 

5s5p  'Pi-5s7d  'Dj 

0.310  (-1) 

0.289  (-1) 

0.268  (-1) 

5s5p  'Pi-5s8d  'D: 

0.135  (-1) 

0.125  (-1) 

0.105  (-1) 

5s5p  'Pi-5s9d  'Dj 

0.752  (-2) 

0.692  (-2) 

0.56  (-2) 

5s5p  'Pi-5sl0d  'D: 

0  464  (-2) 

0  425  (-2) 

0.338  (-2) 

*  RQDO. 

RQtK).  with  polarization. 
'Zihtis(l971)[49] 

‘’Hclliwelin968)  [50]. 

'  Lurio  and  Novick  (1964)  [42]. 
'Baumann  and  Smith  (1970)  [43]. 

*  Andersen  and  Sorensen  (1973)  [33] 


configuration  Hartree-Fock  (MCHF)  treatments.  However,  the  better  agreement 
between  theory  and  experiment  brought  about  by  the  explicit  inclusion  of  core¬ 
valence  correlation  (or  full  correlation  [26,27],  although  recognized  by  some  of 
the  authors,  such  as  Cheng  and  Huang  [31],  is  only  carried  out  by  Froese  Fischer 
and  Hansen  [26,27].  Regarding  the  experimental  ./^values,  which  are  in  most  cases 
lower  than  the  oscillator  strengths  calculated  by  the  above  authors,  their  small 
magnitude  has  been  attributed  [32,31]  to  an  inadequate  treatment  of  cascading 
effects  in  the  measurement  of  lifetimes  with  beam-foil  techniques  which  result  in 
longer  lifetimes  for  the  excited  levels  than  the  actual  ones  and  too  small  oscillator 
strengths.  Some  of  the  most  recent  measurements  have  already  been  corrected  for 
cascading.  When  this  is  the  case,  we  have  made  a  remark  at  the  foot  of  the  tables. 
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In  Table  1  we  display,  together  with  the  rqdo  results,  the  oscillator  strengths 
calculated  by  Biemont  et  al.  [30]  with  the  relativistic  Hartree-Fock  (rhf)  method. 
These  authors  claim  to  have  accounted  for  the  most  important  relativistic  effects, 
as  well  as  for  valence  electron  correlation  by  including  all  the  configurations  within 
the  H  =  4  complex.  Also  shown  are  the  relativistic  random-phase-approximation 
(RRPA)  and  multiconfiguration  rrpa  (mcrrpa)  results  by  Shorer  and  Dalgarno 
[28]  and  Cheng  and  Huang  [31],  respectively,  the  three  sets  of /values  reported 
by  Froese  Fischer  and  Hansen  [  26  ]  with  the  approximate  relativistic  Hartree-Fock 
( HF),  multiconfiguration  Hartree-Fock  (mchf)  with  account  for  outer  correlation 
only,  and  with  full  correlation,  respectively.  The  experimental  data  found  in  the 
literature  are  also  shown  in  the  table.  Although  most  of  the  available  measurements 
are  the  result  of  beam-foil  experiments  [33-40],  some  others  come  from  Hanle 
effect  [41,42],  phase-shift  [43]  and  absorption  in  atomic  beam  [44]  observations. 
Large  discrepancies  between  the  observed  and  most  of  the  calculated  oscillator 
strengths  are  apparent,  the  discrepancies  increasing  with  increasing  degree  of  ion¬ 
ization.  In  their  extensive  study  of  this  transition,  Froese  Fischer  and  Hansen  [26] 
conclude  that  outer  shell  or  intravalence  correlation  brings  the  largest  change  in 
the  ./-values  at  the  lower  Z  end  of  the  sequence.  The  present  rqdo  results  with 
explicit  account  for  polanzation  are  in  fairly  good  accordance  with  the  experimental 
results,  considering  the  overall  degree  of  agreement  between  the  other  theoretical 
/-values  and  the  measurements.  In  particular,  the  rqdo /values  agree  best  with 
the  most  recent  experimental  values  [35,36]  which  are  claimed  to  have  reduced 
the  influence  of  cascading  in  the  lifetimes.  Except  for  the  less  ionized  species,  the 
RQDO  oscillator  strengths,  when  corrected  for  polarization,  agree  rather  well  with 
the  MCHF,  full  correlation,  results  of  Froese  Fischer  and  Hansen  [26].  For  Znl, 
Gall,  and  Gelll  the  correlation  within  the  valence  shell  is  expected  to  be  most 
important.  These  effects,  which  lead  to  a  reduction  in  the  magnitude  of  oscillator 
strengths,  are  not  explicitly  accounted  for  in  the  RQDO  procedure.  For  the  higher 
ions  for  which  experimental  data  are  available,  the  best  coincidence  between  theory 
and  experiment  is  achieved  with  the  RQDO  values. 

Table  II  contains  the  oscillator  strengths  for  the  4s4p  'P|-4s4d  'D2  transition  in 
the  zinc  sequence.  The  empirical  energy  data  available  for  this  transition  in  the  low 
Z  end  of  the  sequence  [45]  suffered  from  significant  uncertainties,  and,  hence,  the 
RQDO  results  reported  in  this  table  begin  with  SeV.  No  experimental  ./^values  have 
been  found  in  the  literature  for  the  ions  studied,  and  our  analysis  must  rest  on  the 
comparative  '  'icoretical  results  that  are  available.  These  comprise  the  multiconfigu¬ 
ration  Hartrcc-Fock  ( MCHF)  with  core-polarization /values  by  Froese  Fischer  and 
Hansen  [27],  the  multi-configuration  Dirac-Hartree-Fock  (mcdhf)  with  polar¬ 
ization,  and  within  the  intermediate  coupling  schei  le,  respectively,  by  the  same 
authors  [27],  the  RHF  results  by  Biemont  et  al.  [30],  obtained  with  the  same 
technique  as  for  the  resonance  transition,  and  the./^ values  calculated  by  Victor  and 
Taylor  [29]  using  a  semiempirical  model  potential  and  Dirac-Hartree-Fock  (dhf) 
orbitals  for  describing  the  core.  In  this  transition  it  is  observed,  as  in  the  previous 
one.  that  the  influence  of  core-valence  polarization  in  the  oscillator  strengths  de¬ 
creases  with  the  degree  of  ionization.  The  RQDO  results,  even  without  explicit  account 
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lor  polarization,  are  lower  in  magnitude  than  the  remaining  theoretical  data,  but, 
they  show  the  best  overall  accordance  with  the  MCDHF  with  polarization  /values 
of  Froese  Fischer  and  Hansen  [27],  Whether  the  experimental  measurements,  if 
available,  would  also  yield  lower  oscillator  strengths  than  most  of  the  theoretical 
results  and  would  show  a  reasonable  agreement  with  the  RQDO  values  is  hard  to 
predict.  The  uncertainties  in  the  empirical  energy  data  employed  in  our  calculations 
may  partly  account  for  the  RQDO /values  not  having  attained  a  better  agreement 
with  the  MCDHF  results. 

The  resonance,  5s^  'So-5s5p  'Pi,  transition  of  cadmium-like  ions  has  also  been 
the  object  of  several  theoretical  studies,  as  well  as  experimental  measurements. 
Migdalek  and  Baylis  [46]  have  also  been  concerned,  with  correctly  accounting  for 
intravalence  correlation,  core-valence  polarization  and  relativistic  effects  for  the 
cadmium  isoelectronic  sequence,  as  were  the  aforementioned  authors  who  studied 
the  zinc  isoelectronic  sequence.  Previous  calculations  of  similar  nature  are  those 
by  Hibbert  [47]  on  a  few  Cd-like  ions.  Hibbert  [47]  derived  configuration  interaction 
(ci)  wavefunctions  to  represent  the  intravalence  correlation,  keeping  the  core  frozen. 
He,  then,  estimated  empirical  factors  to  correct  for  errors  in  the  calculated  transition 
energies,  for  core-polarization  effects,  and  for  relativistic  spin  dependent  interactions. 
These  factors  demonstrated  the  importance  of  both  relativistic  effects  and  core¬ 
valence  correlation.  Migdalek  and  Baylis  [46],  with  an  approach  that  combines 
limited  relativistic  mixing  to  represent  intravalence  correlation  with  a  polarization 
model  to  account  for  core-valence  correlation,  report  results  for  many  of  the  ions 
studied  by  us  with  two  specific  methods,  the  relativistic  Hartree-Fock  in  intermediate 
coupling,  RHRC,  and  the  multiconfiguration  Hartree-Fock  with  a  polarization  cor¬ 
rection  (MCRHF-FCP).  The  calculations  of  Migdalek  and  Bojara  [48]  followed  a 
relativistic  configuration  interaction  procedure  with  explicit  core-polarization 
(CIRHF+CP).  These  authors  claim  to  have  obtained  reliable  oscillator  strengths,  at 
a  far  smaller  computational  expense  than  required  by  similar  scale  multiconfigu¬ 
ration  calculations.  Their  results  are  also  displayed  in  Table  III.  The  rqdo /values 
for  the  resonance  transition  of  Cd-like  systems,  without  explicit  core  polarization 
correction,  agree  quite  well  with  both  the  MCRHF+CP  and  CIRHF-FCP  results  of 
Refs.  [46]  and  [48] .  When  explicit  polarization  correction  is  introduced,  the  RQDO 
oscillator  strengths  become  smaller  than  the  previous  ones,  but  better  approach  the 
experimental  values.  Which  of  the  two  rqdo  sets  of  results  is  more  correct  is  hard 
to  decide.  Admiting  that  core-valence  correlation  is  important,  it  may  happen  that 
their  implicit  account  in  the  RQDO /values  calculated  with  Q{r)  =  r  suffice. 

Also  included  in  Table  III  are  the  rqdo  oscillator  strengths  for  the  5s'  'So-5s6p 
'Pi  and  5s5p  'Pi-5s5d  'D2  transitions  for  the  only  few  ions  for  which  experimental 
energy  data  were  available.  Given  the  reduced  number  of  results,  we  have  not  built 
independent  tables  for  them.  No  comparative  data  have  been  found  in  the  literature. 
The  RQDO /values  for  these  transitions  are  lower  in  magnitude  than  for  the  reso¬ 
nance  transition,  a  feature  that  was  to  be  expected. 

And,  finally,  in  Table  IV  we  display  the  oscillator  strengths  for  transitions  be¬ 
longing  to  the  principal  (5s^  'S„-5snp'P|),  sharp  (5s5p  'P|-5sns  'So)  and  diffuse 
(5s5p  'Pi-5snd  'D2)  spectral  series  of  neutral  cadmium,  for  which  some  empirical 
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energy  data  were  available.  As  comparative  results  we  have  found  those  calculated 
by  a  semiempirical  method  by  Zilitis  [49],  and  those  by  Helliwell  [50]  with  a 
semiempirically  modified  self-consistent-field  (SCF)  method.  A  few  experimental 
measurements  are  also  included  in  the  table.  For  the  resonance  transition,  a  fairly 
good  agreement  between  the  rqdo  results  without  explicit  polarization  and  the 
measurement  is  apparent,  while  the  other  two  theoretical  results  are  appreciably 
higher.  For  the  remaining  transitions  in  the  table  and  overall  good  agreement  be¬ 
tween  the  RQDO  and  the  semiempirical  /values  [49]  is  observed.  The  explicit  ac¬ 
count  for  core-valence  polarization  has  less  influence  in  the  rqdo /values  for  the 
higher  transitions  in  the  three  spectral  series  than  for  the  lower  ones.  The  effects  of 
explicitly  neglecting  intrashell  correlation  are  hard  to  establish,  given  the  nature  of 
the  comparative  data. 


Concluding  Remarks 

The  RQDO  oscillator  strengths  calculated  for  the  resonance  4s^  'S(,-4s4p  ‘Pi  and 
the  4s4p  'Pi-4s4d  'D2  transitions  in  the  zinc  isoelectronic  sequence  are  in  fairly 
good  agreement  with  the  results  of  the  rather  more  sophisticated  mcrrpa  [31]  and 
MCHF  [26,27]  calculations,  which  account  for  inner  and  outer  correlation,  as  well 
as  for  relativistic  effects.  The  rqdo /values  conform  with  the  experimental  results 
better  than  those  of  much  more  complex  theoretical  procedures,  although  the  ac¬ 
curacy  of  some  of  the  measurements  has  not  been  completely  established. 

For  the  cadmium  isoelectronic  sequence,  similar  comments  could  be  made  re¬ 
garding  the  correctness  of  the  rqdo  oscillator  strengths  for  the  resonance  transition. 
For  this  sequence,  the  most  complete  calculations  reported  are  those  of  the 
MCRHF-I-CP  [46]  and  cirhf+cp  [48]  procedures,  with  which  results  the  RQDO 
values  conform  reasonably  well.  For  neutral  cadmium,  the  correctness  of  the  rqdo 
oscillator  strengths  in  transition  belonging  to  the  principal,  sharp  and  diffuse  spectral 
series,  is  more  difficult  to  establish,  given  the  scarcity  of  comparative  data. 

Overall,  we  seem  to  find  reasons  to  be  hopeful  about  the  possibilities  of  the  RQDO 
formalism  for  predicting  spectral  properties  of  interest  in  astrophysics  and  nuclear 
fusion  research.  These  reason  rest  on  the  correctness  of  the  results  so  far  achieved, 
and  the  low  computational  expense  and  avoidance  of  the  numerous  convergence 
problems  which  are  common  in  the  multiconfigurational  approaches. 
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Abstract 

In  recent  publications  we  have  presented  a  general  theory  for  the  identification  and  computation  of 
correlated  wasefunctions  of  a  particular  class  of  doubly  excited  states  which  constitute  a  two-electron 
ionization  ladder  (TEIl)  leading  smoothly  to  the  so-called  Wannier  state  at  f  =  0  In  this  work,  we 
examine  further  the  properties  of  these  wavefunctions  for  two-electron  atoms  of 'S  and  'P"  symmetry, 
especially  as  regards  their  analysis  in  terms  of  hydrogenic  basis  sets  and  good  quantum  numbers.  We 
find  that  the  Hemck-Smanoglu  (K,  T)  classification  loses  accuracy  as  we  move  toward  threshold  and 
we  show  that,  when  single  as  well  as  double  excitations  are  considered,  a  better  quantum  number  for 
the  TEIl.  IS  =  A  -  I  -  K.  where  .V.A'  are  not  good  numbers  anymore  The  extent  of  the  breakdown 
of  the  (A.  T)  representation  depends  on  the  system  and  on  the  level  of  excitation  (more  serious  in 
negative  ions  and  for  high  lying  states).  «  I99.t  John  Wiley  &  Sons.  Inc. 


Introduction 

According  to  a  recently  developed  theory  [  1  -3  ] .  the  wavefunctions  of  a  class  of 
doubly  excited  states  ( Df .s )  which  leads  smoothly  to  the  Wannier  state  at  threshold, 
i.e.,  to  the  state  where  the  two  electrons  are  ejected  with  zero  energy  in  opposite 
directions,  can  be  identified  prion  and  computed  from  first  principles  with  great 
accuracy.  This  class  has  been  called  the  H’annier  two-dectron  ionization  ladder 
(TEIL).  The  concepts,  the  formalism,  and  the  computational  methods  are  general, 
and  allow  practical  applications  to  polyelectronic  atoms  and  to  large  values  of  the 
principal  quantum  number  n.  By  now,  we  have  predicted  the  teils  of  H~,  He, 
Li^  ('P"  symmetry).  H  ('S),  He  (^S),  and  Li “{'S)  [1-3].  Correlated  wavefunc¬ 
tions  up  to  n  =  10  have  been  obtained  and  a  number  of  conclusions  have  been 
drawn  from  their  analysis  [1-3]. 

In  this  article  we  proceed  further  with  the  study  of  the  TEIL  wavefunctions,  in 
two  directions. 

( 1 )  Since  many  of  the  arguments  and  computations  in  the  literature  of  DES  are 
based  on  configuration-interaction  calculations  with  hydrogenic  functions, 
e.g.  [4-6],  we  have  related  the  information  obtained  from  our  state-specific 
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TEIL  wavefunctions  and  energies  to  the  information  obtained  in  terms  of  hy- 
drogenic  basis  sets.  The  conclusions  are  drawn  from  calculations  on  the  'S 
and  'P"  symmetry  of  a  two-electron  system  with  Z  =  1,  2,  3,  and  hydrogenic 
quantum  numbers  «  =  6-15. 

(2)  Wulfman  [8]  and  Sinanoglu  and  Herrick  [4]  have  pioneered  the  use  of  group 
theory  for  the  approximate  treatment  of  the  structure  of  des  and  for  the  es¬ 
tablishment  of  new  classification  schemes.  The  quantum  numbers  K,  T  [4] 
have  since  been  used  repeatedly  in  the  phenomenology  and  qualitative  analysis 
of  DES  [  6,9, 10].  Given  that  our  theory  yields  correlated  wavefunctions  for  low 
as  well  as  for  high  n — previously  not  available — the  accuracy  of  the  {K,  T) 
classification  for  the  TElL  can  be  tested  as  a  function  of  energy  and  nuclear 
charge. 


The  ‘S  and  ‘P®  TEIL  Wavefunctions — Comparison  with 
Hydrogenic  Basis  Set  Calculations 

The  theory  of  Refs.  [1-3]  shows  how  correlated  wavefunctions  of  the  TEIL  in 
polyelectronic  atoms  can  be  computed  from  first  principles.  We  have  demonstrated 
that  these  highly  correlated  systems  can  be  treated  in  terms  of  orbitals  and  config¬ 
urations,  provided  the  state-specific  approach  to  the  electronic  structure  of  excited 
states  is  employed  (e.g.  [11,12]).  Hi^  n  states  (n  up  to  10)  became  amenable  to 
calculation  for  the  first  time.  The  demonstration  [  1  ]  that  the  TEIL  states  correspond 
to  the  lowest  energy  in  each  manifold  of  hydrogenically  degenerate  states  and  that 
they  satisfy  the  quantum  mechanical  version  of  7]  =  -fi  has  recently  also  been 
used  [6]  for  the  identification  of  such  states  in  He  'P® — albeit  of  low  n  only — from 
Ci  wavefunctions  with  hydrogenic  orbitals.  (Other  orbitals,  such  as  Sturmians  or 
numerical,  can,  of  course,  also  be  used  in  such  truncated  diagonalization  procedures 
for  two-electron  atoms). 

In  this  article  we  analyze  further  the  teils  of 'S  and  'P®  symmetries.  For  small 
/I  (up  to  6),  the  wavefunction  corresponding  to  the  square  integrable  part  of  the 
resonance  [1-3,12,13],  for  each  shell  n,  is  given  by 

W„=Wl  +  X„  (1) 

where  IF®  is  obtained  from  a  multiconfigurational  Hartree-Fock  ( MCHF)  calculation 
and  X„  is  obtained  variationally.  Both  IF®  and  X„  are  subjected  to  specific  orthog¬ 
onality  constraints  [1-3,12,13].  IF„  may  corresponu  to  a  dielectronic  or  to  a  po- 
lyclectronic  system  and  is  composed  of  configurations  which  are  symmetry  adapted. 

For  small  atoms,  the  configurations  which  enter  IF^  are  of  the  intrashell  type, 
i.e., «!  =  «2  =  w  for  the  two-valence  electrons.  The  self-consistent  calculation  takes 
into  account  angular  as  well  as  radial  correlation.  The  mchf  solution  for  the  lowest 
energy  results  in  positive  coefficients,  provided  the  Condon  and  Shortley  sign  con¬ 
vention  of  the  Clebsch-Gordon  coefficients  is  followed.  For  n  >  6  the  Mc  hf  solutions 
do  not  converge  and  we  proceed  as  follows:  (a)  We  diagonalize  the  Hamiltonian 
in  a  hydrogenic  basis  with  Nq  <,  «,  n'  ^  iV©  +  3;  (b)  We  reexpress  the  resulting 
eigenvectors  in  the  basis  of  the  natural  orbitals  [7].  This  brings  the  wavefunction 
again  in  the  form  ( 1 ).  We  have  confirmed  that  for  n  ^  6,  this  procedure  yields 
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orbitals  and  coefficients  identical  to  the  MCHF  ones.  Thus,  the  two  procedures  are 
equivalent. 


The  TEIL  and  its  Good  Quantum  Numbers — The  Importance 
of  Double  Excitations 

It  has  been  emphasized  in  the  literature  of  des  that  the  0(3  J,  single  configuration 
approximation  (sca)  breaks  down.  It  is  important  to  remind  the  reader  that  this 
fact  characterizes  other  situations  as  well,  such  as  valence-Rydberg  mixings  or  mo¬ 
lecular  structure  in  the  dissociation  region.  Its  remedy  does  not  necessarily  imply, 
at  least  not  yet,  the  abandonment  of  the  orbital  concept  and  configuration  mixing. 
It  is  true  that  it  looks  simpler — and  perhaps  more  elegant — to  have  only  one  “con¬ 
figuration”  in  order  to  describe  in  zeroth  order  the  state  of  a  system.  However,  this 
does  not  mean  that  the  method  used  to  obtain  it  is  more  practical  and  easily  ex¬ 
tendable  to  incorporate  the  remaining  electron  correlation.  On  the  other  hand,  for 
a  theory  of  des  which  defines  and  computes  consistently  a  multiconfigurational 
zeroth  order  vector  as  well  as  its  important  corrections,  the  resulting  information 
can  be  quantitative  as  welt  as  conceptual.  Having  said  this,  we  turn  to  the  problem 
of  optimizing  the  zeroth  order  representation  of  des  in  terms  of  “configurations” 
with  good  quantum  numbers. 

Given  the  multitude  of  closely  lying  des  and  the  related  complexities  of  the 
standaid  Cl  methods,  the  group  theoretic  analyses  of  Wulfman  [8]  and  of  Herrick 
and  Sinanoglu  [4,9]  have  shed  new  light  from  a  different  angle  to  the  problem  of 
the  economical  qualitative  description  and  meaningful  classification  of  des  in  two- 
electron  atoms.  In  these,  the  concept  of  the  doubly  excited  symmetry  basis  (desb) 
was  introduced,  which  is  defined  in  terms  of  hydrogenic  solutions.  The  desb  states 
are  represented  by  a  specific  linear  combination  of  configurations  of  a  given  L  and 
Ml,  for  various  allowed  values  of  l\  and  h- The  coefficients  of  the  expansion  depend 
on  the  principal  quantum  numbers  «i  and  as  well  as  on  L  and  two  additional 
quantum  numbers  J,  and  Ji  whose  range  of  values  is 


n-  N  ,  n  +  A'  -  2 

; -  /  =  1,  2 


where  n  =  max(/i|,  'V  =  min(ni,  «2)  (2a) 

with  the  additional  restriction 

(2b) 

In  the  expressions  for  the  eigenvalues  of  certain  operators,  the  quantum  numbers 
J\  and  Jz  appear  only  as  sums  or  as  differences.  Thus.  Wulfman  ( 8  ]  defined  P  =  J\ 
+  Jz  and  Q  =  J\  -  Jzss  the  appropriate  quantum  numbers.  On  the  other  hand, 
Herrick  and  Sinanoglu  define  T  =  \Q\  and  K  -  P  -  n  +  1  as  the  pair  of  the  new 
quantum  numbers.  The  first  choice  is  due  to  the  fact  that  only  simple  ±  admixtures 
of  + 1  (21  and  -\Q\  states  have  definite  parity.  The  second  is  bom  out  of  numerical 
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Tablf  1.  Analysis  of  the  configuration-interaction  uavcfunctions  corresponding  to  the  lowest  energy 
of  each  manifold  in  terms  of  DESB  vectors.  The  AT  fraction  contains  all  the  intrashell  «!  =  «;  =  A'o  desb 
configurations  plus  all  the  intershell  ones  resulting  from  it  through  double  excitation  The  t  T  fraction 
includes  the  double  excitations  with  a  higher  A„„  =  .V  -  I  -  T.  The  most  important  b\  far  is  that  with 

A’  =  A'o  +  1 . 


A' 

L 

H- 

He 

Li" 

AT 

fraction 

FT 

fraction 

A7‘ 

fraction 

FT 

fraction 

KT 

fraction 

FT 

fraction 

6 

0 

0  98777 

0.99519 

0.99879 

0.99886 

0.99839 

0.99845 

1 

0.97470 

0.98761 

0.99700 

0.99724 

0.99782 

0.99786 

7 

0 

0.98432 

0.99499 

0.99884 

0.9 '"02 

0.99867 

0.99871 

1 

0.97116 

0.98764 

0.99711 

0.99'’55 

0.99811 

0.99814 

8 

0 

0.98015 

0.99469 

0.99872 

0.99911 

0.99893 

0.99895 

1 

0.96668 

0.98734 

0  99695 

0.99770 

0  99839 

0.99843 

9 

0 

0.97537 

0.99431 

0.99845 

0.99913 

0.99914 

0.99916 

1 

0.96149 

0.98683 

0.99656 

0.99772 

0.99863 

0.99869 

10 

0 

0.97000 

0.99390 

0.99801 

0  99910 

0.99929 

0.99934 

1 

0.95572 

098619 

0.99595 

0.99763 

0.99880 

0.99890 

11 

0 

0.96428 

0.99347 

0.99741 

0.99902 

0.99937 

0.99947 

1 

0.94950 

0.98547 

0  99514 

0.99746 

0.99889 

0.99906 

12 

0 

0.95812 

0.99302 

0.99665 

0.99890 

0.99940 

0.99957 

1 

0.94289 

0.98471 

0  99414 

0.99722 

0  99890 

0.99918 

1.1 

0 

0.95163 

0  99257 

0  99573 

0  99875 

0  99935 

0.99963 

1 

0.93599 

0.98393 

0.99295 

0.99692 

0.99881 

0.99924 

14 

0 

0.94486 

0.99212 

0.99466 

0.99858 

0.99923 

0.99967 

1 

0.92885 

0.98314 

0.99160 

0.99658 

0.99864 

0.99926 

15 

0 

0  93787 

0  99169 

0.99.t45 

0.99839 

0  99905 

0.99968 

1 

0.92153 

0.98235 

0.99008 

0  99620 

0.99838 

0.99923 

results.  It  is  useful  because,  in  this  way.  a  whole  Rydberg  scries  can  be  assigned  the 
same  A'.  T.  Their  allowed  values  are  [4.9] 

7'=0.  !•••.  min(Z,.  A  -1)  (.^a) 

and 

-( A'  -  1  -  7")  <  A'  ^  -  1  -  7’  in  steps  of  2  (3b) 

Robaux  [6]  has  subjected  hydrogenic  basis  Cl  wavefunctions  for  He  'P"  DES 
(,V  =  2-6)  to  a  (A'.  7)  analysis,  where  configurations  arc  made  up  of  fixed  N  and 
a  variable  /i.  according  to  the  Herrick  and  Stnanoglu  prescription  [4).  Her  main 
findings  and  conclusions  arc: 

( 1 )  In  accordance  with  the  results  of  Ref.  [1].  the  lowest  energy  state  within  an  N 
manifold  yields  (n)  =  (ry).  The  set  of  these  states  defines  the  teii  . 
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Table  II.  Purity  of  the  Fquantum  number  in  the  first  three 
intrashell  'S  states  (T  =  0)  of  the  /V  =  6  manifold.  It  is  best 
for  the  lowest  state,  the  TEIL  state,  studied  here  and  in  our 
earlier  work.  It  deteriorates  for  the  other  roots. 


F 

A’ =  6 

H- 

'S 

He 

Li* 

First  intrashell  state 

0 

0.99519 

0.99886 

0.99845 

2 

0.00478 

0.00112 

0.00153 

4 

0.00004 

0.00002 

0.00002 

Second  intrashell  state 

0 

0.00368 

0.0001 1 

0.00090 

2 

0.96606 

0.99253 

0.99550 

4 

0.02953 

0.00728 

0.00353 

6 

0.00072 

0.00009 

0.00008 

Third  intrashell  state 

0 

0.00008 

0.00002 

0.00002 

2 

0.04900 

0.00251 

0.00010 

4 

0.85820 

0.96300 

0.98441 

6 

0  08722 

0.03364 

0.01521 

8 

0.00522 

0.00080 

0.00024 

(2)  For  the  teil,  K  and  T  are  good  numbers  (see  Table  111  of  Ref.  [6]).  Given 
that  the  ideas  of  DESB  states  and  K.  T  or  molecular  classification  schemes 
[4.6,9, 10, 1 5  ]  have  thus  far  been  applied  only  to  low-lying  DES.  it  is  important 
to  establish  the  degree  of  validity  of  the  goodness  of  the  K,  T  classification  as 
N  increases.  Our  TEIL  wavefunctions  reach  A’  =  1 5  and  offer  a  good  possibility 
for  such  an  analysis. 

We  have  expanded  our  TEIL  functions  in  terms  of  DESB  states,  without  the  re¬ 
striction  of  fixed  A',  a  restriction  implying  that  only  single  excitations  are  included. 
Instead,  we  introduce  a  lowest  value  A',,  which  corresponds  to  the  mchf  vector, 
and  then  expand  in  terms  of  DESB  states  whose  quantum  numbers  A  and  n  satisfy 
n  >  A’  >  A',,.  In  this  way,  double  excitations  are  included  in  the  expansion,  while 
the  whole  state  remains  orthogonal  to  the  lower  threshold  channels.  Let 

»:<„  =2222  CV„\NnKT(LS)  >n>N^  N„  (4) 

n  ^  K  1 

First  we  examine  T.  For  '  5  states.  T  can  have  only  one  value,  T  =  0.  Therefore, 
no  conclusions  can  be  drawn  from  this  case.  On  the  other  hand,  for  the  '  P"  states, 
T  =  I  for  the  intrashell  DESB  states  (//  =  A)  but  it  can  assume  two  values.  T  =  0 
and  1,  for  the  intershell  DESB  states  (h  #  A  ).  Therefore,  the  'P"  states  offer  the 
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Table  III.  Energies,  in  atomic  units,  of  the  configuration- 
interaction  wavefunctions  corresponding  to  the  lowest  root 
of  each  manifold  for  'Sand  'P  symmetries  of  H",  He  and  Li*. 
The  energies  are  well  approximated  by  the  values  produced 
from  the  formula.  Ref.  [Ic]  E„  =  -C(2)  [n(n  ~  l)/n‘). 


'S 

N 

H- 

He 

Li* 

6 

-0.01824 

-0.09074 

-0.21863 

7 

-0.01.452 

-0.06693 

-0.16102 

8 

-0.01042 

-0.05140 

-0.12352 

9 

-0.00828 

-0.04072 

-0.09775 

10 

-0.00674 

-0.03305 

-0.07928 

II 

-0.00560 

-0.02737 

-0.06560 

12 

-0.00472 

-0.02303 

-0.05518 

\i 

-0.00404 

-0.01966 

-0.04706 

14 

-0.00349 

-001697 

-0  04061 

15 

-0.00305 

-001480 

-0.03540 

'P 

N 

H- 

He 

Li* 

6 

-0.01756 

-0.08902 

-0.21577 

7 

-0.01311 

-0.06590 

-0.159.32 

8 

-0,01016 

-0.05074 

-0.12243 

9 

-0.00810 

-0.04027 

-0.09701 

10 

-0.00661 

-0.03274 

-0.07876 

II 

-O.OO.V.SO 

-0.02714 

-0.06521 

12 

-0.00465 

-0.02286 

-0.05489 

1.4 

-0.00398 

-0.01952 

-0.04683 

14 

-0.00345 

-0.01687 

-0.04043 

15 

-0.00301 

-0.01472 

-0.03526 

opportunitv  of  testing  whether  7  is  a  good  quantum  number.  It  turns  out  that 
intersheli  desb  functions  with  7=0  have  negligible  overlap  with  Therefore, 
our  numerical  results  show  that  7  is  a  good  number. 

Next,  let  us  examine  A',  given  a  value  of  7.  The  intrashell  DESB  function  with 
the  lowest  energy  has  K  =  =  A'  -  1  -  7(A’  =  A’,,).  Our  computations  show 

that  it  gives  the  highest  contribution  to  the  expansion  (4).  When  only  single  ex¬ 
citations  are  added,  Ama,  remains  a  goed  quantum  number.  Things  change  when 
an  improvement  of  the  total  wavefunction  is  made,  by  including  double  excitations 
as  well.  Considering  the  first  double  excitation,  (A'  =  A’„  +  1 ),  we  see  that  it  does 
not  contain  A'^j,  (A  runs  in  steps  of  two).  On  the  other  hand,  our  calculations 
show  that  the  highest  contribution  comes  from  the  term  with  K  =  An,,,  =  N 
-1-7,  (A-  =  A-„  +  1 ). 

We  note  that  the  same  analysis  of  the  other  roots  of  each  manifold  N„  shows 
characteristics  very  similar  to  the  ones  discussed  above  for  the  lowest  energy  state 
(the  TEiL  state). 
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Taking  into  account  the  aforementioned  findings,  we  have  looked  for  a  new 
quantum  number  to  replace  K,  given  that  cannot  be  kept  fixed  anymore.  Indeed, 
such  a  number  exists  and  is  defined  by 

F  =  N-l-K„,„=T,  (5) 

for  the  lowest  energy  state  of  each  No  manifold.  Here,  neither  N  or  are 

good  quantum  numbers. 

Eq.  (5)  refers  to  the  lowest  energy  state  of  each  manifold  N„.  In  general, 

F  =  N-l~K  (5a) 

Since  the  range  of  K  is  given  by  (3b),  the  corresponding  range  of  F  is 

T^F^IN  —  1  —  T,  in  steps  of  2  (6 ) 

This  proposal  extends  and  revises  the  scheme  of  Herrick  and  Sinanoglu  where 
both  N  and  K  are  considered  to  be  good  quantum  numbers.  If  this  is  assumed  to 
be  the  case,  then,  of  course,  the  definition  and  use  of  F  is  redundant. 

Given  the  above,  the  appropriate  representation  of  W  is 

2  2  C^^\NnFnLS)  >n^N^No  (7 ) 

n  S 

Thus,  having  found  through  computation  based  on  our  TEIL  wavefunctions  that 
for  high-lying  DES  the  (K,T)  representation  loses  accuracy,  we  have  identified  the 
cause  of  the  breakdown  in  the  importance  of  the  double  excitations.  The  magnitude 
of  their  effect  depends  on  the  system  and  on  the  level  of  excitation. 

The  numerical  demonstration  of  these  conclusions  is  given  in  Table  1.  The  im¬ 
portance  of  DESB  states  with  doubly  excited  configurations  manifests  itself  in  H  " 
already  for  small  N„.  For  He  and  the  positive  ions  we  must  move  to  higher  values 
of  No,  close  to  10,  for  their  contribution  to  become  important. 

Thus,  the  FT  scheme  provides  approximately  good  quantum  numbers  for  rela¬ 
tively  high  values  of  Ng  even  for  the  difficult  case  of  H  ^ .  Nevertheless,  for  sufficiently 
high  values  of  N„  all  quantum  numbers  are  expected  to  cease  being  good  ones.  For 
the  other  intrashell  states  of  each  A'-manifold,  matters  deteriorate  for  lower  values 
of  No-  The  extent  of  the  breakdown  of  the  F  quantum  number  depends  on  the 
system  and  on  the  level  of  excitation  (more  serious  in  low  values  of  Z  and  for  high- 
lying  states)  as  well  as  on  the  value  of  f  (more  serious  for  high  values),  as  it  is 
shown  in  Table  II.  For  the  sake  of  completeness,  the  energies  of  the  wavefunctions 
computed  for  this  work  are  given  in  Table  III. 

Conclusion 

The  computation  of  the  correlated  TEIL  wavefunctions  [1-3]  in  the  regime  of 
high  principal  quantum  numbers  has  allowed  for  the  first  time  the  quantitative 
examination  of  assumptions  and  group  theoretic  results  obtained  using  hydrogenic 
basis  sets.  The  'Sand  the  'P"  symmetries  were  studied  for  Z=  1,2, 3.  One  conclusion 
is  that  the  Herrick-Sinanoglu  (K,  T)  representation  for  two  electron  DES  [4,9] 
loses  its  accuracy  as  we  move  up  in  energy.  It  is  found  that  neither  N  nor  K  are 
good  numbers.  The  reason  that  this  breakdown  is  not  seen  in  the  low-lying  states 
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is  the  fact  that  in  this  case,  the  relative  importance  of  the  double  excitations  (with 
hydrogenic  functions)  is  small.  We  have  explained  this  breakdown  in  terms  of  the 
spatial  characteristics  of  the  wavefunctions  and  have  shown  why  it  is  more  important 
in  H",  i.e..  the  negative  ions. 

Finally,  we  have  proposed  a  new  quantum  number.  F  =  N  -  I  -  K,  [Eqs.  (5), 
(5a).  (6)]  which  remains  “good”  (not  exact  of  course)  for  low-  as  well  as  for  high- 
lying  DES.  We  note  that  F  is  numerically  the  same  quantity  as  vz,  the  number  of 
bending  quanta  in  the  molecular  picture  of  DES  (Refs.  [15,9]). 

Therefore,  the  results  of  Table  II  provide  a  physical  picture  of  the  relative  im¬ 
portance  of  the  vibration-bending  modes  i>2  =  0,2,  and  4  in  the  first  three  intrashell 
states  of 'S  symmetry. 
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Abstract 

In  the  Dyson  equation,  the  self-energy  describes  all  relaxation  and  correlation  corrections  to  Koopmans’s 
theorem.  Energy-independent  contributions  to  the  self-energy  depend  on  the  one-electron  reduced  density 
matrix  of  the  reference  state  Three  formalisms  for  calculating  these  terms  are  considered;  perturbation 
theory,  coupled-cluster  theory,  and  contour  integral  theory.  In  one  method,  combinations  of  coupled- 
cluster  singles  and  doubles  amplitudes  are  substituted  for  first-order  double  excitation  coefficients  and 
for  second-order  single  excitation  coefficients.  Another  approach  generates  a  description  of  reference 
state  correlation  through  the  evaluation  of  approximate  contour  integral  expressions.  Calculations  on 
electron  binding  energies  of  closed-shell  molecules  and  anions  reveal  that  the  coupled-cluster  results  for 
the  energy-independent  self-energy  terms  are  closer  to  the  perturbative  results  than  to  the  contour  integral 
values.  199.^  John  Wiley  &  Sons.  Inc 

Introduction 

Applications  of  electron  propagator  theory  [1-7]  in  quantum  chemistry  have 
been  confined  chiefly  to  the  calculation  of  ionization  energies  and  electron  affinities 
at  a  fixed  nuclear  geometry.  Recent  developments  have  extended  the  scope  of  ap¬ 
plications  to  effective  densities  and  energy  gradients  for  initial,  A^-electron  states 
and  final  states  with  N  ±  1  electrons  [  8  ] .  At  the  heart  of  these  procedures  lie  choices 
for  the  treatment  of  correlation  in  propagator  reference  states.  These  choices  ideally 
are  matched  with  operator  manifolds  that  produce  a  balanced  description  of  many 
states.  While  the  choice  of  operator  manifolds  is  determin-’d  principally  by  the 
kinds  of  final  states  under  study,  there  is  comparatively  greater  license  in  reference 
state  approximations.  In  the  context  of  diagrammatic  approaches,  these  approxi¬ 
mations  generally  originate  from  perturbative  improvements  to  a  Hartree-Fock 
wavefunction.  Recent  attempts  to  achieve  renormalized  descriptions  of  reference 
state  correlation  have  employed  coupled-cluster  wavefunctions  [9].  In  addition, 
there  have  been  many  applicatiens  of  multiconfigurational  SCF  wavefunctions  and 
related  operator  manifolds  [10].  All  of  these  methods  depend  on  separate  deter¬ 
minations  of  reference  state  wavefunctions  that  are  essentially  independent  of  the 
propagator  formalism.  Separate  calculations  on  reference  state  correlation  can  be 
avoided  entirely  by  systematic  improvements  to  the  operator  manifold;  an  uncor¬ 
related  reference  state  suffices  for  such  an  approach  [  4  ] .  A  disadvantage  of  methods 
of  this  type  is  that  they  require  large  operator  manifolds  to  compensate  for  the 
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absence  of  reference  state  correlation.  An  intermediate  method,  one  which  makes 
limited  use  of  correlated  reference  states,  can  be  derived  from  propagator  expressions 
for  the  density  matrices  of  these  states  [  6, 1 1  ] .  Self-consistent  descriptions  of  density 
matrices  and  electron  binding  energies  can  be  achieved  in  this  manner. 

In  this  article,  three  methods  for  calculating  contributions  to  the  electron  prop- 
ag.ator’s  self-energy  matrix  are  compared.  This  matrix,  which  contains  all  correlation 
corrections  to  Koopmans’s  theorem  electron  binding  energies,  has  contributions 
which  depend  on  the  one-electron  density  matrix  of  the  reference  state.  Correlated 
density  matrices  are  obtained  from  perturbation  theory  on  a  Hartree-Fock  wave- 
function.  from  a  coupled-cluster  wavefunction  and  from  contour  integrals  involving 
the  self-energy  matrix.  Numerical  results  on  ionization  energies  and  electron  affinities 
complete  the  comparison. 


Electron  Propagator  Theory 


Elements  of  the  electron  propagator  matrix  contain  information  about  the  A- 
electron  reference  state  and  final  states  with  A’  ±  1  electrons.  The  p.  q  element  of 
this  matrix,  where  p  and  q  are  general  spin-orbital  indices,  is  given  by 


lim 

,-o 


y  1.«)<.V-  l./t|fl^|A’> 

f  E  + E„(N- D- HoiM- IV 


^  y  (A'lrr<;|A' -b  1./;)<A'4-  l./;|r/^|A’) 
E  -  E„(  A’  +  1 )  +  Ao(  A)  +  11) 


Poles  are  values  of  E  that  produce  singularities,  for  they  equal  an  ionization  energy 
or  an  electron  affinity.  Another  useful  expression  of  the  electron  propagator  matrix 
employs  superoperator  theory  [12].  The  superoperator  metric,  defined  by 

(Y|Z)  =  <0|[Y^ZM0>. 

depends  on  the  choice  of  the  N-electron  reference  state,  10),  and  on  the  space  of 
ionization  operators.  The  identity  and  Hamiltonian  superoperators  operate  on  the 
latter;  the  defining  equations  are 

ix  =  X, 


and 

HX  =  [X,  Hj-  . 

In  this  notation,  the  electron  propagator  matrix  is 

G(/0  =  (a|(/;/-//)  'a). 

but  a  more  useful  form  that  avoids  an  inverse  operator  in  favor  of  a  matrix  inversion 
is 

G(E)  =  (a|h)(hi(/:7-  //)h)  '(hla).- 
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where  h  stands  for  the  full  space  of  ionization  operators.  After  partitioning  this 
space  into  the  primary  space  of  simple  field  operators,  a,  and  an  orthogonal  sec¬ 
ondary  space,  f.  one  obtains 


G(£)  =  [(aia)(a|f)] 


//)a)  (a|(£/-//)f)- 

-1 

■(a|a)' 

//)a)  (fl{£/-//)f). 

.(f|a). 

=  [10] 


£1  -(a|Ha) 
-(f|Ha) 


-(a|Hf)  ■ 

-\ 

r 

£1  -(f|Hf). 

.0. 

The  Dyson  equation  can  be  derived  at  this  point  by  observing  that  only  the  upper- 
left  block  of  the  inverse  matrix  is  relevant  and  by  considering  the  inverse  propagator 
matrix.  One  obtains 


G-'(£)  =  £1  -  (a|Ha)  -  (a|Hf  )[£1  -  (f|Hf  )]-‘(f|Ha) . 


The  second  term’s  elements  depend  on  the  reference  state’s  one-electron  reduced 
density  matrix,  p, 

(ap|//a,)  =  //p,  +  2  (P9lk5)<0|£ijfl,|0>  =  Ap,  +  2  (wllw)p«. 


It  is  possible  to  separate  the  con  elated  and  uncorrelated  contributions  to  the  density 
matrix  by  writing  the  reference  state  as  Hartree-Fock  plus  corrections.  In  the  ca¬ 
nonical  orbital  basis,  the  uncorrelated  part  is  diagonal; 

+  Z  (pq\\rs)(HF\aUs\HF}  =  A;*,  +  2  (Pg\kr)nr, 

n  r 

where  the  occupation  numbers,  n,  are  unity  for  occupied  spin-orbitals  and  zero  for 
virtual  spin-orbitals.  A  separation  of  correlated  and  uncorrelated  terms  in  the  Dyson 
equation  is  thereby  effected  by  writing 

G-'(£)  =  £!-£-  (alHaWuuon  -  (a|Hf  )[£1  -  (f  |Hf  )]-‘{f|Ha) 

=  G^'(£)-  2(£). 

The  first  two  terms,  which  are  diagonal  in  the  canonical  orbital  basis,  constitute 
the  inverse  of  the  uncorrelated  electron  propagator  matrix,  whose  poles  are  identical 
to  the  ionization  energies  and  electron  affinities  of  Koopmans’s  theorem: 

G?'(£U  =  {E-  Cp)bp^  . 

The  subscript  below  the  (a|  Ha)correiai.on  term  stipulates  that  the  Hartree-Fock  con¬ 
tribution  to  the  one-electron  density  matrix  has  been  omitted.  The  elements  of  this 
term  have  the  following  form: 

(a^l  Ha,^)cofTclaiion  “  2  . 


In  the  Dyson  equation,  correlation  corrections  to  the  uncorrelated  inverse  propagator 
matrix  arise  from  the  energy-independent  term  just  discussed  or  from  the  energy- 
dependent  final  term.  These  corrections,  also  known  as  the  self-energy  matrix,  do 
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not  vanish  when  E  becomes  positively  or  negatively  infinite,  for  the  constant  terms 
remain.  Because  of  this  limiting  behavior,  the  constant  parts  of  the  self-energy 
matrix,  2(£),  are  often  symbolized  by  2(oo),  where 

2(E)  =  2(oo)  +  2'(£). 

The  energy-dependent  part  of  the  self-eneigy  is  denoted  by  the  prime  superscript: 

•L'{E)  =  (a|Hf  )[£1  -  (f|Hf  )]-’(t|Ha) . 

The  energy-independent  part  is  defined  by 

2(qo)  ”  (  a  I H  a  )coryelation  ■ 

Perturbative  Arguments 

In  order  to  define  an  approximate  self-energy  expression,  various  investigators 
have  advocated  a  variety  of  choices  for  the  operator  manifold  f  and  the  reference 
state.  For  example,  the  burden  of  calculating  accurate  poles  (but  not  residues)  can 
be  placed  entirely  on  the  operator  space  by  choosing  a  Hartree-Fock  reference  for 
all  superoperator  matrix  elements  [4].  To  a  lesser  extent,  one  can  dispense  with 
the  operator  manifold  f  and  rely  solely  on  reference  state  corrections  in  the  matrix 
2(oo)  [13].  For  the  lowest  ionization  energy,  this  is  a  satisfactory  approach  in  the 
limit  of  a  complete  basis.  A  mixed  strategy,  relying  on  improvements  of  both  kinds, 
has  been  applied  more  often.  Justifications  for  various  choices  generally  involve 
counting  orders  of  the  fluctuation  potential  in  the  self-energy.  For  example,  the 
second  order  self-energy  is  obtained  when 

2''2>(£)  =  (a|Hf3)*”[£l  -  (f3iHf3)<°>]-'(f3|Ha)<'> , 

where  the  superscripts  in  parentheses  refer  to  the  highest  order  in  the  fluctuation 
potential  that  is  included  in  the  superoperator  matrices  and  fs  refers  to  the  space 
of  triple  field  operator  products.  Only  Hartree-Fock  contributions  to  ground-state 
averages  are  needed  at  this  level.  A  renormalized  theory  which  recovers  all  second- 
order  terms  is  obtained  when  the  matrix  ( f  |  Hf )  is  evaluated  in  first  order.  This 
replacement,  operative  in  the  so-called  2p-h  td,\  approximation  [3],  includes  ring 
and  ladder  diagrams  through  all  orders.  To  obtain  all  terms  through  third  order  in 
2'(£),  the  following  form  is  sufficient: 

2'<^^>(£)  =  (a|Hf3)<^»[£l  -  (f3|Hf3)<'>]-'(f3|Ha)'2) . 

Beginning  in  third  order,  correlation  corrections  to  the  ground-state  averages  are 
needed  [14].  In  this  case,  they  arise  in  (a|Hf3)  and  (f3|Ha)  through  first-order 
double  excitations.  This  approximate  2'  has  been  called  extended  2p-h  tda  and 
adc(3)  in  the  literature  [6].  It  contains  many  terms  that  are  higher  than  third 
order  and  therefore  the  notation  3+  is  used  in  the  superscript.  Because  superoperator 
matrix  elements  that  couple  a  and  fj  first  appear  in  second  order,  the  fs  portion  of 
the  operator  manifold  is  not  needed  until  fourth  order  in  the  self-energy  matrix. 

For  the  constant  part  of  the  self-energy,  contributions  of  order  n  require  in 
order  n  -  1 .  Because  single  excitation  corrections  to  the  Hartree-Fock  wavefunction 
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vanish  in  first  ordci,  there  are  no  first-order  corrections  in  the  correlation  density 
matrix.  Therefore,  there  are  no  constant  self-energy  terms  in  second  order.  S(oo ) 
appears  in  third-order  through  first-order  double  excitations  and  second-order  single 
excitations  in  the  reference  state.  The  former  excitations  contribute  to  the  occupied- 
occupied  and  virtual-virtual  blocks  of  the  density  matrix  and  the  latter  contribute 
to  the  occupied-virtual  block.  If  third-order  S(oo )  is  added  to  the  resulting 

self-energy  matrix  is  complete  through  third  order  [15]. 

Coupled-Cluster  Reference  States 

The  perturbative  approach  discussed  above  rests  on  systematic  improvements 
in  ground-state  averages  and  operator  manifolds.  Infinite  order  corrections  can  be 
obtained  by  retaining  the  full  inverse  matrix  that  occurs  ir.  the  energy-dependent 
part  of  the  self-energy  matrix.  ( Finite  order  terms  can  be  identified  by  expanding 
the  inverse  in  powers  of  its  uncorrelated,  diagonal  part  and  its  correlated,  nondi¬ 
agonal  part.)  Instead  of  relying  on  finite  order  improvements  to  the  reference  state’s 
wavefunction  or  density  matrices,  one  can  employ  a  description  that  contains  infinite 
order  corrections  [9] .  The  usual  coupled-cluster  ansatz  for  the  reference  state  has 
several  properties  that  make  it  convenient  for  this  purpose.  The  most  common 
coupled-cluster  wavefunction  [16] 

\CC)  =  e^\HF). 

where 


r  =  r,  +  Tz  +  Tj  +  •  •  • 

T:  =  'Z  X  tfala, 

I  a 

T2=  'Z  "Z  tfalala.Oj 

i>j  a>b 

Ti=  Z  Z  tl^aUlala.ajak 

t>j>k  a>b>c 

has  amplitudes  that  are  easily  related  to  the  usual  configuration  interaction  ansatz, 
|C/>  =  (1  +  C,  +  Cz  +  Cj  -t-  •  •  , 

where 

C\  =  ZZ  cfala, 

I  a 

C2  =  Z  Z  cfalala.Oj 

t>j  a>b 

Cz  =  2  Z  c^alalala.ajOk . 

i>j>k  a>h>c 

By  equating  terms  with  different  degrees  of  excitation  with  respect  to  the  Hartree- 
Fock  reference,  one  concludes  that  the  single  and  double  excitation  coefficients 
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that  are  implicit  in  a  coupled-cluster  singles  and  doubles  wavefunction  [17]  are 
given  by 


c?  =  /f 
and 

^ab  ^  fOb  I  fO  fb _  t^fb 

^ij  *  ^1  tj  li  . 

Of  special  interest  to  this  discussion  is  the  inclusion  of  all  first-order  doubles  con¬ 
tributions  and  all  second-order  singles  contributions  to  the  configuration  interaction 
coefficients  generated  in  this  manner.  Substitution  of  these  coefficients  for  the  usual 
perturbative  values  in  the  expressions  for  in  second  order  recovers  all  third-order 
self-energy  contributions  while  including  many  higher  order  terms.  While  it  is  pos¬ 
sible  to  restrict  these  substitutions  to  the  constant  part  of  the  self-energy  matrix, 
recent  implementations  have  extended  these  replacements  to  the  second-order  su¬ 
peroperator  matrices,  (a|Hf3)‘^’and(f3|Ha)‘^’,  required  for  the  third-order  energy- 
dependent  self-energy  [9]. 


Contour  Integral  Theory 

The  energy-independent  part  of  the  self-energy  matrix  depends  on  the  one-electron 
reduced  density  matrix.  This  matrix,  in  turn,  can  be  evaluated  through  the  contour 
integral  relation 


where  C  denotes  a  contour  that  encloses  all  ionization  energy  poles  in  the  complex 
plane  [1].  From  iterations  on  an  alternative  form  of  the  Dyson  equation, 

G(£)  =  Go(£)  4-  Go(£)S(£)G(£) , 

it  is  clear  that  the  propagator  matrix  can  be  expressed  in  powers  of  the  self-energy 
matrix: 

G(£)  =  Go(£)  +  Go(£)S(£)Go(£)  +  Go(£)2(£)Go(£)2(£)Go(£)  -f  •  •  •  , 

These  relations  suggest  an  iterative  procedure  in  which  density  matrices  obtained 
from  the  contour  integral  are  used  to  obtain  new  elements  of  the  self-energy  matrix, 
which,  in  turn,  are  used  to  evaluate  the  propagator  matrix.  The  cycle  is  completed 
when  the  new  propagator  matrix  produces  a  new  density  matrix.  Because  2(oo) 
depends  linearly  on  p*",  various  powers  of  latter  matrix  are  implicitly  contained  in 
the  contour  integral  relation.  By  truncating  the  Dyson  expansion  so  that 

Go(£)2(£)Go(£)2(£)Go(£) 

and  subsequent  terms  are  omitted,  one  obtains  a  linearized  expression  for  the  density 
matrix  [6,1 1).  Because  the  uncorrelated  contribution  to  the  density  is  given  by 
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=  ^.j^GoiE)  dE , 

the  correlated  contribution,  p'^,  is  involved  in  the  following  relation: 

p"  =  :^  f  Go(£){2(co)  +  S'{£-)}Go(£)i/£. 

27r/  Jc 

The  first  term  on  the  right  side  of  this  equation  is  easily  evaluated  through 

£  G„^,{£)2(<x.  )nfio„{E)  dE  =  Go,,(£)  2  (pq\\rs)p‘^sG^^{E)  dE . 

The  last  expression  vanishes  for  Cp  =  for  Cp  e,,  one  obtains 

— — —  2  {pq\\rs)p‘r, . 

rv 


By  defining 


t  «n  -  tla 


ifCp  Cq 


if  Cp  +  Cq 


one  can  write  the  following  equation  for  the  correlated  part  of  the  density  matrix: 

(1 -dV)p^  =  -^-.  f  Go(E)2'{E)G^(E)dE. 

Ztt/  Jc 

The  most  taxing  aspect  of  this  calculation  is  evaluation  of  the  contour  integral 
matrix 

f  Go(E)r(E)Go(E)dE. 

2iri  Jc 

In  effect,  the  energy-dependent  self-energy  matrix  must  be  calculated  at  energies 
equal  to  the  canonical  orbital  energies.  For  2''’^’(£),  calculation  of 

[£1  -(f3|Hf3)">]-(f3|Ha)'2> 

u  .c  crucial  step.  The  inverse  matrix  is  expanded  in  powers  of  its  diagonal,  zeroth- 
erder  part  and  its  first-order  part: 


{[£1  -(f3|Hof3)]‘'  +  [£1  -(f3|Hof3)]-'(f3|Vf3)[£J  -(f3|Hof3)]"' 

-p  [£I  -  (f3|Hof3)l-'(f3l  Vf3)[£l  -  (f3|Hof3)]-'  (f3|Vf3)(£T  -  (f3lHof3)]-' 

+  ..•}(f3!Ha)<='. 
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Table  I.  6-311  ++G(2df,2pd)  anion  ionization  energies  (eV). 


Anion 

Third  order 

Coupled  cluster 

Cv>ntour  integral 

Expt. 

[231 

2{x)., 

Pole 

2:(oo)„ 

Pole 

2(oo)„ 

Pole 

F- 

-4.337 

-4.155 

-3.797 

3.40 

OH- 

-0.384 

-z  119 

1.83 

NHi 

0.197 

-0.830 

0.78 

The  convergence  criterion  for  these  matrix  elements  is  10  *  atomic  units.  The  most 
demanding  summations  scale  where  O  is  the  number  of  occupied  orbitals 

and  Kis  the  number  of  virtual  orbitals.  (Identical  scaling  factors  occur  in  a  variety 
of  methods,  such  as  singles  and  doubles  configuration  interaction,  third-order  many- 
body  perturbation  theory  and  coupled-cluster  singles  and  doubles  [16].)  By  rear¬ 
ranging  the  linear  equations  and  expanding  (1  -  dV)"',  it  is  possible  to  identify 
terms  of  various  orders: 

p‘  =  (1  +dV<‘)-l-dV‘'>dV<'>+  •  •  •) 
f  Go(£){S'<2>(£)  + •  •  •  }Go(£) ^f£ . 

ZTTl  Jc 

Derivation  of  the  correlated  density  matrix  through  second-order  { and  the  corre¬ 
sponding  third-order  constant  diagrams)  requires  the  second-order  self-energy  and 
the  identity  term  from  the  expansion  of  { 1  -  dV )“' .  Retention  of  the  full  inverse 
introduces  higher  order  corrections.  For  third-order  second-  and  third-order 
contributions  to  2'(£)  are  required.  Because  the  lowest  order  contributions  to  the 
self-energy  matrix  are  second  order,  the  lowest  order  terms  that  are  neglected  in 
the  approximation 


G(£)  Go(£)  +  Go(£)2(£)Go(£) 

occur  in  the  fourth-order  density  matrix.  Insertion  of  2''^'^'{£)  in  contour  integral 
expressions  that  ignore  quadratic  and  higher  terms  in  the  self-eneigy  matrix  provides 


Table  II.  5s4pld  N2  ionization  energies  (eV) 


Final 

state 

Third  order 

Coupled  cluster 

Contour  integral 

Expt. 

[23] 

2:(x)„ 

Pole 

2(x)„ 

Pole 

2(x)„ 

Pole 

'Ilu 

-16.628 

16.98 

2v 

-15.482 

-15.694 

15.60 

2v 

-18.790 

-18.711 

-18.951 

18.78 
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Table  III.  6-3 1 1 ++G(2df.2pd)  H2O  ionization  energies  (eV) 


Final 

state 

Third  order 

Coupled  cluster 

Contour  integral 

Expt. 

123] 

2(oo)„ 

Pole 

2(ooX. 

Pole 

2(00)., 

Pole 

0.048 

-13.064 

0.006 

-13.046 

0.111 

-13.006 

12.62 

0.021 

-15.331 

-0.021 

-15.334 

-15.266 

14.74 

0.012 

-19.262 

-0.026 

-19.307 

-19.194 

18.51 

a  full  fourth-order  description  of  2{oo).  Many,  but  not  all,  fifth-order  and  higher 
terms  in  the  self-energy  matrix  are  implicitly  included.  One  source  of  these  terms 
is  the  renormalized  character  of  Another  source  is  the  infinite  order 

expansion  of  ( 1  -  dV )“'  that  is  implicit  in  solving 

=  ( 1  -  dV ^  £  Go(£)2'<^^>(£)Go(£)  dE . 

The  latter  equation  is  used  to  produce  p'’  and  the  resulting  elements  of  2(oo)  in 
the  calculations  that  are  presented  in  the  next  section.  Linear  equations  for  p*'  are 
solved  with  a  direct  inversion  of  iterative  subspaces  procedure  [18],  where  the 
convergence  criterion  is  10~*. 

Numerical  Examples 

The  three  methods  discussed  above  have  been  programmed  as  part  of  an  extension 
of  ept90,  an  electron  propagator  program  [19].  Hartree-Fock  calculations  and 
integral  transformations  to  the  canonical  molecular  orbital  basis  are  performed 
with  Gaussian  90  [20] .  Coupled-cluster  singles  and  doubles  calculations  have  been 
implemented  separately  [9]. 

6-31 1-1- +G(2df,2pd)  basis  sets  [21]  are  used  in  calculations  on  anions,  H2O  and 
H3N.  Bond  lengths  are  0.965  A  in  OH“  and  1.030  A  in  NHJ;  the  bond  angle  in 
the  latter  anion  is  101.3°.  In  the  H2O  calculations,  the  bond  length  is  0.959  A  and 
the  bond  angle  is  103.5°-,  for  H3N,  the  corresponding  data  are  1.013  A  and  107.4°. 
Core  molecular  orbitals  are  dropped  in  all  summations  pertaining  to  the  propagator 
and  coupled-cluster  singles  and  doubles  calculations.  The  basis  and  geometry  for 


Table  IV.  6-31  l-r+G(2df.2pd)  H3N  ionization  energies  (eV). 


Third  order  Coupled  cluster  Contour  integral 

Final  -  -  -  Expt. 


state 

2(oo)„ 

Pole 

2(oo)„ 

Pole 

2(x)„ 

Pole 

[23] 

'^1 

-10.961 

-16.740 

■B 

-11.005 

-16.769 

10.85 

15.8 
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No  have  been  previously  employed  [22],  No  molecular  orbitals  are  dropped  from 
the  latter  calculation. 

Results  on  anions  in  Table  I  include  ionization  energies  and  diagonal  elements 
of  the  constant  self-energy  matrix  whose  indices  match  the  index  of  the  highest 
occupied  canonical  orbital.  The  basis  sets  are  typical  of  accurate  electron  affinity 
calculations,  for  they  include  diffuse  s  and  p  functions,  a  triple  f  valence  set,  and 
double  polarization  functions.  In  the  columns  labeled  third  order,  constant  terms 
of  third  order  are  combined  with  2"’^ ’(£,  ).  Coupled-cluster  substitutions  for  per¬ 
turbative  reference  state  corrections  are  made  in  the  following  two  columns.  The 
next  two  columns  are  results  from  the  contour  integral  procedure.  The  present  set 
of  anions  is  a  severe  test  of  electron  affinity  methods,  for  the  highly  electronegative 
central  atoms  possess  electrons  that  are  confined  to  a  small  volume.  For  F“  and 
OH  - ,  the  coupled-cluster  results  effect  clear  improvements  over  third  order.  Ex¬ 
perimental  values  for  the  anion  ionization  energies  are  still  a  few  tenths  of  an 
electron  Volt  away  [23].  Changes  in  the  constant  self-energy  terms  explain  only 
part  of  the  differences  between  the  poles.  Coupled  cluster  improvements  to  the 
(alHfj)  and  (fjIHa)  matrices  may  have  a  greater  effect.  The  difference  between 
third-order  and  contour  integral  results,  however,  lies  entirely  in  the  treatment  of 
the  constant  self-energy  matrix.  There  is  a  marked  contrast  between  the  contour 
integral  values  and  the  third-order  values  that  leads  to  a  similar  set  of  shifts  in  the 
poles.  This  is  an  indication  that  higher-order  terms  in  the  constant  self-energy  matrix 
can  be  large. 

Calculations  on  Nj  with  the  model  basis  of  Table  II  disclose  a  similar  set  of 
trends.  Discrepancies  between  third-order  and  coupled-cluster  results  are  small. 
Contour  integral  descriptions  of  the  constant  self-energy  matrix  improve  the  third- 
order  results  somewhat  better. 

For  calculations  on  H2O  (Table  111),  the  coupled-cluster  values  for  the  constant 
self-energy  terms  do  not  lie  between  the  third-order  and  contour  integral  results. 
The  latter  treatment  gives  a  qualitatively  superior  set  of  improvements  to  third 
order.  The  remaining  deficiencies  of  third  order  probably  originate  from  the  absence 
of  quintuple  products  in  the  operator  manifold. 

Results  on  H3N  (Table  IV)  demonstrate  that  a  third-order  description  of  the 
constant  self-energy  matrix  can  be  accurate.  The  coupled-cluster  poles  lie  between 
the  third-order  and  contour  integral  results. 

Conclusions 

Fourth-  and  higher-order  corrections  to  the  constant  self-energy  matrix  can  be 
important  in  the  calculation  of  electron  binding  energies.  The  requisite  improve¬ 
ments  in  the  one-electron  density  matrix  can  be  obtained  through  perturbative 
corrections  to  reference  state  wavefunctions,  through  coupled-cluster  wavefunctions 
or  through  contour  integrals.  With  the  use  and  the  truncation  of  the 

'on  equation  after  the  term  that  is  linear  in  the  self-energy  matrix,  the  most 
..uii’cult  summation  has  a  scaling  factor,  O^V^,  identical  to  that  of  its  counterpart 
in  a  coupied-clurter  sii.gles  and  doubles  optimization.  (The  most  difficult  step  in 
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the  evaluation  of  the  constant  third-order  diagrams  has  an  O'V  '  scaling  factor.) 
The  contour  integral  procedure  sometimes  produces  large  corrections  to  the  third- 
order  constant  self-energy  matrix.  In  the  coupled-cluster  approach,  discrepancies 
with  the  third-order  constant  terms  are  smaller.  Based  on  these  preliminary  results, 
the  contour  integral  procedure  appears  to  be  a  more  efficient  sixth-power  method 
than  the  coupled-cluster  approach  for  improving  the  constant  contributions  to  the 
self-energy.  In  order  to  take  full  advantage  of  the  coupled-cluster  reference  state,  it 
may  be  necessary  to  calculate  additional  contributions  to  p‘  with  the  same  singles 
and  doubles  amplitudes.  Another  promising  possibility  is  abandonment  of  the  Dyson 
expansion  of  the  propagator  matrix  in  powers  of  the  self-energy  matrix  in  favor  of 
direct  integration  over  the  Coulson  contour. 
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Abstract 

The  geometnes  of  the  ground  and  first  triplet  excited  state  of  malonaldehyde  arc  compared  for  two 
distinct  configurations  of  the  molecule.  The  first  represents  the  equilibrium  geometry  and  the  second 
structure  corresponds  to  the  transition  state  for  the  intramolecular  hydrogen  transfer.  The  ground  state 
computations  utilize  both  self-consistent  field  and  Moller-Plesset  second-order  perturbation  theory.  The 
excited  state  computations  employ  several  different  theoretical  methods,  unrestncted  Hartree-  Fock  (UHF). 
unrestncted  second-order  Moller-Plesset  perturbation  theory  (tJMP2).  ci  Singles  (CIS),  and  complete 
active  space  self-consistent  field  (Cas).  The  geometric  parameters  obtained  for  the  two  configurations 
are  contrasicd  in  both  the  ground  and  excited  state.  The  structural  differences  are  related  to  the  extent 
of  hydrogen  bonding  present,  which  is  compared  to  the  calculated  proton  transfer  barrier  g  1993  John 
Wiley  &  Sons.  Inc. 


Introduction 

In  contrast  to  proton  transfers  which  take  place  in  the  ground  electronic  state, 
the  study  of  such  reactions  occurring  in  excited  electronic  states  is  in  its  infancy. 
The  sensitivity  of  the  properties  of  the  proton-transfer  reaction  to  the  nature  of  the 
solvent  suggests  gas-phase  work  would  be  required  in  order  to  understand  the  in¬ 
trinsic  behavior  of  a  given  system.  Some  relevant  work  is  being  reported  in  the  gas 
phase  [1-8],  as  well  as  in  low-temperature  matrices  [9,10]. 

On  the  theoretical  side,  only  a  couple  of  ab  initio  studies  have  been  reported  for 
excited  state  proton  transfer  at  this  time  [  1 1 , 1 2  ] .  Since  the  computational  methods 
available  to  study  excited  states  are  somewhat  limited,  especially  for  states  that  are 
not  the  lowest  energy  of  a  given  symmetry  type,  it  is  important  to  determine  which 
of  the  available  methods  are  dependable  and  how  the  results  obtained  from  different 
methods  compare  with  one  another.  The  most  logical  point  to  begin  the  comparison 
concerns  the  optimized  structures  obtained  using  various  methods.  This  comparison 
is  crucial  because  electronic  properties  depend  on  the  quality  of  the  geometry  con¬ 
sidered. 

This  article  systematically  compares  the  geometries  obtained  for  the  lowest  excited 
triplet  state  of  malonaldehyde  using  a  number  of  theoretical  methods.  These  methods 
were  chosen  less  for  purposes  of  rigorous  accuracy  but  rather  as  a  means  of  com¬ 
paring  the  results  of  various  computationally  tractable  procedures.  Unrestricted 
Hartree-Fock  (uhf)  is  a  historically  popular  means  of  investigating  some  of  the 
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lower  excited  states.  While  it  does  not  yield  a  state  which  is  an  eigenfunction  of  the 
spin  operator,  it  is  usually  possible  to  draw  a  direct  connection  with  the  correct 
state.  A  common  follow-up  to  UHF  which  introduces  dynamic  correlation  in  an 
efficient  manner  is  unrestricted  Moller-Plesset  perturbation  theory  (UMP2)  which 
has  been  investigated  here  as  well.  Pople  and  coworkers  have  recently  introduced 
a  Cl  Singles  scheme  (ciS)  which  permits  one  to  obtain  a  first  approximation  of 
various  excited  states,  even  those  of  the  same  symmetry  as  the  ground  state.  This 
method  would  appear  quite  promising  for  purposes  of  studying  excited  state  proton 
transfers  and  warrants  some  scrutiny.  As  a  last  procedure,  we  test  the  complete 
active  space  self-consistent  field  (CAS)  method  which  removes  much  of  the  arbi¬ 
trariness  of  MCSCF  and  has  proven  rather  dependable. 

Malonaldehyde  was  chosen  due  to  its  relatively  small  size  and  simplicity.  This 
molecule  has  itself  been  studied  both  experimentally  [13-17]  and  theoretically  in 
the  ground  state  [18-20],  which  provide  valuable  benchmarks  in  determining  the 
accuracy  of  the  computational  methods  tested. 

Computational  Details 

All  of  the  computations  relevant  to  this  communication,  except  for  the  complete 
active  space  self-consistent  field  calculations  (CAS).  were  performed  using  the 
Gaussian  90  ab  inuio  package  [21],  The  cas  calculations  [  22]  utilized  the  General 
Atomic  and  Molecular  Electronic  Structure  System  (gamess)  [23],  The  ci  singles 
(CIS)  method  [24]  was  utilized  as  implemented  in  Gaussian  90.  Electron  correlation 
was  incorporated  into  some  of  the  ground  and  excited  state  computations  via  a 
second-order  Moller-Plesset  correction  to  either  the  SCF  or  CIS  energy.  Three  basis 
sets  are  considered,  split  valence  4-3 IG  [25],  polarized  triple  valence  6-31 IG** 

[  26,27  ] ,  and  another  containing  diffuse  functions,  6-3 1 +G  **  [  28  ] . 

Two  configurations  of  malonaldehyde  are  investigated  as  illustrated  in  Figure  1. 
The  first,  designated  M,  represents  the  equilibrium  geometry.  The  hydrogen  is 
covalently  bonded  to  one  oxygen  and  interacts  with  the  other  via  a  hydrogen  bond. 
The  second  configuration  corresponds  to  the  midpoint  or  transition  state,  TS,  of 
the  proton  transfer  process  and  has  the  hydrogen  equidistant  from  the  two  oxygens. 
Both  Iructures  were  initially  found,  and  in  subsequent  calculations  constrained, 
to  be  planar.  M  has  C,  symmetry  and  the  point  group  of  TS  is  .  All  harmonic 
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Figure  I.  Structures  of  the  two  forms  of  malonaldehyde  investigated.  M  represents  the 
equilibrium  form  and  TS  represents  the  transition  state  of  the  proton  transfer. 


GEOMETRIES  OF  MALONALDEHYDE 


421 


frequencies  of  the  former  are  positive  while  the  latter  has  associated  with  it  a  single 
imaginary  frequency,  corresponding  to  the  transfer  of  the  proton.  The  proton  transfer 
barrier  is  calculated  as  the  energy  difference  between  the  M  and  TS  structures. 

Ground  State 

In  contrast  to  the  excited  states  which  have  been  largely  unexplored  by  ab  initio 
computations,  there  is  a  significant  body  of  theoretical  as  well  as  experimental  data 
concerning  the  ground  state  of  malonaldehyde.  These  earlier  results  can  serve  as  a 
valuable  measure  of  the  reliability  of  some  of  the  computational  methods  employed 
here.  The  salient  features  of  the  geometries  optimized  at  various  levels  of  theory 
are  reported  in  Table  I,  along  with  the  total  energy. 

All  the  data  confirm  a  number  of  structural  changes  caused  by  the  half  transfer 
of  the  proton.  The  O1C2  hydroxyl  bond  of  M  shortens  as  it  acquires  more  of  the 
carbonyl  character  it  will  have  following  completion  of  the  proton  transfer.  The 
opposite  trend  is  noted  in  the  originally  carbonyl  C4O5  bond.  The  formally  double 
C2C3  bond  in  M  elongates  while  the  single  C3C4  bond  contracts.  Most  of  these 
bond  length  changes  are  in  the  neighborhood  of  0.05  A.  Also  important  are  the 
parameters  relevant  to  the  H-bond  connecting  the  two  oxygens.  The  interoxygen 
distance  is  reduced  by  perhaps  0.3  to  0.4  A  when  the  bridging  hydrogen  is  moved 
midway  between  them.  This  change  is  accomplished  largely  by  reductions  of  up  to 
7®  in  some  of  the  internal  bond  angles.  The  shrinking  R(00)  is  accompanied  by 
a  more  linear  H-bond  in  the  sense  that  this  proton  lies  closer  to  the  O  •  •  •  O  axis. 

The  MP2/6-31 IG**  data  agree  nicely  with  the  MP2/6-31G**  values  reported 
by  Frisch  et  al.  [20].  The  results  also  correspond  nicely  to  those  determined  ex¬ 
perimentally  by  Wilson  and  coworkers  [17].  In  addition,  the  partial  transfer  of  the 
hydrogen  affects  the  other  geometric  properties  as  observed  experimentally  by 
Roubin  and  coworkers,  who  noted  that  the  relative  C — C  and  C — O  bond  orders 
vary  as  the  hydrogen  transfers  [13]. 

With  regard  to  the  effect  of  the  level  of  theory  upon  the  geometry,  the  larger 
basis  set  yields  shorter  C — O  and  longer  C — C  bonds  than  does  4-3 IG  at  the  SCF 
level.  There  is  evidence  that  6-31 IG**  provides  a  weaker  H-bond  in  the  M  con- 
former:  r(OiH)  is  shorter  by  0.01  A  and  the  interoxygen  distance  longer  by  0.02 
A.  (The  diffuse  functions  of  6-3 1  -t-G**  induce  very  little  change  in  the  geometrical 
parameters.)  The  same  indicators  suggest  that  correlation  strengthens  the  H-bond 
considerably. 


Excited  State 

The  lowest  lying  excited  state  of  malonaldehyde  is  a  triplet  derived  from  an 
excitation  from  the  highest  occupied  molecular  orbital  (HOMO)  into  the  lowest 
unoccupied  molecula  jrbital  (LUMO).  The  HOMO  has  n  symmetry  and  is  primarily 
of  C — C  bonding  ^naiacter  along  with  some  C — O  antibonding,  and  the  lumo 
is  almost  entirely  C — O  it*.  The  transition  is  denoted  -  it*. 

Before  discussing  the  geometries  in  detail,  a  brief  mention  should  be  made  of 
the  details  of  the  methods  used.  For  each  method  investigated,  except  cas,  the 


Table  I.  Optimized  geometries*  and  energies  of  ground  state  of  malonaldehyde,  M,  and  the  transition  state  of  the  proton  transfer.  TS. 
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Utilization  of  the  method  is  straightforward.  One  need  only  specify  which  state  is 
to  be  explored.  For  CAS,  an  additional  step  must  be  performed;  the  active  space 
must  be  defined.  In  this  study,  two  different  CAS  active  spaces  were  investigated. 
The  first  included  the  two  singly  occupied  rt  orbitals  and  one  additional  virtual 
orbital  of  jt  symmetry  (the  lowest  energy  virtual  tt),  giving  two  active  electrons  in 
three  active  orbitals,  this  prescription  is  designated  CAS-1.  cas-2  added  two  doubly 
occupied  ir  orbitals  and  the  next  lowest  unoccupied  x  orbital,  totalling  six  active 
electrons  and  six  active  orbitals. 

For  the  excited  state,  relaxation  to  a  lower  energy  state  of  the  same  symmetry  is 
not  a  problem.  However,  for  excited  state  calculations  in  general,  such  as  Si,  this 
collapsing  to  a  lower  state  can  be  a  significant  concern.  It  should  be  noted  that  of 
the  methods  used,  only  cis  does  not  suffer  from  this  problem. 

The  geometries  obtained  for  the  equilibrium  orientation,  M,  are  provided  in 
Table  II.  Initially,  the  excited  state  geometries  will  be  compared  to  the  values  ob¬ 
tained  for  the  ground  state.  Generally,  each  method  provides  similar  tr^.*  resulting 
from  excitation.  Understanding  the  differences  in  the  geometry  of  the  excited  state 
is  predicated  on  the  molecular  orbitals  that  are  involved  in  the  excitation.  The 
HOMO  and  the  LUMO  are  shown  in  Figure  2  for  both  the  M  and  TS  structures.  The 
characters  of  the  orbitals  are  similar  for  each  orientation,  although  the  interactions 
are  cleaner  in  the  TS  form  due  to  the  increased  symmetry.  The  phases  of  the  x 
orbitals  in  the  homo  are  such  that  bonding  interactions  exist  primarily  between 
like  atoms  (C — C  or  O — O)  while  those  between  unlike  atoms  are  x*  in  character. 
The  LUMO  introduces  a  second  nodal  plane  such  that  most  interactions  are  anti¬ 
bonding,  except  those  involving  C3. 

The  Oi  — C2  bond  lengthening  can  be  understood  simply  via  the  placement  of 
an  electron  in  the  LUMO  which  is  strongly  antibonding  between  the  two  pertinent 
atoms.  A  large  stretch  occurs  in  the  C2C3  bond  upon  excitation  from  HOMO  to 
LUMO  since  the  interaction  between  these  two  atoms  changes  from  bonding  in  the 
HOMO  to  antibonding  in  the  LUMO  for  both  arrangements.  The  appearance  of  a 
bonding  interaction  between  C3  and  C4  in  the  LUMO  of  the  M  form  accounts  for 
the  shorter  C3  —  C4  bond  in  the  ^x  —  x*  state.  The  strong  antibond  between  C4 
and  O5  in  the  LUMO  is  largely  responsible  for  the  stretch  in  this  bond  upon  electronic 
excitation. 

The  excited  state  results  indicate  that  the  hydrogen  bond  is  weakened  by  the 
excitation.  This  change  manifests  itself  in  several  geometric  adjustments.  First,  the 
O5  —  H9  distance  is  lengthened  by  roughly  0.2  A.  Second,  the  O — O  separation  is 
also  increased  by  a  similar  amount.  Finally,  the  hydrogen  bond  is  more  linear  in 
the  ground  state. 

Now  the  discrepancies  between  the  various  methods  of  calculating  excited  state 
geometries  can  be  addressed.  First,  for  the  M  form,  the  most  notable  differences 
occur  in  the  variables  related  to  the  bridging  hydrogen  atom.  The  deviation  of  this 
hydrogen  from  theO — Oaxis  is  around  35°  for  the  UHF,  UMP2,  and  CIS  methods, 
but  is  only  14.6°  and  17.5°  for  the  two  cas  calculations.  The  O5 — H9  and  the 
0| — O5  separations  are  also  shorter  in  the  cas  calculations,  although  in  this  case 


Table  II.  Optimized  geometries*  and  energies  of  the  first  excited  triplet  state  of  malonaldehyde  in  the  M  eonfiguration. 
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Figure  2.  Schematic  diagram  of  the  orbital  structure  of  the  homo  and  lumo  of  the  M 
and  TS  forms  of  malonaldehyde,  calculated  using  6-31 IG**  basis  set.  The  size  of  each 
lobe  corresponds  to  the  magnitude  of  the  molecular  orbital  coefficient. 

the  CIS  values  are  nearly  as  short.  There  thus  appears  to  be  significantly  more 
hydrogen  bonding  in  the  CAS  computations,  especially  with  the  smaller  active  space. 

Another  significant  difference  concerns  the  bond  lengths  around  the  ring.  The 
carbonyl  C4 — O5  distance  in  the  CIS  computation  is  much  shorter  than  the  other 
methods,  and  the  C3 — C4  bond  is  longer  than  the  cas  values.  These  differences 
result  in  much  more  distinct  single  and  double  bond  character  for  C3  —  C4  and 
C4 — O5,  respectively,  using  CIS.  Also,  the  ump2  bond  lengths  are  generally  longer 
than  those  from  any  of  the  other  methods.  Similar  comparisons  are  evident  from 
the  data  available  using  the  larger  basis  set. 

The  optimized  parameters  for  the  transition  state  of  the  transfer  are  given  in 
Table  III.  In  comparison  to  the  ground  state,  the  results  correspond  to  those  discussed 
for  the  M  form,  not  surprisingly  since  the  character  of  the  homo  and  lumo  are 
roughly  the  same  for  both  configurations.  The  main  variat’on  from  the  M  results 
is  that  like  bonds  are  now  equivalent  due  to  the  inclusion  of  the  C2  symmetry  axis. 

Before  comparing  the  excited  state  TS  geometries,  the  magnitude  of  the  changes 
expected  compared  to  those  for  the  M  structure  should  be  addressed.  The  TS  struc¬ 
ture  should  not  change  as  much  when  the  excited  state  is  optimized  due  to  the  C2„ 
constraint  that  requires  the  hydrogen  to  be  equidistant  from  both  oxygens.  This 
condition  prevents  the  hydrogen  from  having  a  strong  covalent  bond  to  either 
oxygen  atom.  Therefore,  the  stability  of  the  molecule  is  directly  related  to  both 
hydrogen  bonding  interactions  and  the  oxygens  cannot  move  too  far  apart  without 
destroying  these  hydrogen  bonding  interactions.  On  the  other  hand,  the  M  config¬ 
uration  does  possess  a  covalent  O — H  interaction,  allowing  the  hydrogen  bond  to 
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Distances  in  A,  angles  in  degrees,  energies  in  hartree. 

Using  smaller  active  space,  2  single  occupied  orbitals,  and  I  unoccupied  orbital. 

Using  larger  active  space,  2  doubly  occupied  orbitals,  2  singly  occupied  orbitals,  and  2  unoccupied  orbitals. 
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be  any  strength  without  destroying  the  main  covalent  interactions  which  hold  the 
molecule  together,  which  allows  more  significant  rearrangement. 

The  main  feature  of  the  comparison  between  methods  for  the  excited  TS  ori¬ 
entation,  is  the  catego^zation  of  the  results  into  two  distinct  groups.  The  uhf  and 
ump2  results  are  very  similar,  but  markedly  different  from  the  other  methods. 
Meanwhile,  the  CIS  and  both  cas  computations  give  essentially  identical  structures; 
the  largest  variations  are  0.0 1  A  and  0.3°.  Quite  remarkably,  the  O — H  and  O — O 
separations  are  all  within  0.001  A. 

The  major  differences  between  the  two  groups  of  data  concern  the  bonding  ar¬ 
rangement  around  the  ring  and  the  O — O — H  relationships.  The  uhf  and  ump2 
computations  both  give  much  longer  C — O  bonds  and  slightly  shorter  C — C  bonds 
than  the  other  methods,  indicating  more  delocalization  remains,  especially  using 
uhf  where  all  four  bonds  are  nearly  equivalent.  The  O — O  separation  and  the 
O — H  distances  are  smaller  by  approximately  0.1  A  and  0.03  A,  respectively,  in 
the  UHF  and  UMP2  computations.  This  adjustment  would  result  in  greater  hydrogen 
bonding.  However,  this  potential  increase  is  counteracted  by  the  linearity  of  the 
hydrogen  bond.  The  CIS  and  CAS  results  indicate  a  more  linear  bonding  arrangement 
by  roughly  3°.  Therefore,  even  though  there  are  changes  in  the  TS  structures,  the 
overall  effect  on  the  strength  of  the  hydrogen  bond  is  minimal,  as  expected.  Once 
again,  the  6-31 IG**  and  6-3 1+G**  data  confirm  the  trends  observed  using  the 
smaller  basis  set.  The  primary  difference  is  that  the  larger  bases  produce  structures 
that  have  more  hydrogen  bonding.  This  is  evident  in  the  increased  linearity  of  the 
H-bond,  by  3°,  and  shorter  O — O  and  O — H  distances. 

The  barrier  for  proton  transfer  is  defined  as  the  difference  in  energy  between  the 
TS  and  M  structures.  The  barriers  for  each  of  the  excited  state  methods  discussed 
are  listed  in  Table  IV.  Of  most  interest  is  that  each  method  has  a  transfer  barrier 
higher  by  a  factor  of  approximately  two  to  three  than  the  ground  state,  except  ump2 
which  is  only  slightly  above  .he  ground  state  value.  Comparison  of  the  rows  illustrates 
a  fairly  minor  basis  set  sensitivity.  In  addition,  the  barriers  correlate  well  with  the 
relative  hydrogen  bonding  strengths  discussed  earlier.  The  greater  the  amount  of 
hydrogen  bonding;  the  smaller  the  proton  transfer  barrier.  This  relationship  also 
explains  why  the  excited  state  barriers  are  so  much  higher  than  the  ground  state 
values,  which  have  indications  of  much  stronger  hydrogen  bonding.  Of  course,  this 


Table  IV.  Energy  barriers  (kcal/mol)  computed  for  proton  transfer  in  excited  triplet  state  of  malon- 

aldehyde. 


UHF 

UMP2 

CIS 

CAS  1* 

CAS  2” 

4-3  IG 

33.1 

16.1 

24.8 

28.9 

36.2 

6-31 IG** 

34.4 

13.6 

21.7 

6-3 1+G” 

35.0 

*  Using  smaller  active  space-  2  single  occupied  orbitals  and  1  unoccupied  orbital. 

Using  larger  active  space:  2  doubly  occupied  orbitals,  2  singly  occupied  orbitals,  and  2  unoccupied 
orbitals. 
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conclusion  remains  'ubject  to  results  that  might  be  obtained  with  better  account 
of  electron  correlation. 


Summary  and  Conclusions 

The  lowest  excited  electronic  state  of  malonaldehyde  is  ^ir  -  tt*  resulting  from 
an  excitation  from  the  highest  occupied  to  the  lowest  unoccupied  MO.  The  ge¬ 
ometries  of  both  the  equilibrium  and  transition  state  forms  of  malonaldehyde  reveal 
a  change  in  character  of  the  bonding  interactions  as  excitation  from  the  ir  tc  tt* 
orbital  occurs,  independent  of  the  computational  method  utilized.  The  excitation 
produces  more  delocalization  of  the  bonding  around  the  ring  and  a  larger  separation 
between  the  oxygens  and  central  hydrogen  atoms,  resulting  in  weakened  hydrogen 
bonding. 

The  proton  transfer  barrier  for  the  excited  state  is  consistently  higher  .nan  for 
the  ground  state,  by  a  factor  of  two  to  three.  This  result  is  in  accord  with  the  relative 
strengths  of  the  intramolecular  H-bond:  a  stronger  interaction  is  associated  with  a 
low  barrier.  Also,  there  is  little  sensitivity  to  the  basis  set  chosen.  On  the  other 
hand,  an  estimate  of  the  true  proton  transfer  barrier  would  be  premature  without 
more  complete  treatment  of  electron  correlation  with  more  flexible  basis  sets. 

While  the  levels  of  theory  discussed  here  are  inadequate  to  provide  a  definitive 
answer  as  to  the  particulars  of  the  true  geometry,  the  consistency  of  certain  trends 
observed  with  these  methods  serves  to  reinforce  their  validity.  For  example,  the 
qualitative  aspects  of  the  differences  between  the  ground  and  excited  state  geometries 
of  malonaldehyde  are  similar  for  all  the  methods  investigated.  The  structure  of  the 
excited  state  contains  less  hydrogen  bonding  than  the  ground  state  and  the  resulting 
barrier  to  proton  transfer  is  larger. 

The  quantitative  aspects  of  the  geometric  parameters  studied  for  the  excited  state 
methodologies  are  somewhat  less  consistent.  The  results  for  the  equilibrium  con¬ 
figuration  (M)  vary  depending  on  the  method.  There  are  no  clear  indications  as  to 
which  method  produces  the  best  values,  although  some  ump2  bond  lengths  appear 
long  and  the  CaS  H-bond  is  seemingly  too  linear.  On  the  other  hand,  the  CIS  and 
CAS  results  for  the  transition  state  configuration  (TS)  are  virtually  identical,  indi¬ 
cating  much  less  sensitivity  to  the  actual  method  employed. 
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Abstract 

Wc  study  im^rals  that  arise  in  the  calculation  of  electric  multipole  moments.  A  computational  approach 
is  developed  to  evaluate  these  integrals  in  an  sro  basis  set  in  a  straightforward,  yet  somewhat  untraditional 
fashion.  The  computation  involves  a  simple  three-step  procedure:  translation  of  all  Cartesian  parts  to 
one  center,  local  evaluation,  and  a  single  one-sided  rotation  into  the  molecular  frame  Some  assessment 
on  this  method  is  discussed.  An  implementation  of  the  method  is  presented  in  this  paper  to  compute 
overlap,  dipole,  quadrupole,  octopole,  and  hexadecapole  moment  integrals  in  a  basis  set  that  includes 
up  to  g  orbitals.  ©  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

Electric  multipole  moments  have  been  of  both  experimental  and  t  t,oretical  in¬ 
terest  for  some  time.  While  direct  measurements  are  usually  limited  to  dipole  and 
quadrupole  moments  [1],  molecular  electronic  structure  theory  has  lately  made 
investigations  on  electric  moments  exceeding  tenth  order  [2,3  ] .  Due  to  experimental 
limitations,  studies  of  higher  moments  are  not  aimed  at  confirmation  of  measured 
result;  rather  they  are  driven  by  the  needs  of  recent  advances  in  molecular  dynamics 
[4],  computation  of  high  order  polarizabilities  [5-7],  molecular  optical  and  mag¬ 
netic  properties  [8-10],  intermolecular  interactions  in  terms  of  the  London  series 
[7,10-15],  and  quantum  mechanical  solvent  effect  in  the  formalism  of  the  self- 
consistent  reaction  field  theory  [16-18]. 

The  challenge  here,  then,  is  to  calculate  electric  moments  useful  to  develop  some 
of  the  above-mentioned  fields.  This  challenge  is  currently  being  addressed  utilizing 
methods  based  on  electronic  structure  theory  of  varying  sophistication.  A  molecular 
orbital  description  can  often  give  molecular  dipole  moments  with  greater  assurance 
than  it  can  give  the  quadrupole  moments.  For  moments  higher  than  quadrupole 
we  know  that  basis  set  choice  is  very  important  [19],  for  high  moments  are  very 
sensitive  to  electronic  distributions  in  the  outer  molecular  region,  which  can  only 
be  better  described  by  extended  basis  sets.  Basis  set  effects  are  believed  to  be  more 
important  than  correlation  on  these  high  moments  [20,3].  On  the  other  hand, 
most  multipole-moment  studies  are  carried  out  using  Gaussian  basis  sets;  in  fact, 
we  are  not  aware  of  any  recent  Slater-type  orbital  (sro)  basis  set  calculations  on 
moments  higher  than  quadrupole  for  polyatomic  molecules  (we  note  an  early  paper 
[21]).  The  drawback  of  Gaussian  basis  sets  is  apparent  here:  High  moments  ex¬ 
pansion  of  electrostatic  interactions  is  a  good  approximation  only  at  long  range. 
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where  the  Gaussian  functions  are  known  to  fall  too  sharply.  For  this  reason  there 
have  even  been  ab  initio  calculations  using  Gaussian  basis  sets  that  are  then  fitted 
to  STOs  to  study  long-range  interactions  [22]. 

The  introduction  of  Gaussian  basis  sets  into  molecular  orbital  theory  is  to  facilitate 
integration.  Multipole  moment  integrals  are  conveniently  worked  out  in  Gaussian 
basis  sets  by  Matsuoka  [23],  Augspurger  and  Dykstra  [24],  and  Mikkelsen  et  al. 
[17].  A  study  of  these  moments  using  numerical  techniques  can  be  found  in  the 
calculation  of  Sundholm  and  Pyykko  [3].  High  moment  integrals  over  STO  basis 
sets  are  not  well  studied,  although  they  would  not  appear  to  be  particularly  intrac¬ 
table.  Originally  molecular  orbital  theory  of  polyatomic  molecules  was  formulated 
using  STO  basis  sets  and  a  series  of  now  classical  work  on  STO  integrals  was  -:arried 
out  by  Mulliken  et  al.  [25],  Roothaan  [26],  Ruedenberg  [27],  and  Reudenbeig 
et  al.  [  28  ] .  Local  dipole  moment  integrals  were  formulated  by  Hamilton  [  29  ]  using 
C  functions.  Quadrupole  moment  integrals  were  included  in  the  study  by  Wahl  et 
al.  [30],  and  implemented  by  Bagus  et  al.  [31]  for  linear  molecules.  A  specific  and 
detailed  treatment  of  dipole  moment  integrals  for  polyatomic  molecules  was  pre¬ 
sented  by  Rein  et  al.  [32],  which  was  extended  to  handle  quadrupole  moment 
integrals  by  Rabinowitz  and  Rein  [33],  and  octopole  moment  integrals  by  Swissler 
and  Rein  [21].  Higher-order  moment  integrals  have  not  been  studied  in  detail  in 
extended  STO  basis  sets  for  polyatomic  molecules. 

Electric  moment  integrals  are  of  one-electron  and,  at  most,  three-center.  A  method 
of  evaluating  integrals  of  this  nature  is  generally  outlined  by  Harris  and  Michels 
[34],  Steinbom  and  Ruedenberg  [35],  and  Guseinov  and  Sadichov  [36].  These 
traditional  approaches  all  invoke  translational,  rotational,  and  combinational 
properties  of  spherical  harmonics  of  various  kinds  that  are  nicely  examined  in  the 
works  of  Steinbom  and  Ruedenberg  [37]  and  Steinbom  [38].  This  paper  presents 
a  novel  method  of  evaluating  multipole  moment  integrals  over  STO  basis  sets.  We 
will  show  explicit  formulas  for  up  to  hexadecapole  moment  in  a  basis  set  that 
includes  up  to  g  orbitals.  The  method,  although  somewhat  tedious,  is  completely 
generalizable  and  results  in  very  fast  computer  code. 


Preliminary  Considerations 

Cartesians  and  Tesseral  Harmonics 


An  STO  is  usually  given  by 


(2n 


in:  l-|l/2 


(2n)!  J 


(la) 


where  f  <l>)  are  the  normalized  real  spherical  harmonics  [39]  and  the  square 
root  factor  is  the  normalization  cor.r»«'’t!  for  the  radial  component.  We  regroup 
these  terms  as 


(2f) 


2n+n 


(2n)! 


1/2 


r-'-'e-f'lr'YTie,  <!>)], 


(lb) 


ELECTRIC  MULTIPOLE  MOMENT  INTEGRALS 


433 


which  allows  a  convenient  representation  of  the  angular  part  in  terms  of  Cartesian 
components.  The  quantities  in  the  second  bracket  are  known  as  Tesseral  harmonics 
[40],  which  we  note  as 

L  =  WyT(6,  <i>)] , 

where  /  has  its  usual  meaning  (/  =  0, 1, 2, 3  •  •  •  for  5,  p,  d,f,  •  •  • )  and  m  enumerates 
the  real  components  obtained  from  adding  YT  ±  YJ"". 

We  define  a  product  of  the  powers  of  x,  y,  and  z  to  be  a  Cartesian, 

{ij,k)  =  x‘y'z^,  (2) 

where  i,j,  and  k  are  integers.  In  this  notation  the  normalized  Tesseral  harmonics 
are  written  below  for  up  to  g  type  (we  follow  Pauling  and  Wilson  [41]): 
s  Harmonic: 

s  =  V(l/47r)  (0,0,0) . 
p  Harmonics: 

p,=  V(3/4:r) (1,0,0),  p,.  =  V(3/47r) (0,1,0),  p,  =  V(3/47r)(0,0,l ) . 

d  Harmonics: 

d:2  =  V('57l67r)[2(0,0,2)  -  (2,0,0)  -  (0,2,0)] , 
d,2^y2  =  V(  15/ IfiTT)  [(2,0,0)  -  (0,2,0)] . 

V(15/47r)(  1,1,0),  V(  15/4,r)(  1,0,1 ),  =  V(  15/47r)(0,l,l ) . 

/  Harmonics: 

=  V(63/167r)[2(0,0,3)  -  (2,0,1)  -  (0,2,1 )] . 
f,,2  =  V(21/327r)[4(  1,0,2)  -  (3,0,0)  -  ( 1,2,0)]  , 
fy,2  =  V(21/32^)[4(0,l,2)  -  (2,1,0)  -  (0,3,0)] , 
f,^,2.y2^  =  V(  105/ IOtt) [(2,0,1 )  -  (0,2,1 )],  =  V(105  /47r)(  1,1,1 ) , 

/^3_3;,,3  =  V(35/32,r)  [(3,0,0)  -  3(  1,2,0)], 
fi,2,^y2  =  V(35/323r)[3(2,l,0)  -  (0,3,0)] . 
g  Harmonics: 

=  (9/16]^)[(4,0,0)  +  2(2,2,0)  +  (0,4,0) 

-8(2,0,2)-8(0,2,2)  +  |(0,0,4)]. 
gx-J  =  |V(10/^)I!(  1  A3)  -  (3,0,1 )  -  ( 1,2,1 )] , 

^,.-3  =  |V(10/ir)  [^(0,1,3)  -  (2,1,1 )  -  (0,3,1 )] , 

gzHx^-y^)  =  iV(5/3r) [-(4,0,0)  +  (0,4,0)  +  6(2,0,2)  -  6(0.2,2)] , 

gxyz^  =  jV(5/7r)[6(  1,1,2)  -  (3,1,0)  -  ( 1,3,0)] , 

^zu3_3^v3)  =  IV^TO/tQ [(3,0,1)-  3(1,2,!)], 
gz(3.V>h  =  |V( 70/77)  [3(2,1,!)-  (0,3,1)] , 
gx^y^  =  1^V(35/7r) [(4,0,0)  -  6(2,2,0)  +  (0,4,0)], 
gxyW-yh  =  |V(35/7r) [(3,1,0)  -  (1,3,0)] . 
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The  Multipole  Moment  Operator 

There  has  been  two  ways  of  defining  multipole  moments  in  the  literature.  The 
earlier,  and  perhaps,  simpler,  definition  (see,  for  example.  Ref.  [42])  is 

=  YUe,4>),  (3) 

where  the  <}>)  are  the  spherical  harmonics.  An  alternative  definition,  following 
Buckingham  [43],  is 

monopole  <7=1, 

dipole  lia  =  r„, 

quadrupole  d„ji  =  ^(3r„r^  -  r\^) ,  (4) 

octopole  ~  r„rUgy  -  r^rlby^  -  r^rlb^^) , 

etc. 

where  we  omit  all  charges.  The  tensor  operators  of  the  first  several  moments  in  Eq. 
(4)  take  the  following  form; 

Dipole: 

a,  =  (1,0,0),  tiy  =  (0,1,0),  ii,  =  (0,0,1 ) . 

Quadrupole: 

a,  =  |[2(2,0,0)  -  (0,2,0)  -  (0,0,2)], 

Q,y  =  ^[2(0,2,0)  -  (2,0,0)  -  (0,0,2)] , 

Q-.Z  =  5[2(0,0.2)  -  (2,0,0)  -  (0,2,0)],  Q,.  =  l(  1,1,0) , 

(2..-  =  5(1-0,1),  0..  =  §(0,1,1). 

Octopole: 

fix.v  =  5[2(3,0,0)  -  3(  1,2,0)  -  3(  1,0,2)], 

V  =  §[2(0,3,0)  -  3(2,1,0)  -  3(0.1,2)] , 

=  i[2(0,0,3)  -  3(2,0,1 )  -  3(0,2,1 )], 

«.x,-  =  5[4(2,1,0)  -  (0,3,0)  -  (0,1,2)] , 

n.x.-  =  §[4(2,0,1 )  -  (0,2,1 )  -  (0,0,3)], 

fixn- =  5l4(  1,2,0)  -  (3,0,0) -( 1,0,2)] , 

fi,,,  =  §(  1,1,1 )  =  §[4(  1,0,2)  -  (3,0,0)  -  ( 1,2,0)] , 

nnr  =  5[4(0,2,l)- (2,0,1) -(0,0,3)], 

^yzz  =  5[4(0,1,2)  -  (2,1,0)  -  (0,3,0)] , 
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Hexadecapole: 

=  (4,0,0)  -  3[(2,2,0)  +  (2,0,2)]  +  i[(0,0,4)  +  (0,4,0)]  +  t(0,2,2) , 
Tnu-  =  §(3,1,0)  -  -^[(1.3,0)  +  (1,1,2)], 
rxvv.-  -  §(3,0,1 )  -  ■^((  1,2,1)  +  ( 1,0,3)] , 
r,,.,  =  i[9(2,2,0)  -  (2,0,2)  -  (0,2,2)]  -  |l(4,0,0)  +  (0,4,0)]  +  i(0,0,4) , 
rxu:  =  i(2,i,i)-i[(0,i,3)  + (0,3,1)], 

rv,.-,  =  i[9(2,0,2)  -  (2,2,0)  -  (0,2,2)]  -  §[(0.0,4)  +  (4,0,0)]  +  §(0,4,0) , 
rMn  =§(  1,3,0) -^((1,1.2) +  (3,1,0)], 

Tm,-.-  =  t(  1,2,1 )  -  §[(1,0,3)  +  (3,0,1 ) . 

ru. .-.-  =  -^(  1,1,2) -§[(1,3,0) +  (3,1,0)], 

rv. -.-.-  =  §(  1,0,3) --^[(1,2,1) +  (3,0,1)], 

r,,,,  =  (0,4,0)  +  §[(0,0,4)  +  (4,0,0)]  -  3[(0,2,2)  +  (2,2,0)]  +  §(2,0,2) , 
r, =  §(0,3,1 )  -  ^[(0,1,3)  +  (2,1,1 )] , 
r,. =  i[9(0,2,2)  -  (2,0,2)  -  (2,2,0)]  -  §[(0,0,4)  +  (0,4,0)]  +  §(4,0,0) , 
r,.-.-.-  =  §(0,1,3)  -  -^[(0,3,1)  +  (2,1,1)] , 

=  (0,0,4)  +  §(2,2,0)  +  §((0,4,0)  +  (4,0,0)]  -  3[(2,0,2)  +  (0,2,2)]  . 

Following  Rabinowitz  and  Rein  [33],  we  can  define  a  primitive  multipole  mo¬ 
ment  operator  as 

=  x'y'z’‘  =  ,  (5) 

with  its  expectation  values  giving  all  the  integrals  necessary  to  compute  the  multipole 
moment  tensors  defined  above.  The  computation  of  the  nuclear  contribution  to 
the  multipole  moments  is  straightforward,  simply  replacing  the  electronic  coordi¬ 
nates  x,y,z  in  with  the  nuclear  ones  X,Y,Z.  We  therefore  consider  only  the 
electronic  contribution  in  this  paper. 

Coordinate  Systems  and  the  Rotational  Matrix 

The  STO  basis  functions  are  generally  associated  with  centers  (usually  on  nuclei) 

denoted  2&  A,  B, - The  center  of  a  primitive  moment  operator  of  Eq.  (5)  is 

denoted  as  O.  Lower-case  subscripts  a,  b,  and  o  specify  the  vector  components 
measured  from  the  centers  A ,  B,  ana  O,  respectively.  Unlike  Cartesian  Gaussians, 
STO  two-center  integrals  are  generally  evaluated  over  a  local  system  (see  Fig.  1) 
with  the  axis  between  the  two  centers  defining  the  local  Z,  Za,  and  Z,,  axes,  with 
the  local  X,  Xa,  Xh  and  Y,  Ya,  Y/,  axes  parallel.  Here  the  upper-case  notations  X, 
Y,  Z  represent  axes  in  the  local  system.  The  Za  and  Z*  axes  are  either  pointing  at 
each  other,  or  are  directed  in  the  same  direction.  We  adapt  the  latter,  as  shown  in 
Figure  1. 
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With  the  origin  of  the  Cartesian  coordinate  system  [X,y,Z]  placed  halfway  be¬ 
tween  the  two  basis  function  centers  A  and  B,  the  local  system  described  above  is 
used  to  define  a  prolated  spheroidal,  or  ellipsoidal  coordinate  system  focused  on  A 
and  B,  respectively  [35],  Adapting  the  definition  and  notations  of  reference  (35], 
we  list  some  important  relations  below  for  an  ellipsoidal  coordinate  system, 

ra  =  (B/2)(^  +  v),  r,  =  {R/2){^-r,), 

Xa=Xb  =  (i?/2)V($^  -  1)(1  -  V^)C0S  (p  , 
ya^yb  =  W2)V(?^-  1)(1  -I7^)sin  (P  , 

2„=={R/2){1+^v),  Zi  =  {R/2){[-^7,), 

dT==(R^IS){^^-V^)d^dvd<p.  (6) 

The  integrals  evaluated  in  a  local  ellipsoidal  coordinate  system  then  need  to  be 
back-transformed  into  a  laboratory,  or  molecular-based  coordinate  system,  which 
we  define  to  be  the  molecular  frame  lx',y’,z'] .  The  rotation  necessary  for  this  trans¬ 
formation  is  based  on  the  directional  cosines  (see  Fig.  1 ), 

cos  d  =  Az'/R  , 
cos  (p  =  Ay'/^Ax'^  +  Ay'^ , 


(7) 
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and  the  rotation  matrix  R{6,(t>)  can  be  obtained  in  two  steps; 

(i)  Rotate  about  the  y  axis  to  align  the  z  axis:  Rt{d), 

/X'’\  /X\  /cose  0  sme\/X\ 

r  =7?,(0)  y  =  0  1  0  y  .  (8) 

\Z7  \z/  \-sine  0  cos0/\z/ 

(ii)  Rotate  about  the  z"  axis  to  align  the  x"  and  y"  axes:  i?2{0), 

/X'\  /X''\  (cos4>  -sin<p  0\/X''\ 

r  =  R2(«^)  y"  =  sin  ^  cos<f>  0  y"  .  (9) 

\Z7  \Z'7  \  0  0  1/\Z7 

The  final  rotation  matrix  based  on  Eqs.  (8)  and  (9)  is  therefore 

(cos  6  cos  <}>  -sin  4>  sin  6  cos  </>\ 
cos  6  sin  0  cos^  sin  0  cos  ^  j  .  (10) 

-sin  e  0  cosd  / 

Method 

Translation  of  Centers 

The  most  common  way  to  proceed  from  here  is,  assuming  the  moment  definition 
of  Eq.  (3),  to  evaluate  the  integrals  such  as 

I  =  J  dQYT'iea,  <t>a)Yf{eo,  <i>„)Ypr(et,  <t,f)  ( i  la) 

and  to  translate  the  center  O  to  the  center  ^4  or  B  with  the  lowest  /  value,  say,  center 
B  with  t".  Making  use  of  the  translational  property  of  sperical  harmonics  [44]  and 
Gaunt  coefficients  [45],  this  yields  an  integral  with  sigma  {m  =  0),  pi  (m  =  1 ), 
delta  (m  -  2),  phi  (m  =  3),  etc.,  components.  The  maximum  m  component 
required  is  then  the  lesser  of  /"  +  /"'  and  i.e., 

/=  2  R„I„,  =  RJ,  +  {R,  +  Rr-)Ir  +  {R6  +  Rs)h+  (11b) 

rn“0 

where  the  R„'s  are  the  factors  due  to  rotating  the  integral  back  to  the  molecular 
frame,  and  the  /^’s  are  the  appropriate  integrals  evaluated  in  the  local  frame.  Note 
that  Rm  =  Rm{A  )m(B) — cach  center  must  be  rotated,  and  there  are  two  pi  components, 
two  delta  components,  etc. 

In  this  study  we  will  translate  the  angular  components  of  both  the  operator  and 
the  orbital  centered  on  ^  to By  doing  so,  center  A  will  only  have  a  sigma  com¬ 
ponent  and  Eq.  ( 1  lb)  will  have  the  simple  appearance  of 

r  =  R^B)I,-  (11c) 

We  will  show  that  the  evaluation  of  I,  and  the  rotation  R^a)Rc{B)  =  R^b)  are  very 
simple.  The  use  of  Eq.  ( 1  Ic)  rather  than  Eq.  ( 1  lb),  and  its  simplicity,  are  at  the 
heart  of  this  work. 
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Considering  a  general  laboratory  coordinate  system  [x,y,z]  and  a  particular  one 
[Xa,ya,Za]  Centered  atA{ax,ay,a:),  we  have 

Xa  ~  X  9 


y^  =  y-  ay , 

2a^z-a..  (12) 


To  translate  the  coordinate  system  [Xa,ya,Za]  to  another  one  [Xb,yb,Zh]  centered 
at  B(bx,by,b;),  we  write 

Xa  =  Xb  +  (bx  -  ax)  =  Xa  +  bax , 
ya  =  yh  +  (by  -  ay)  =  +  bay , 

z^  =  Zb  +  (b.  -  a^)  ^  Zb  +  l\i, .  (13) 


An  arbitrary  Cartesian  defined  in  the  previous  section  is  therefore  translated  from 
center  A  to  center  B  by  way  of 


X'ay'aZa  =  2 

r«0 


ba'x'x'b  2 
s-Q 


U.jM  _ _ 

2  Cr,u(bax,bayMz)XhyU'h  • 


(r.j.O- (0,0,0) 


(14) 


The  translations  in  terms  of  Eq,  ( 14)  enable  a  multipole  moment  integral,  apart 
from  a  normalization  factor,  to  assume  the  following  form, 

=  (r2‘''‘‘~'x‘ayiz^„e'^<‘'<‘\x‘„y:^z'^\r"b0~''’''x^byhzU'^^‘’} 

(/,;.*)  _  _  _  _  _  _ 

“  2  bUy,  bdz)  2  ^W.U,K'( 

(r,j,/)-(0,0.0)  0.0.0) 


where  center  0(0x,0y,02)  places  the  origin  from  which  the  multipole  moment 
integral  is  evaluated. 

In  this  development  it  is  clear  that  a  moment  integral  is  a  combination  of  overlap 
integrals,  which,  in  terms  of  Eq,  ( 1 5 ) ,  are  always  between  an  s  orbital  and  a  higher- 
order  orbital.  The  translation  to  achieve  this  is  straightforward.  However,  the  sixfold 
sum  that  appears  in  Eq.  ( 15),  if  directly  implemented,  is  computationally  inefficient, 
as  was  first  experienced  by  Taketa  et  al.  in  their  work  on  Gaussian  integrals  [46]. 
For  this  reason  we  translate  an  STO  centered  at  A(Xa,ya,Za)  and  the  moment  operator 
centered  on  OiXo,yo,  Zg)  separately  onto  the  center  B(Xb,yb,Zb)  using  two  sets  of 
predetermined  formulas.  With  the  convention  that  all  left-hand  quantities  in  a 
formula  are  centered  on  A ,  and  those  on  the  right-hand  side  are  centered  on  B,  we 
list  these  formulas  as  follows: 

Master  Formulas  for  STO  Translation: 

lm='L  g“(bax,bay,ba:)(ij,  k)^ , 
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p,  =  V(3/47r)[to,(0,0,0)  +  ( 1,0,0)] , 

Py  =  V(3/4,r)[H( 0,0,0)  +  (0,1,0)] , 

P.  =  V{3/4,r)[^,(0,0,0)  +  (0,0,1)] , 

=  V(5/167r)[(2^?  -Ml-  Mf.)(0,0,0)  -  2baA  1,0,0)  -  2K(0,1,0) 

+  4R-(0,0,1  )  -  (2,0,0)  -  (0,2,0)  +  2(0,0,2)] , 

=  V(15/167r)[to2  -  bdl)(0,0,0)  -  2bay{0,\,0) 

+  26H;,(1,0,0)  + (2,0,0) -(0,2,0)], 

d:,y  =  V(15/47r) [^,^^(0,0,0)  +  Myi  1,0,0)  +  baAOAfi)  +  ( 1,1,0)] , 
dy,  =  V(15/47r)[^;,^,(0,0,0)  +  MA10,0)  +  ^^(0,0,1)  +  ( 1,0,1 )] , 

4,  =  V(15/47r)[^,i;a,(0,0,0)  +  MA0,l,0)  +  H(0,0,1 )  +  (0,1,1 )] , 

/.}  =  V(7/I6ir)[(2bal  -  ^balba^  —  3(>ajftfli)(0,0,0)  -  6baxbaA  1,0,0) 

-  6H^,(0,1,0)  +  i6bal  -  3bal  -  3^^,)(0,0,1 ) 

-  3^s(2,0,0)  -  eboxi  1,0,1 )  -  35a,(0,2,0)  -  6^(0,!, 1 ) 

+  6^.(0,0,2)  -  3(2,0,1)  -  3(0,2,1)  +  2(0,0,3)] , 

fx:2  =  V(21/327r)  [{AbOxbal  -bal-  bajbal )(0,0,0) 

+  [Aba]  -  3bal  -  6flJ.)(  1,0,0)  -  2baxb>ay(0,\ ,0) 

+  SbajbaAOAl )  -  3^;,( 2,0,0)  -  2%(  1,1,0)  -  K(0,2,0) 

+  8M,(  1,0,1)  +  4R(0,0,2)  -  (3,0,0)  -  ( 1,2,0)  +  4(  1,0,2)] , 
fy.2  =  y(21/32-ir)l(4baybal  -  balbUy  -  ba].)(,0fi,0)  -  2fcO;cfray(  1,0,0) 

+  8(>a^.6a.(0,0,l)  +  {4ba\  -  bal  -  3()flJ)(0,l,0) 

-  H(2,0,0)  -  2bax{  1,1,0)  -  3fta, (0,2,0)  +  83a;(0,l,l ) 

+  4to,(0,0,2)  -  (2,1,0)  -  (0,3,0)  +  4(0,1,2)] , 

=  V(105/16ir)[to..(^i  -  ba].){Q,0fi)  +  2baxba,{  1,0,0) 

-  2baybaA^^A>)  +  {bal  -  ba].){0S>,\ ) 

+  MA2,0,0)  +  2bax(  1,0,1 )  -  baA0,2,0) 
-2K(0,1,1)  + (2,0,1) -(0,2,1)], 

=  V(  105/47r)  [^,vftfl,.(>a.-(0,0,0)  +  baybaAUOfi)  +  baJjaA^^Afi) 

+  baxbay{0,0,l)  +  baA  1,1,0)  +  bay{  1,0,1 ) 

+  i^,(0,l,l)  +  (  1,1,1)], 

/,3_3,,2  =  V(35/32x)[(R  -  MMl){0,0,0)  T  3{Ml  -  ^J)(  1,0,0) 

-  6^;^, (0,1,0)  +  36a.v(2,0,0)  -  6H.(  1,1,0) 

-  3^;,(0,2,0)  +  (3,0,0)  -  3(  1,2,0)] , 
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fi;,2y^yi  =  ^(35 /32ir)[{2balbay  -  bal)(0j0j0)  +  6baxbay{l,0,0) 

+  3(bal  -  ^?.)(0.1.0)  +  3bay{2,0,0)  +  1,1,0) 

-  3K(0.2,0)  +  3(2,1.0)  -  (0,3,0)] , 

g;*  =  {3/l6^){(3bat  +  6balbal  +  3bay  -  24balbal  +  Sbat 

-  24balba])(0,0,0)  +  ( I2bal  +  2baJ)a\  -  4ibaJ)a\){  1,0,0) 
+  \2{balbay  +  baj.  —  4baybal)(0,l,0)  +  \6(2bal  —  3balba; 

-  3balba-){0,0,l)  +  6{3bal  +  ba].  -  4ba]){2,0fi) 

+  24baxbay(  1,1,0)  -  96baJ)a:{  1,0,1 )  +  6(bal  +  3ba]. 

-  4fca?)(0,2,0)  -  96ftayZ)a;( 0,1,1 )  -  24{bal  +  ba], 

-  2^?)(0,0,2)  +  12teJ(3,0,0)  +  (1,2,0)  +  4(  1,0,2)) 

+  12K[(2,1,0)  +  (0,3,0)  -  4(0,1,2)]  -  16R-[3(2,0,1 ) 

+  3(0,2,1)  -  2(0, 0,3)1  +  3(4,0,0)  +  6(2, 2,0) 

-  24(2,0,2)  +  3(0, 4,0)  -  24(0,2,2)  +  8(0.0,4)}  , 

gx:i  =  {(4baJ)a\  -  3bal^.  -  3^xbajM.){0,0,0) 

+  i4ba]  -  9balba.  -  3balba:)(  1,0,0 'i 

-  6baxbayba:{0,l,0)  +  3{4baxba]  -  bal  -  'ba^l){0,0A) 

+  3{4ba\  -  3bai  -  baj){  1,0,1 )  -  3^,R-[3(2,0,0) 

+  (0,2,0)  -  4(0,0,2)]  -  6ba,Mx{l,\,0)  -  6baxbay{0,l,\) 

+  3^,[4(  1,0,2)  -  (3,0,0)  -  ( 1,2,0)]  +  te^[4(0,0,3) 

-  3(0,2,1 )  -  9(2,0,!)]  -  6bBy{  1,1,1 )  -  3(3,0, 1 ) 

-  3(  1,2,1  )  +  4(  1,0,3)}, 

gy;}  =  I  V(10/7r)  { (4bayba\  -  3balMx  -  3balbayM:){0,0,0) 

+  (4bal  -  9baj:bax  -  3balbax){0,l,0)  -  bbaMMzi.  1,0,0) 

+  3(4baybal  -  ba].  -  baibay)(0,0,\ )  +  3{4bal  -  3ba] 

-  R)(0,1,1 )  -  3bayba,[{2,0,0)  +  3(0, 2,0)  -  4(0,0,2)] 

-  6K^,(  1,1,0)  -  bba]My{  1,0,1 )  +  3^,[4(0,1,2) 

-  (0,3,0)  -  (2,1,0)]  +  5^, 14(0,0,3)  -  3(2,0,1)  -  9(0,2,!)] 

-  6^;,(1,1,1)  -  3(0,3,!)-  3(2,1,1)  +  4(0,1,3)}  , 

=  i  V(5/7r)  {(tej  -  ba]  +  bbalbai  -  6^55a?)(0,0,0) 

+  4{3bayMl  -  bal)(l,0,0)  -  4{3baMl  -  ba])(Q,lfi) 
+  I2(balbai  -  ba]ba.){0,0,l)  -  6(bal  -  6a?)*(2,0,0) 
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+  246a;^a,(  1,0,1 )  +  (){bal  -  fta?)(0,2,0) 

-  lAbUyba-iO^A)  +  6(6aJ  -  bal.){Qfi,2) 

+  43fl,[3(  1,0,2)  -  (3,0,0)]  +  12R.[(2,0,1)  -  (0,2,1)] 

+  4H[(0,3,0)  -  3(0, 1,2)]  -  (4,0,0) 

_  +6(2,0,2)  + (0,4,0) -6(0,2,2)}, 

gxyz^  =  J  V(5/ir)  {{(>bajbayba\  -  balba,,  -  ba^bal)(0,0,0) 

+  {6baybal  -  SbalbOy  -  bal){  1,0,0)  +  (6baJ)al  -  ba\ 

-  2baxbal)(0,l,0)  +  llbOxbayboziOfi,!)  +  3(2bal  -  bal 

-  ^J)(  1,1,0)  -  3to;^,((2,0,0)  +  (0,2,0)  -  2(0,0,2)] 

+  l2baybaAl,0,i)+  l2baMz{0,l,l)  -^,[{2,0,0) 

+  3(  1,2,0) -6(  1,0,2)] -R[3(2,1,0) 
f  (0,3,0) -6(0, 1,2)]  +  125^,(  1,1,1) 

-  (3,1,0)- (1,3,0) +  6(  1,1,2)}, 

gnxi-3xy^)  =  i  y(.20/ir){[balba!  -  2baJ)alba2){0,0fi) 

+  2(ba}^z  -  3aJtoj)(  1,0,0)  -  6tox^y^Oj(0,l,0) 

+  (bal  -  2baJ)al)(0,0,\ )  -  6baybaz(  1,1,0) 

+  36H;^,I(2,0,0)  -  (0,2,0)]  +  3(R-2iJ)(  1,0,1) 

-  6ia;^a^(0,l,l)  +  to,[(3,0,0)  -  3(  1,2,0)] 

+  3^J(2,0,1 )  -  (0,2,1 )]  -  6Uy(  1,1,1 ) 

_ +(3,0,1) -3(1,2,!)}  , 

gzox^y-y^'  =  IHtOM  {i2balbaybaj  -  Wyba,)(0,0,0) 

+  2(balbaj  -  bajba^XOyX ,0)  +  6baxbaybai(  1,0,0) 

+  (2balbay  —  bal)(0,0,l )  +  6baJ)az{  1,1,0) 

+  3H^^[(2,0,0)  -  (0,2,0)]  +  2{bal  -  ^5)(0,1,1) 

-  i,b^M(  1,0,1 )  +  *^,[3(2, 1,0)  -  (0,3,0)] 

+  3^^(2,0,1 )  -  (0,2,1 )]  +  6BA  1,1,1 ) 

-(0,3,1) +  3(2,1,!)}, 

gxV  =  ^  V(35/ir)  { (bal  -  6bal^l  +  )(0,0,0)  +  4(bal  -  2baMl- )( •  AO) 

+  4(bal  -  2balbay)(0,\S))  +  ()(bal  -  i»flj)[(2,0,0) 

-  (0,2,0)]  -  24baMy(  1,1,0)  +  4R((3,0,0) 

-  3(  1,2,0)]  +  4fca,J(0,3,0)  -  3(2,1,0)] 

+  (4,0,0) -6(2,2,0)  + (0,4,0)}, 
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8x)u^-y^)  =  I  \{35/-jr){{balbay  -  ba^bal){0,0,0)  +  {3balbay  -  bal){  1,0,0) 

+  (bal  -  3baJ)al)(0,l,0) 

+  3BM[(2,0,0)  -  (0,2,0)]  +  3(Ml  -  biDUXO) 

+  ^,{(3.0,0)  -  3(  1,2,0)]  +  ^^[3{2,1,0)  -  (0,3,0)] 

+  (3,1,0) -(1,3,0)}. 

Master  Formulas  for  Operator  Translation: 

(n,  /,  m)°  =  'Zf°{box,  bOy,  bo,)(iJ,  k)f , 

w 

( 1,0,0)  =  te,(0,0,0)  +  (  1,0,0), 

(0,1,0)  =  to,.(  0,0,0)  +  (0,1,0), 

(0,0,p.  =  to,(0,0,0)  + (0,0,1), 

(2,0,0)  =  R(0,0,0)  +  2R(  1,0,0)  +  (2,0,0) , 

(1,1,0)  =  ^^Oy{Q,m  +  H(  1,0,0)  +  R(0,1,0)  +  (1,1,0) , 

( 1,0,1 )  -  bojbb-A0,(),0)  +  bo.{  1,0,0)  +  R(0,0,1 )  +  ( 1,0,1 ) , 

(0,2,0)  =  R(0,0,0)  +  2R(0,1,0)  +  (0,2,0) , 

(0,1,1)  =  RR(o,o,o)  +  R(0,i,0)  +  R.(0,o,i)  +  (o,i,i ) , 

(0,0,2)  =  ^?(0,0,0)  +  2R(0,0,1  )  +  (0,0,2) , 

(3,0,0)  =  R(0,0,0)  +  3R(  1,0,0)  +  3R(2,0,0)  +  (3,0,0) , 

( 1,2,0)  =  boJ}o].{0,Q,Q)  +  R(  1,0,0)  +  2bOxbOy{0,\,0) 

+  2R(  1,1,0)  +  R(0,2,0)  +  ( 1,2,0) , 

(1,0,2)  =  bOxbo](0,0,0)  +  boli  1,0,0)  +  2boJ)Oz{0,0,l) 

+  2R(  1,0,1 )  +  R(0,0,2)  +  ( 1,0,2) , 

(2,1,0)  =  bolbOy{0,0,0)  +  2boJ)Oy(l,0,0)  +  R(0,1,0) 

+  2R(  1,1.0)  +  R(2,0,0)  +  (2,1,0) , 

(2,0,1 )  =  RR(0,0,0)  +  2bofboA  1,0,0)  +  R(0,0,1 ) 

+  2R(  1,0,1 )  +  R(2,0,0)  +  (2,0,1 ) , 

( 1,1,1 )  =  bo^bpyboA0,0,0)  +  boybozi  1,0,0)  +  6oj<5oj(  0,1,0)  +  ftO;(5o^(0,0,l ) 

+  R(1,1,0)  +  R(1,0,1)  +  R(0,1,1)  + (1,1,1), 

(0,3,0)  =  R(0,0,0)  +  3R.(0,1,0)  +  3R(0,2,0)  +  (0,3,0) , 

(0,1,2)  =  bOybol(0,0fi)  +  bol{0,lfi)  +  2bOybo2{0,0,l) 

+  2R(0,1,1 )  +  R(0,0,2)  +  (0,1,2) , 
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(0,2,1 )  =  to?5o,(0,0,0)  +  2^to.-(0,l,0)  +  toj(0,0,l ) 

+  2io,.(0,l,l )  +  ^.-(0,2,0)  +  (0,2,1 ) , 

(0,0,3)  =  to|(0,0,0)  +  36o?(0,0,l)  +  3boA0,0,2)  +  (0,0,3) , 

(4,0,0)  =  (0,0,0)  +  4to^(  1,0,0)  +  6^2  (2,0,0)  +  4^^  (3,0,0)  +  (4,0,0) , 

(3,1,0)  =  bolbo, (0,0,0)  1-  ibolboyi  1,0,0)  +  bol(0,l,0)  +  3bojoy{2,0,0) 

+  3^?(  1,1,0)  +  (3,0,0)  +  3boA2,l.O)  +  (3,1,0) , 

(3,0,1 )  =  ^l^A0,0,0)  +  ibolboA  1.0,0)  +  boU0,0,\ )  +  3boli  1,0,1 ) 

+  3boM..(2,0,0)  +  to.-(3.0,0)  +  3(wU2,0,l )  +  (3.0,1 ) , 

(2,2,0)  =  ^^^^(0,0.0)  +  2bbjbo].(  1,0,0)  +  2bolbOy(0,l,0) 

+  ^?(  2,0,0)  +  4bOybOy(  1,1,0)  +  to^(  0,2,0) 

+  2boA  1,2.0)  +  2*0, .(2, 1,0)  +  (2,2,0) , 

(2.1.1 )  =  bo\bo^bO;(0,0,0)  +  2boJ}Oybo:(  1,0.0)  +  bo\bO:(0,\,0) 

+  bolbOy(0,0,\)  +  bOybo:(2,0,0)  +  2*o^O;(  1,1,0) 

+  2*o^H(  1,0.1 )  +  *oi(0.1,l )  -I-  ^.(2,1,0) 

+  *3,(2,0,1)  +  2*o^(  1,1,1) +  (2,1,1), 

(2,0,2)  =  ^i*o?(0,0.0)  +  2bojbol(  1,0,0)  +  2*o?to..(0,0,l ) 

+  AboJboA  1,0,1 )  +  *o?(2,0,0)  +  bol(0,0,2) 

+  2boA  1,0,2)  +  2to.-(  2,0,1 )  +  (2,0,2) , 

( 1,3,0)  =  boxbo]  (0,0,0)  +  3*Oy*Oj.(  0,1,0)  +  *Oy(  1,0,0)  +  3boJ)Oy(0,2,0) 

+  3*0,^  ( 1,1,0)  +  to;,(0,3,0)  +  3bOy(  1,2,0)  +  (1,3,0) , 

(1.2.1 )  =  bOxbolbO;(0,0,0)  ■'  2boJ)OybO:(0,l ,0)  +  bo^boA  1,0,0) 

+  *O;t*Oy(0,0,l )  +  bOxboA0,2,0)  +  2*o,.*0;(  1,1,0) 

+  2to;,*o^.(0,l,l )  +  *oJ.(  1,0,1 )  +  *o,(  1,2,0) 

+  ^;,(0,2,1)  +  2K(  1,1,1) +  (1,2,1), 

(1.1.2)  =  boJ)Oybo\(0,0,0)  +  *o,*0;(  1,0,0)  +  bOxbo\(0,\,0) 

+  2boxbOybO:(0,0,\ )  +  *o?(  1,1,0)  +  2bOyboA  1,0,1 ) 

+  2boxboA0,l,l )  +  bOxbOy(0,0,2)  +  bOy(  1,0,2) 

+  2^,(1, 1,1) +  ^,(0,1,2) +  (1,1, 2), 

( 1,0,3)  =  *ojo5(0,0,0)  +  bo\(  1,0,0)  +  3^:^o?(0,0,l )  +  3*o?(  1,0,1 ) 

+  3*0;<*o.(  0,0,2)  +  3bbA  1,0,2)  +  ^,(0,0,3)  +  (1,0,3) , 

(0,4,0)  =  (0,0,0)  +  4bol(0,l,0)  +  6boj(0,2,0)  +  4*o,. (0,3,0)  +  (0,4,0) , 
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(0,3,1)  =  bojboAOAO)  +  3to?.to,(0,l,0)  +  to,^.(0,0,l)  +  36oJ, (0,1,1) 

+  3bojo:{0,2-0)  +  ^.-(0,3,0)  +  36o,.(0,2,l )  +  (0,3,1 ) , 

(0.2,2)  =  to,2.^?(0,0,0)  +  2Hto?(0,l,0)  +  2bojbo,{0,0,l) 

+  4KR-(0,1,1  )  +  Aoi(0,2,0)  +  Zio;.(0,0,2) 

+  23oj  (0,1,2)  +  2^,(0,2,1)  +  (0,2,2) , 

(0,1,3)  =  tejto?(0,0,0)  +  6o?(0,l,0)  +  3*o..i^?(0,0,l)  +  3to?(0,l,l) 

+  3to,.R-(0,0,2)  +  3to.-(0,l,2)  +  H(0,0,3)  +  (0,1,3) , 

(0,0,4)  =  tol(0,0,0)  +  4boli  1,0,0)  +  6tor(0,0,2)  +  4to.-(0,0,3)  +  (0,0,4) . 

With  this  accomplishment  we  can  now  write  the  sixfold  sum  of  Eq.  ( 1 5 )  in  terms 
of  a  twofold  sum  over  the  two  Cartesians,  both  centered  on  B,  in  the  following 
fashion: 


=  {r”‘‘  'x'ayiz'^  ^‘'‘•\x‘oy”z''„\rf  ‘‘’''xlyUle'^'^'') 

.<^2,-4-.^  2  g"(i^.x,  T^v,  ba:)xf-'‘y'l<‘z'§“^e~^‘‘'‘  \ 

' 1 2  f«(bOx,  bby,  bO:)x‘l’yf'z'§-^^\rl>'-'>’-'x%yizle~^^^^ 

=  2  gl(^x,  bay,  ba:)fUbOx,  bOy,  bO:) 

U,V 

where  the  coefficients  gliba^,  ba^,  ba.)  and  fvibOx,  bo,,  bo^)  are  taken  from  the 
master  formulas  through  two  computational  do  loops. 


Local  Integral  Evaluation 

One-Center  Integrals:  After  the  translations  described  by  Eq.  ( 1 6 ) ,  all  one-center 
integrals,  apart  from  a  normalization  factor,  take  the  form 


Jo 


'0, 


r2x 

COS^  dt.  ddh  I 


(17) 


and  we  proceed  to  evaluate  the  nonvanishing  angular  part  of  I\  first. 
Assuming  m  and  « to  be  integers,  we  have,  in  general. 


cos"'  (psin”  (j)  d(i> 


m  -  1  r  -' 
m  +  n  Jo 


cos 


m-2 


sin"  <i>  dep  . 


(18) 
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Applying  Eq.  ( 18)  in  a  recursive  manner,  we  get 


—  1  yyi  —  3 

'  <t>  sin"  (l>  - - r  cos'”"'*  0  sin"  4>  cl4> 

m  +  n  m  +  n  -  2  Jo 

_  m - 1  m  —  3  m- 5 
m  ■¥  n  m  +  n  -  2  m  +  n  -  4 


X  cos'”-*  (j)  sin"  (p  dif>  = 
Jo 

(m-  !)!!«!!  ^ 

=  — —  sin”  <j)  d(l> . 

+  «)!!  Jo 


{m  +  «)!!  Jo 

In  Eq.  ( 19)  we  need  to  examine  two  cases  of  the  sine  integral.  First,  if  n  is  even, 


sin"  <j>  d<t>  = -  sin"-^  <j>  d(]> 

n  Jo 

«  -  1  M  -  3 

- - r  sin" 

n  n  —  2  Jo 


sin"  *  (t>  d<f>  =  •  • 


(«-  1)!! 


(tt.  Iff) . 


Therefore,  for  even  m  and  even  «,  Eqs.  (19)  and  (20)  give  the  closed  formulae 

r  _(m- !)!!(«- l)!!,r 


cos'"  <l>  sin"  4>d4>  = 


{m  +  n)!! 


(w  -  1 )!!(«-  1)!! 

cos'”  (t>  sin”  4>  d4>  = - - — — — — ^  (Ztt)  =  F(m,  n) .  (21b) 

Jo  (m  +  n)!! 


We  then  examine  the  sine  integrals  with  odd  «  in  Eq.  ( 19): 


sin"  <t>  d<t>  =  -cos  <f>  - 


sin"  (p  d(p  =  0  .  { 

Jo 

Finally  Eqs.  (19)  and  (22)  give,  for  even  m  and  odd  n,  the  closed  formulae 


cos'"  0  sin"  d<t)  = 


(m  +  n)!! 
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All  the  angular  integrals  necessary  for  I\  are  given  by  Eqs.  (21 )  and  (23).  The 
radial  part  of  Ii  is  simply  Jq“  . 

Two-Center  Integrals:  In  an  ellipsoidal  coordinate  system  we  use  Eq.  (6)  noted 
in  the  previous  section  to  express  a  two-center  integral  that  evolves  from  the  trans¬ 
lation  ofEq.  (16), 

Jl  J-1  Jo 

X  -  1 )( 1  -  1  -  cos'  <t>  sin-*  <f> 

X  -  1 )( 1  -  1  -  |>?)*^-'’‘-^'”’-F(/,  J) ,  (24) 

where 

P  =  (F/2)(f,  +  ffc), 

r  =  (L-  f6)/(f<,  +  M 
and  FUJ)  is  given  by  Eq.  (21b). 

At  this  point,  we  feel  it  is  convenient  if  a  new  function  is  introduced  as 
Za.fl.r.a(p,  r)  =  J"  di  £  dn{^  +  r,m  "  nY 

X  [(?'  -  1 )( 1  -  n')]’(  1  -  ,  (25) 

which  we  shall  call  “Z  functions.”  The  recursion  formulas  of  Z  are  simpler  than 
those  derived  by  Ruedenberg  et  al.  [28].  In  this  work  we  adapt  an  algorithm  pro¬ 
posed  by  Stevens  [47,48]  based  on  the  binomial  theorem, 

X  Za+(J+2T+{-i-^-2i-n)(p)F,+y+/+ni(p,  r)  ,  (26) 


where  the  A  and  B  functions  are  as  defined  by  Mulliken  et  al.  [25],  and  have  been 
well  studied  by,  among  others.  Miller  et  al.  [49]  and  Harris  [50]. 

In  this  formalism,  the  two-center  integral  of  Eq.  (24)  is  evaluated  as  the  product 
of  Z  and  F  functions, 

~  Zn„-/a,nj-/t,(i+2)/2,ft(P»  ‘r)F{^i,j)  .  (27) 


Rotational  Transform  into  Molecular  Frame 

As  discussed  in  the  previous  section,  all  integrals  evaluated  in  this  section  need 
to  be  back-transformed  into  molecular  frame  [x',y',z']  (see  Fig.  1 )  through  a  matrix 
rotation 
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I  x\  /r,,  ri2  ’'ti'K/xX  /  x'\ 

0)1  >')  =  I ''21  T22  r23|l  >'1  =  I  /  I  , 

\z/  \r3i  0  rij\zl  \z'J 


(28) 


where  the  matrix  elements  are  given  by  Eq.  ( 10).  We  now  let  the  rotation  described 
by  Eq.  (28)  act  on  an  arbitrary  Cartesian  in  the  ellipsoidal  coordinate  system, 

Rx'yJz'‘  =  {rux  +  ri2y  +  r,3z)'‘(r2iX  +  r22j'  +  riizYir^x  +  r^^zf 


2  tr.j./(ni,  ri2,  n3)ci.r.»(l'2l,  ''22,  r2l)c’l,_^{rn, 


(29) 


where 


cL/(rii.  ri2,  r,3)  =  (/!/r!i!ri)r;irl2r'i3 
c  «.v.H(r2i,  r22,  r2i)  =  (j!/M!n!w!)r?,r^2>‘23 
4,«(''3i.  r33)  =  {k\/plq\)r3irl2  (30) 

Considering  I2  of  Eq.  (24),  we  now  have 
RI2  =  R{rl‘‘e~^‘'‘‘\rb'’XbybZbe~'‘’^'’') 

=  {r''a‘‘e~^‘'‘\rb‘’{RXby^b2b)e~'‘'^'’) 

=  2  c'rjjCij,,„Cp,g{rl‘-e~^'‘''‘\rfx'b'‘*^ylb''Zb'^‘’e'’‘‘^>’)  y  (31) 

(r,s,tHu.vM(p,<l) 

where  we  have  applied  Eq.  (29)  and  used  the  rotational  equivalence  of  coordinate 
systems  [x,y,z\  and  [Xb,yb,Zb]  (see,  again.  Fig.  1 ). 

In  this  approach  it  is  remarkable  that  the  rotation  i?(0,  0)  needs  to  act  only  on 
the  right-hand  side  of  Eq.  (24)  to  rotate  the  integral  h  into  the  molecular  frame. 

Computational  Procedure 
The  method  presented  here  proceeds  as  follows: 

(i)  For  two  orbitals  on  the  same  center,  the  operator  is  translated,  the  sym¬ 
metry  is  checked,  and  if  nonzero,  integral  evaluation  proceeds  as  outlined  two 
subsections  ago. 

(ii)  For  orbitals  on  different  centers: 

( 1 )  Choose  the  orbital  with  bigger  /  in  Eq.  ( la)  to  be  centered  on  B. 

(2)  Translation  is  made  in  two  do  loops  of  Eq.  ( 16)  by  using  the  master  formulas. 

(3)  Determine  the  order  L  =  ^+  /+g  +  + 1%  +  ty.u  +  ty.v  + of  the 

Cartesian  on  the  righthand  side  of  Eq.  ( 16). 

( 4 )  Sort  out  the  angular  symmetry  “allowed”  Cartesians  of  the  order  L  determined 
in  step  (3)  by  way  of  the  F(i,j)  functions  shown  two  subsections  ago. 

(5)  Compute  the  necessary  Z  functions  {Eq.  (26)],  and  then  the  local  integrals 
[Eq.  (27)]  required  by  step  (4). 

(6)  Rotate  the  computed  local  integrals  into  the  molecular  frame  by  using  Eq. 
(31). 
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Discussion  and  Conclusions 


A  general  method  of  evaluating  electric  moment  integrals  over  Slater-type  orbitals 
is  developed.  A  program  that  implements  the  procedure  just  described  above  has 
been  in  use  in  this  laboratory  for  some  time.  We  have  been  using  ihese  moments 
for  studies  on  solvation  using  the  self-consistent  reaction  field  model,  and  for  ex¬ 
amining  the  long-range  interactions  between  molecules. 

The  method  we  outline  is  somewhat  unusual  in  that  the  Cartesian  components 
of  one  orbital  and  the  operator  are  both  translated  to  the  center  of  another  orbital 
and  thus  only  a  single  sigma  symmetry  integration  remains  between  an  5  orbital 
on  one  center  and  a  complex  Cartesian  orbital  of  the  form  x'y^z''r'e~^'  on  the 
other.  The  local  evaluation  of  this  integral  is  easy,  as  is  the  rotation  back  into  the 
molecular  frame.  Although  the  method  is  quite  general,  the  price  we  pay  for  this 
computational  simplicity  is  formal  complexity.  Generating  explicit  formula  for 
higher  moments,  or  higher  angular  momentum  atomic-like  orbitals,  is  tedious.  We 
do  not  maintain  the  “symmetry”  of  the  spherical  harmonics,  and  we  pay  a  penalty 
for  this. 

A  definite  advantage  of  the  method  described  is  that  we  need  to  consider  the  z 
component  of  one  atomic  orbital,  and  the  Z  function  we  need  ( Eq.  ( 25 ) )  is  simpler 
than  the  traditional  L  function  [30], 


£  drAi  +  -  »?)"( 1  +  kriV 

X(I  1)(1 


(32) 


or  the  C  function  [28], 


(33) 

Table  1.  Percentage  of  Cartesians  needed  to  integrate  with 

Number  of  Cartesians 

Number  of  Cartesians  with 

Order 

{A) 

nonzero  angular  integral  (B) 

B/A 

0 

1 

1 

1.00 

1 

3 

1 

b.33 

2 

6 

3 

0.50 

3 

10 

3 

0.30 

4 

15 

6 

5 

21 

6 

0.29 

6 

28 

10 

0.36 

7 

36 

10 

0.28 

8 

45 

15 

0.33 

9 

55 

15 

0.27 

10 

66 

21 

0..32 

11 

78 

21 

0.27 

12 

91 

28 

0.31 
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Both  L  and  C  functions,  if  evaluated  in  the  fashion  of  Eq.  (26)  require  a  sixfold 
sum,  while  the  evaluation  of  Z  functions  needs  a  fivefold  sum,  a  saving  of  the 
innermost  do  loop  in  computation.  It  would  be  worthwhile  to  further  investigate 
the  Z  function  recursion  relations. 

Before  concluding,  we  further  note  that  the  “Cartesian  STOs"  are  prescreened 
based  on  angular  symmetry  before  the  integration  takes  place.  Note  that  /  andy  in 
the  Cartesian  (/,  j,  k)  that  occurs  in  Eq.  (24)  must  both  be  even  or  the  integral 
vanishes.  For  example,  a  typical  nonvanishing  quadrupole  integral  between  d  orbitals 
contains  on  average  1 8  different  terms  of  64  possible,  but  only  five  or  six  are  nonzero. 
In  Table  1  we  list  the  number  of  Cartesians  for  a  given  order,  and  the  number  of 
Cartesians  that  have  nonvanishing  angular  integral  based  on  the  prescreening  of 
F(/,  j).  This  table  shows  that  in  the  integration  with  Cartesian  STOs,  there  is  a 
saving  of  about  70%. 
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Abstract 

Recently  we  presented  an  extension  of  the  direct  reaction  field  (drf)  method,  in  which  a  quantum 
system  and  a  set  of  point  charges  and  interacting  polarizabilities  are  embedded  in  a  continuum  that  is 
charactenzed  by  a  dielectric  constant  c  and  a  finite  ionic  strength.  The  reaction  field  of  the  continuum 
is  found  by  solving  the  (lineanzed)  Poisson-Boltzmann  equation  by  a  boundary  element  method  for 
the  complete  charge  distribution  in  a  cavity  of  arbitrary  size  and  form.  Like  many  other  authors,  we 
found  that  the  results  depend  critically  on  the  choice  of  the  size  of  the  cavity,  in  the  sense  that  the 
continuum  contnbution  to  the  solvation  energy  decreases  rapidly  with  the  relative  cavity  size.  The  literature 
gives  no  clues  for  the  definition  of  the  cavity  size  beyond  “physical  intuition"  or  implicit  fitting  to 
experimental  or  otherwise  desired  results.  From  theoretical  considerations,  a  number  of  limitations  on 
the  position  of  the  boundary  are  denved.  With  a  boundary  defined  within  these  limitations,  the  exper¬ 
imental  hydration  energies  cannot  be  reproduced,  mainly  because  of  the  neglected  specific  interactions. 
In  addition,  we  found  that  the  description  of  the  solute’s  electronic  states  also  depends  on  the  solvation 
model.  We  suggest  that  one  or  more  explicit  solvent  layers  are  needed  to  obtain  reliable  solvation  and 
excitation  energies.  ©  1993  John  Wiley  &  Sons,  me. 


Introduction 

Dielectric  continuum  models  have  become  quite  popular  in  the  calculation  of 
solvation  free  energies  and  solvation  effects  on  equilibria  and  electronic  spectra, 
combining  them  with  quantum  theoretical  [  1  -5  ] ,  semiempirical  [  6,7  ] ,  and  entirely 
classical  treatments  [8-10]  of  a  solute.  The  methods  are  based  on  the  notion  that 
electrostatic  behavior  of  solvent  bulk  can  approximately  be  described  by  a  single 
parameter,  viz.,  the  dielectric  constant  [11].  Some  methods  also  allow  for  a  finite 
ionic  strength  of  the  bulk,  introducing  an  additional  parameter  for  description  of 
the  bulk,  the  Debye  screening  length 

The  popularity  of  the  dielectric  model  is  based  on  the  success  in  reproducing 
solvation  energies  with  relatively  little  computational  cost.  For  hydration  energies, 
good  agioement  is  found  with  more  explicit  water  models,  as  used  in  molecular 
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dynamics  (MD)  [12]  and  Monte  Carlo  (MC)  [13-15]  simulations,  which  are  com¬ 
paratively  rather  expensive. 

In  this  paper,  the  theoretical  foundations  of  the  continuum  approach  are  revisited, 
in  order  to  derive  criteria  for  the  appropriateness  of  the  dielectric  model  and  for 
the  location  of  the  boundary  separating  solute  and  dielectric  from  the  limitations 
set  by  theory.  It  is  shown  that  most  current  models  do  not  correctly  apply  these 
criteria.  As  a  result,  these  models  are  not  general  in  the  sense  that  they  can  be 
extended  to  nonaqueous  solutions  without  reparameterization. 


Theoretical  Framework 

Consider  a  macroscopic  volume  described  on  the  quantum  chemical  level  (nuclei 
and  electrons  as  the  constituent  particles).  The  properties  of  the  system  follow  from 
the  wave  function  ^  of  the  system,  a  function  of  the  coordinates  of  all  particles, 
and  time,  defining  a  state  of  the  system.  In  the  case  of  stationary  states,  time  enters 
only  as  a  phase  factor.  Then  the  wave  function  can  be  found  by  solving  the  time- 
independent  Schrddinger  equation  H'i  =  where  the  Hamiltonian  .^contains 
the  kinetic  and  interaction  operators  of  all  particles.  Assuming  is  known,  properties 
of  the  system  can  be  calculated  by  integrating  over  all  space,  where  A  is  the 
operator  connected  with  the  property  of  interest.  The  effort  needed  for  this  inte¬ 
gration  can  be  reduced  enormously  if  ^  vanishes  for  certain  coordinates  in  large 
parts  of  space.  One  then  deals  with  localized  particles. 

If  a  group  of  particles  is  localized  in  the  same  region  in  space,  say  S,  the  integration 
may  be  carried  out  separately  over  the  particles  of  this  group  and  is  confined  to  the 
selected  volume  S  in  space.  If  no  other  particle  outside  the  group  has  a  contribution 
inside  the  volume,  the  part  of  the  wave  function  describing  the  particles  of  the 
group,  ^5,  may  be  factored  out  of  the  total  wave  function,  leaving  for  the  rest 
of  the  particles: 


^  =  (1) 

Because  of  the  indistinguishability  of  electrons,  this  can  never  be  done  in  principle, 
but  no  error  is  made  if  the  scalar  product  of  the  component  functions  equals  zero 
(i.e.,  if  the  component  functions  “do  not  overlap”). 

If  the  property  of  interest  is  the  energy,  the  result  of  integrating  the  AS  interaction 
part  of  the  Hamiltonian,  V^si  over  5  is  the  potential  of  the  particles  constituting 
group  S  in  space: 
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where  z,  and  r,  are  the  charge  and  position  vector  of  particle  /,  respectively.  Here 
the  regions  of  integration  are  indicated  by  the  italic  capitals  S  and  A .  Solving  the 
Schrodinger  equation  ( SE )  for  the  whole  system  can  now  be  partitioned  into  smaller 
problems,  for  which  the  SE  is  to  be  solved  in  an  external  potential.  This  is  the  group- 
function  approach  advocated  by  McWeeny  [16]  in  which  the  energy  E  is  minimized 
in  an  iterative  scheme.  Starting  from  some  trial  functions  for  subsystems  A  and  5, 
the  potential  in  A  due  to  S  is  calculated  as  in  (3).  The  energy  of  A  in  this  potential 


+  2  Za^sira) 

a^A 


(4) 


is  minimized,  giving  a  new  wave  function  .  Next,  the  potential  due  to  in  S  is 
calculated,  and  in  turn  is  optimized  in  this  potential.  A  new  potential  due  to  S 
in  A  is  found  and  applied,  etc.,  until  self-consistency  is  reached. 

In  a  perturbation  formalism,  where  the  starting  functions  for  and  are  the 
vacuum  (ground)  state  functions  and  ^5  of  the  subsystems  A  a.’d  S  and  the 
optimized  functions  are  expanded  in  the  orthogonal  set  of  solutions  to  the  vacuum 
problem  for  subsystem  A  (and  similarly  for  5),  it  is  easily  seen 

that  #5  can  be  split  into  two  contributions,  resulting  from  the  vacuum  charge 
density  and  the  density  change  induced  by  subsystem  A .  The  former  is  called  the 
static  potential  and  the  latter  the  response  pote.nial.  To  second  order  in  the  per¬ 
turbation  Vjs, 
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In  ( 5 )  the  coefficients  c*  and  wave  functions  are  taken  to  be  real.  Expanding  the 
perturbation  operator  V^s  around  a  representative  point  r?  in  S,  Ck  can  be  written 
as 

Eo  ~  Ek 

iffk 


using  the  orthogonality  of  the  zeroth-order  functions.  The  field  operator  of  subsystem 
A  at  the  representative  point  in  S  and  the  dipole  operator  of  subsystem  S  with 
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respect  to  the  representative  point  are  now  recognized.  Inserting  (6)  in  the  second 
term  of  (5)  and  making  the  same  expansion  for  the  potential  operator,  truncating 
at  the  dipole  term,  the  second-order  expression  for  the  dipole  polarizability  of  sub¬ 
system  S  emerges,  thus  showing  the  connection  of  the  second  term  in  (5)  to  the 
linear  response  functions  of  subsystem  S: 
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The  meaning  of  (7)  is  evident:  the  field  due  to  a  source  in  A  at  r?  induces  a 
dipole  proportional  to  the  polarisability  as  at  r?,  which  in  turn  gives  rise  to  a 
(response)  potential  at  the  source  in  subsystem  A.  Higher  order  responses  follow 
similarly  from  the  continued  expansion  of  the  second  term,  and  from  the  third 
term  in  (5),  connecting  to  the  quadratic  response  functions  of  S. 

In  order  to  reduce  the  number  of  degrees  of  freedom  of  the  system,  the  static 
and  response  potentials  due  to  S  should  be  mapped  on  some  functional  form  con¬ 
taining  far  less  parameters  than  the  original  number  of  particles  in  S.  The  reduction 
is  achieved  by  truncating  the  expansions  in  the  Taylor  series  of  the  operators  and 
the  order  of  response  theory.  In  practice,  the  Taylor  series  are  usually  truncated  at 
the  dipole  level  and  the  responses  at  second  order  in  the  perturbation,  as  shown 
here.  Apart  from  these  truncations,  the  size  of  the  subsystems  of  which  the  properties 
are  determined  is  also  reflected  in  the  sophistication  of  the  mapping  on  classical 
models  [17], 

The  great  advantage  of  the  treatment  outlined  above  is  that  both  the  permanent 
moments  and  the  response  functions  are  defined  entirely  in  terms  of  the  subsystem(s) 
S,  and  can  be  obtained  from  explicit  calculation  at  any  level,  or  from  experiment. 
The  permanent  moments  and  the  response  functions  may  be  regarded  as  parameters 
describing  the  collective  properties  of  the  particles  comprising  subsystem  S. 

The  theoretical  development  so  far  requires  solute/solvent  separation  for  more 
or  less  formal  reasons,  and  linear  response  of  the  solvent  for  practical  reasons.  In 
the  next  section  we  will  analyze  various  model  response  functions  with  respect  to 
these  requirements. 


Dielectric  Models  Scrutinized 

The  separation  of  two  or  more  systems  and  the  (non)linearity  of  their  mutual 
interactions  are,  of  course,  strongly  connected.  For  example,  the  interactions  between 
the  three  atoms  in  a  single  water  molecule  are  highly  nonlinear  because  their  charge 
distributions  overlap  strongly  and  the  electric  potentials  and  fields  are  too  large  to 
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expect  linear  response.  As  far  as  the  enei^y  is  concerned,  the  only  way  to  deal  with 
this  probler;  to  describe  the  system  in  terms  of  electrons  and  (effective)  nuclei, 
and  solve  a  set  of  coupled  equations  which  contain  all  of  the  nonlinear  problems. 
One  of  the  pi  operties  of  the  resulting  charge  distribution  is  its  ( dipole )  polarizability 
describing  th-  ii  ear  response  of  this  collection  of  atoms. 

In  the  watC'  iiner  at  its  optimal  geometry,  the  largest  overlap  between  two  MOs 
is  about  0.05  { this  was  calculated  with  program  suites  SYMOL  and  GNOME  [18], 
with  '"'♦imized  orbitals  in  the  water  dimer).  Whether  or  not  this  is  a  “large'’  overlap 
depends  on  the  context  in  which  the  o-^erlap  criterion  is  used.  One  may  wonder  if 
the  electrostatic  interaction  between  the  molecules  can  be  described  with  a  linear 
response  model  at  this  distance,  or  more  generally:  Up  to  what  distance  is  this 
model  acceptable?  One  possible  answer  comes  from  simple  electrostatics  given,  e.g., 
by  Thole  [19].  Consider  two  isotropic  interacting  polarizabilities  a,  and  Uj  at  distance 
r.  The  effective  polarizability  along  the  axis  is 

a,  +  aj  +  AuiUj/ 

““ - [-4.,Vr‘  ■ 

When  r  approaches  (4a,a^)‘^*,  an  tends  to  infinity,  caused  by  the  cooperative 
interaction  between  the  two  induced  dipoles  along  the  axis.  The  trouble  starts  ob¬ 
viously  already  when  the  polarizability  volumes — i.e.,  spheres  with  radius  — 
start  overlapping.  Such  a  situation  should  either  be  avoided,  or  one  must  account 
for  the  nonlinearities,  e.g.,  by  screening  the  interactions.  In  our  model  for  reaction 
potentials  we  have  adopted  Thole’s  method  for  defining  an  effective  “many-body” 
polarizability,  in  which  d,,  =  1.662  (a,a,y^  is  the  minimum  distance  for  two  in¬ 
teracting  polarizabilities  without  screening  function,  where  the  factor  1.662  and 
the  polarizabilities  are  model  parameters  fitted  to  (experimental  or  calculated)  mo¬ 
lecular  dipole  polarizabilities.  If  the  polarizabilities  come  closer,  the  fields  of  the 
induced  dipoles  are  damped  to  account  for  the  overlap  of  the  electronic  densities 
connected  to  the  polarizabilities.  In  the  water  dimer  we  may  put  the  monomer 
electronic  polarizabilities  of  10  bohr^  on  the  oxygen  atoms,  which  are  in  the  equi¬ 
librium  geometry  about  6  bohr  apart.  Thole’s  criterion  shows  that  already  at  this 
distance  the  treatment  of  mutual  induction  effects  may  be  treated  completely  clas¬ 
sical  [20],  provided  the  inducing  fields  correspond  to  charge  distributions,  which 
are  correct  up  to  at  least  the  dipole  term  [21] .  Apparently  both  the  separation  and 
the  linearity  requirements  are  satisfied  in  this  situation. 

The  dielectric  continuum  model  maps  the  electric  behavior  of  a  bulk  volume 
onto  a  single  nonvanishing  parameter,  the  dielectric  constant,  describing  its  linear 
response  [11].  Jackson  [22]  has  shown  on  experimental  grounds  that  the  size  of 
representative  bulk  volumes  in  which  the  solvent  is  partitioned  should  be  at  least 
10’  bohr^,  being  macroscopic  on  an  atomic  scale,  but  microscopic  on  an  everyday 
experimental  scale.  On  this  scale,  the  bulk  is  homogeneous  and  structureless,  re¬ 
sulting  in  vanishing  electric  moments,  so  that  the  bulk  has  no  static  potential  con¬ 
tribution. 

In  continuum  models  for  solvation  it  is  necessary  to  define  a  cavity  in  ihe  dielectric 
to  contain  the  solute.  The  solute’s  response  potential  is  found  by  solving  Poisson’s 
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equation,  which  is  practicable  only  if  no  higher  order  than  linear  response  is  allowed. 
In  practice,  two  types  of  method  for  solving  the  Poisson  equation  are  in  use:  one 
based  on  spatial  integration  (on  some  grid)  [9],  the  other  based  on  (numerical) 
integration  on  the  surface  enveloping  the  solute  (usually  built  from  small  adjacent 
polygons)  [8,10].  The  surface  defines  the  boundary  between  the  solute  cavity  and 
the  solvent  bulk.  Of  the  second  type,  analytic  methods  exist  [1,3,7,23],  but  they 
are  only  possible  for  spherical  and  elliptical  cavities,  making  them  less  useful  for 
general  purposes. 

The  necessity  of  defining  a  cavity  around  the  solute  immediately  poses  the  problem 
of  where  to  put  its  boundary.  Because  the  reaction  potential  is  very  sensitive  to  the 
solute-boundary  distance,  getting  “physically  plausible  results”  strongly  depends 
on  the  careful  definition  of  the  boundary.  The  surfaces  employed  in  most  models 
resemble  a  Connolly  surface  [24],  which  is  defined  by  rolling  a  probe  sphere  with 
the  size  of  a  solvent  molecule  over  a  set  of  overlapping  spheres  centred  on  the 
solute’s  atomic  sites.  If  the  atomic  spheres  are  given  the  atomic  van  der  Waals  radii, 
Connolly’s  “van  der  Waals  surface”  is  defined  by  the  “contact  points”  between 
probe  and  atomic  spheres.  The  suiface  traced  by  the  center  of  the  probe  is  usually 
called  the  “solvent  accessible’'  surface. 

We  want  the  bounda'y  first  to  be  consistent  with  the  nonoverlap  requirement, 
meaning  that  no  significant  part  of  solute’s  (electronic)  charge  is  to  be  found  outside 
the  cavity  ’tegardinj  this,  Miertus  et  al.  [2]  tell  us  that  this  requirement  leads  to 
“unrealistically  small”  reaction  potentials.  Indeed  they  found  with  the  standard 
van  der  Waals  radii  that  a  sizable  amount  of  electronic  density  (for  water  about 
1%)  extends  over  the  boundary.  This  means  that  the  overlap  of  0.05,  mentioned 
above,  is  here  too  large.  To  remedy  this  effect,  a  correction  was  made  for  the  “charge 
leakage,”  and  at  the  same  time  the  atomic  radii  were  scaled  up  by  20%,  to  keep 
the  charge  distribut'on  within  the  cavity  to  within  0.5%.  The  analytic  methods 
employ  a  cavity  with  a  volume  derived  from  the  density  of  the  solute,  which  leads 
to  a  cavity  of  approximately  the  same  size  as  defined  by  the  Connolly  van  der 
Waals  radii  [3,7],  so  that  they  may  suffer  from  the  same  defect.  Choices  published 
so  far  are  all  of  the  type  “van  der  Waals  plus  something”  or  “van  der  Waals  times 
something”  to  arrive  at  “realistic”  solvation  energies. 

At  this  point  we  conclude  that  the  van  der  Waals  surfaces  are  too  close  to  the 
solute  to  be  consistent  with  the  nonoverlap  criterion,  and  we  suggest  increasing  the 
“atomic  spheres”  by  adding  at  least  one  solvent  radius.  In  case  of  a  water  solute  in 
bulk  water,  this  means  that  the  boundary  is  located  7.3  bohr  from  the  center  of 
mass  of  the  sclute.  Already  here  we  note  that  by  this  procedure  the  experimental 
hydration  energy  cannot  be  reproduced. 

Next  we  look  in"'*  the  combined  overlap/ nonlinearity  problem.  Obviously,  no 
source  charge  or  (i»  :.'ced)  dipole  may  ever  be  positioned  precisely  on  the  boundary, 
because  its  reaction  p-  I’ential  will  then  be  infinitely  large.  A  measure  for  the  minimal 
distance  can  be  o'^tained — at  least  for  a  neutral,  polarizable  solute — by  repeating 
Thole’s  reasoning  for  two  polarizabilities. 

The  dielectric  constant  of  the  bulk  is,  via  the  dielectric  susceptibility,  x,  =  (e  - 
1  )/47r,  connected  to  the  polarizability  density,  which  includes,  apart  from  the  elec- 
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tronic  polarization,  vibrational  and  rotational  contributions.  For  polar  solvents  the 
rotational  contribution,  i.e.,  the  effect  of  the  permanent  dipoles  orienting  themselves 
along  the  applied  field,  dominates.  By  simply  multiplying  the  dielectric  susceptibility 
by  the  volume  of  one  solvent  molecule  (assuming  that  the  dielectric  is  homogeneous 
to  a  microscopic  limit),  one  gets  an  effective  molecular  polarizability.  Again  taking 
water  as  an  example,  we  obtain  (from  the  density  and  susceptibility  at  298  K)  an 
effective  polarizability  of  1240  bohr^  per  molecule.  Applying  Thole’s  criterion  for 
the  interacting  polarizabilities  of  the  central  water  (10  bohr^)  and  an  effective 
neighboring  bulk  molecule,  the  minimum  distance  should  be  8.0  bohr,  which  is 
slightly  larger  than  that  derived  from  the  first  criterion.  This  second  criterion  is  not 
always  the  strongest.  For  benzene  in  benzene,  e.g.,  the  first  criterion  (no  overlap) 
would  put  the  boundary  at  12.4  bohr  from  the  center  of  mass,  the  interacting 
polarizabilities  criterion  at  only  7.2  bohr. 

The  last  possible  test  regards  the  linearity  of  the  response  per  se.  If  the  solute’s 
field  strength  is  too  large,  nonlinear  effects  will  give  rise  to  an  enhanced  or  reduced 
polarization  of  the  bulk,  depending  on  the  type  of  effect  [11,  Chap.  7].  For  polar 
solvents,  the  main  nonlinear  effect  is  saturation:  The  molecular  dipoles  become 
more  or  less  fixed  in  space  with  respect  to  the  solute,  strongly  reducing  the  orien¬ 
tational  contribution  to  the  local  polarizability. 

At  the  macroscopic  level  this  can  be  remedied  quite  satisfactorily  by  introducing 
a  correction  function  to  the  dielectric  constant  proportional  to  the  square  of  the 
field  strength,  but  at  microscopic  level  the  dielectric  model  breaks  down  and  at 
least  the  first  solvent  layer  should  be  included  in  a  more  detailed  way  in  the 
description.  From  experiment  it  is  known  that  nonlinear  effects  show  up  at  field 
strengths  of  about  2  10~*  a.u.  ( 10'*  V/cm),  which  is,  for  example,  generated  by 
a  unit  dipole  (2.54  D)  at  100  bohr  in  vacuo!  Considering  again  water  in  water, 
with  a  solute  dipole  moment  equal  to  1.85  D,  the  previous  criteria  look  pallid.  If 
the  boundary  would  be  put  at  the  required  distance  (73  bohr),  many  layers  of 
solvent  water  would  have  to  be  included  to  make  the  model  still  look  like  a 
condensed  phase. 

In  practice,  one  can  probably  do  with  a  smaller  number  of  solvent  layers,  as  the 
collective  field  of  solute  and  solvent  layers  will  fall  off  faster  than  the  field  of  the 
solute  alone.  Insight  in  the  minimum  number  of  solvent  layers  to  be  included  can 
be  gained  from  the  radial  distribution  functions  of  the  solvent.  A  typical  radial 
distribution  for  bulk  water  after  Narten  et  al.  [  25  ]  is  schematically  shown  in  Figure 
1 .  It  is  clear  that  at  least  the  first  two  layers  should  be  considered  in  detail,  since 
the  requirement  of  homogeneity  is  not  fulfilled.  The  problem  of  smoothing  the 
transition  from  explicit  to  continuum  description  of  the  bulk  molecules  has  recently 
been  addressed  by  one  of  us  [26]. 

Computational  Methods 

Our  model  of  the  condensed  phase  [5]  is  illustrated  in  Figure  2.  The  part  of 
interest  is  treated  quantum  mechanically,  and  is  called  the  “quantum  system.”  This 
may  be  the  active  site  of  an  enzyme  or  a  solute  molecule,  or  a  small  cluster  of 
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Figure  I.  O — O  radial  distribution  function  of  bulk  water. 


molecules.  A  number  of  subsystems  may  be  defined,  which  are  treated  in  a  discrete 
classical  way.  The  vacuum  static  potential  of  the  subsystems  is  reproduced  by  an 
expanded  monopole  representation  (usually  charges  at  the  nuclei  of  the  subsystems 
are  sufficient  to  obtain  accurate  fields),  whereas  the  response  potential  is  mediated 
through  (anisotropic)  dipole  polarizabilities  (usually  one  per  subsystem). 

The  “quantum  +  discrete  classical”  system  is  enveloped  by  a  surface,  defining 
the  boundary  between  discrete  and  continuum  systems.  The  system  outside  the 


Figure  2.  General  condensed  phase  ruodei. 
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boundary  is  modeled  by  a  single  response  parameter,  viz.,  the  dielectric  constant. 
The  response  potential  in  the  region  of  the  quantum  system  is  found  by  solving 
Poisson  s  equation  for  the  system  of  interacting  charges  and  polarizabilities  inside 
the  boundary.  The  method  allows  for  an  additional  response  parameter  for  the 
continuum,  the  Debye  screening  length  ,  in  order  to  model  an  ionic  bulk  solution. 

The  free  energy  change  of  solvation  contains  three  contributions  associated,  re¬ 
spectively,  with  the  solute’s  loss  of  tianslational  freedom  [27],  the  creation  of  a 
cavity  in  the  solvent,  and  the  solute/solvent  interactions: 

ACfsolv  =  A(7tr  +  AGcav  +  AG,ni  .  (9) 

We  take 

AG,r  =  A/l„  +  A(pF)«-/?rJln-p+  ij  (10) 

where  the  volumes  V/  and  Vg  are  calculated  from  the  liquid  and  gas  phase  densities, 
respectively,  under  standard  conditions,  and  the  gas  is  assumed  to  obey  the  perfect 
gas  law. 

For  the  cavitation  free  energy  the  semiempirical  method  based  on  a  hard  sphere 
model  due  to  Pierotti  [28]  is  used.  The  expression  for  the  cavitation  free  energy  is 
rooted  in  statistical  mechanics  and  needs  only  solute  and  solvent  diameter  and 
solvent  number  density  as  parameters.  The  molecular  radii  may  be  calculated  from 
density,  reducing  the  number  of  parameters  by  one,  or  fitted  to  experimentally 
determined  cavitation  free  energies: 

AGcav  =  ^0  +  ^I<ri2  +  f 

3 

Ko  =  Jir{-ln(l  -y)  +  l[y/(l  -y)f} 

RT 

Ki  = - {6[;'/(l  -y)]  +  18[y/(l  -y)]^}  +  i^Pa] , 

<^\ 

RT 

=  — {12[y/(l  -y)]+  18[y/(l  -y)]^}  -27rP<r,, 

Ki  =  \-KP,  y  =  ,  (r|2  =  5(t7l  +  (72)  .:  (11) 

oM 

In  ( 1 1 )  (Ti  and  02  aie  the  solvent  and  solute  diameter,  respectively,  P  is  the 
pressure  and  p,  and  M  the  solvent  density  and  molecular  weight,  respectively.  On 
the  beneficiary  side  there  is  the  energy  gained  from  interaction  of  the  solute  with 
the  solvent  molecules,  AGim. 

In  our  implementation  (currently  within  H0ND08  [29]),  the  static  and  response 
potentials  may  be  added  to  the  Hamiltonian  of  the  quantum  system,  giving  access 
to  the  electronic  properties  of  the  quantum  system  in  the  presence  of  the  classical 
discrete  and/or  continuum  environment,  either  in  a  perturbation  treatment,  or 
variationally,  within  the  self-consistent  field  approximation.  For  the  short  range 
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repulsion  energies  between  the  discrete  classical  parts  and  the  quantum  system  we 
used  a  model  adapted  from  the  CHARMM  [30]  programs.  Further  computational 
details  may  be  found  elsewhere  [5,31]. 


Results:  Solvation  (Free)  Energies 


fVater  in  Water 

In  Figure  3  we  have  collected  AG,nt  values  for  water  in  a  dielectric  continuum 
with  c  =  78.5,  for  different  sizes  of  the  cavity. 

Clearly  the  sensitivity  of  the  stabilization  er  ergy  to  the  position  of  the  boundary 
emerges.  At  the  distance  we  feel  the  requirements  of  nonoverlapping  systems  and 
prevention  of  the  polarization  catastrophe  are  met  (7.3  bohr),  the  stabilization 
energy  ( SB 6  kJ  mol"'),  which  reasonably  reflects  the  bulk  contribution,  is  far  too 
small  to  be  considered  as  the  electrostatic  part  of  the  solvation  free  energy. 

As  a  next  step,  the  first  solvation  layer  is  taken  into  account  explicitly  by  our 
classical  model.  The  four  water  molecules  surrounding  the  central  water  are  each 
modeled  by  point  charges  at  the  nuclei  (so  as  to  reproduce  the  vacuum  water  dipole 
moment  [21])  and  a  group  polarizability  (reproducing  the  experimental  water  po¬ 
larizability  [19])  at  the  center  of  charge. 

The  geometrical  arrangement  is  such  that  each  central  water-classical  water  is 
at  the  water  dimer  optimum  distance  and  orientation.  The  various  contributions 
to  the  interaction  energy  are  presented  in  Table  I. 


Figure  3.  Continuum  reaction  field  stabilization  energy  as  a  function  of  water-boundary 
distance  in  the  range  of  van  tier  Waals  ( 3.65  bohr)  to  twice  the  van  der  Waals  radius  ( 7.3 
bohr)  of  water.  The  boundary  was  Connolly’s  van  der  Waals  surface  with  probe  radiu' 
2.64  bohr  and  the  dielectric  constant  was  78.5. 
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Table  I.  Partitioning  of  interaction  energy 
between  a  quantum  mechanically  treated  water* 
and  four  classical  water  molecules.*’ 


Type  of  interaction 

A£,„,  (kJ  mol  ') 

Electrostatic 

-65 

Induction 

-6 

Dispersion* 

-21 

-92 

AGtr  +  A(7cav 

+29 

Total 

-63 

*  DZP  basis  set,  optimized  vacuum  geometry. 

*’  The  water  molecules  are  represented  by  point 
charges  at  the  nuclei  (-0.796  for  O,  to  give  the 
vacuum  dipole  moment)  and  an  anisotropic  group 
polarisability  constructed  from  atomic  polansabil- 
ities. 

'  This  is  an  estimate,  see  Refs.  [20,21,31,32]. 


The  numbers  in  Table  I  are  enthalpies  rather  than  free  energies,  and  therefore  a 
modest  Monte  Carlo  (MC)  sampling  [14]  was  performed  of  the  part  of  phase  space 
connected  with  rotations  of  the  classical  waters.  Translations  were  excluded  at  this 
point  because  we  have  not  yet  developed  a  satisfactory  model  for  the  short-range 
repulsions  consistent  with  the  electrostatic  part  of  our  model.  In  two  further  cal¬ 
culations,  the  bulk  beyond  the  first  solvation  shell  was  added  in  the  dielectric  con¬ 
tinuum  description.  Again,  calculations  were  done  on  the  reference  configuration, 
as  well  as  the  Monte  Carlo  sampling  of  the  rotational  freedom.  The  results  of  all 
these  calculations  are  presented  in  Table  II. 

The  final  result  for  ACzjoiy  in  Table  II  (-48  kJ  mol“')  is  already  close  to  the 
experimental  value  (-42  kJ  moP'),  and  adding  the  translation  freedom  would 
most  likely  bring  the  two  values  closer  together. 


Benzene  in  Benzene,  Hexane,  and  Methanol:  Transfer  Free  Energies 

A  test  of  the  generality  of  the  continuum  model  is  the  transfer  free  energy  of 
benzene,  which  has  been  determined  experimentally  [33].  The  transfer  free  energy 
is  defined  as  the  free  “'.ergy  of  transferring  a  solute  molecule  from  one  solvent  to 
another.  The  solvation  free  energy  of  benzene  in  benzene,  hexane,  and  methanol 
is  calculated  by  the  same  procedure  as  used  for  water  in  water  by  the  continuum 
model  alone.  The  results  are  presented  in  Table  III. 

If  the  boundary  surface  is  Connolly’s  van  der  Waals  surface  of  benzene,  the 
dispersion  interaction  (which  is  the  dominant  contribution)  is  calculated  to  be 
340-390  kJ  mol"'. 

This  is  far  too  large,  probably  due  to  “charge  leakage,”  and  overshooting  the 
solvation  energy  by  a  factor  of  10  for  benzene.  These  problems  are  not  encountered 
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Table  ll.  Total  interaction  energies  between  a  quantum 
mechanically  treated  water  and  four  classical  water 
molecules  for  reference  geometry  and  Monte  Carlo 
sampling  of  rotational  configuration  space  with  and 
without  a  dielectric.* 


Solvent  model 

A£,„,  (kJ  mol"')” 

Four  classical  waters 

Reference  geometry 

-92 

Monte  Carlo  of  rotations 

-71  ±9 

Ref.  geometry  +  dielectric 

-103 

Monte  Carlo  of  rotations 

-77  ±9 

AGk>i,  =  A£,„,  +  29  kJ  mol"' 

•  The  boundary  is  approximately  the  solvent  accessible 
surface  of  the  QM  water  +  four  classical  water  cluster;  the 
dielectric  constant  is  78.5. 

’’  The  dispersion  term  is  assumed  to  be  constant  for  all 
configurations  and  is  not  explicitly  calculated  in  the  Monte 
Carlo  run  for  each  configuration. 


with  the  boundary  surface  suggested  by  us  in  the  third  section,  following  the  no¬ 
overlap  criterion,  taking  both  ".olute  and  solvent  microscopic  structure  into  account. 
On  the  other  hand,  with  the  surface  advocated  by  us,  the  interaction  energy  is  too 
small  to  reproduce  the  experimental  solvation  free  energy,  as  well  as  the  transfer 
free  energy,  even  in  a  qualitative  sense. 


Results:  Electronic  Properties 

Apart  from  energy,  (electronic)  properties  of  the  solutes  are  of  interest.  Indeed, 
the  whole  point  of  maintaining  a  quantum  mechanical  description  of  some  part  of 
the  condensed  phase  is  to  have  access  to  electronic  properties. 


Table  111.  Solvation  and  transfer  free  energies  of  benzene. 


Solvent* 

Solvation  energy 
(kJ  mol"') 

Transfer  energy 
(kJ  mol"') 

Calcd  ExptI 

Calcd 

ExptI 

Benzene 

+27  -31 

+  17 

-5.0 

Hexane 

+30 

+20 

-3.8 

Methanol 

+  10 

0 

0 

*  The  boundary  is  approximately  the  solvent  accessible  surface  of  benzene; 
the  static  and  optical  dielectric  constants  for  benzene,  hexane,  and  methanol 
are  2..t  and  2.253,  1.9  and  1  89,  32.6  and  1.766,  respectively. 
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The  electronic  spectrum  of  water  is  not  particularly  exciting,  but  some  aspects 
will  be  considered  briefly  to  point  out  the  differences  between  the  continuum  model 
and  the  explicit  classical  model  for  the  first  solvation  layer.  Table  IV  contains  the 
shifts  of  the  homo  orbital  energy  (Koopmans’  ionization  energy),  the  dipole  mo¬ 
ment  and  the  polarizability  of  the  quantum  mechanically  treated  water  molecule 
going  from  vacuum  to  solution,  for  the  different  models.  Most  strikingly,  the  HOMO 
is  destabilized  if  the  explicit  water  model  is  used,  and  stabilized  by  about  the  same 
amount  in  the  continuum  model. 

Closer  inspection  of  the  polarizability  shows  that,  although  the  overall  polariz¬ 
ability  is  not  very  sensitive  to  the  solvent  model,  the  individual  contributions  are, 
as  may  be  seen  from  the  oscillator  strengths,  which  for  brevity  here  are  omitted. 
Placing  the  dielectric  at  the  distance  advocated  by  us  leaves  the  properties  of  the 
water  molecule  almost  unchanged. 

The  electronic  spectrum  of  water  is  not  spectacularly  solvent-dependent  and 
therefore  not  a  good  test  for  the  applicability  of  the  continuum  approach  for  elec¬ 
tronic  properties.  More  marked  solvent  effects  are  seen  for  conjugated  systems, 
whose  transitions  often  serve  as  solvent  polarity  scales  [35].  Unfortunately,  these 
systems  are  mostly  too  large  for  easy  quantum  chemical  computations.  A  less  suc¬ 
cessful  solvent  polarity  scale  was  derived  from  the  n  tt*  transition  of  acetone 
[  36  ] ,  which  is  a  tractable  molecule  for  quantum  chemical  computations.  In  he  ne 
the  experimental  transition  energy  is  35.850  cm"' .  The  solvent  effect  on  this  tran¬ 
sition  is  some  2000  cm"'  on  going  from  hexane  to  water. 

The  same  solvent  models  as  for  water  in  water  were  tested,  only  for  the  atomic 
radii  the  van  der  Waals  +  20%  radii  were  chosen,  and  a  larger  number  of  explicit 
water  molecules  were  taken  into  account  ( 17  instead  of  four). 

In  order  to  reflect  the  short  lifetime  of  the  excited  states,  they  were  calculated  in 
the  static  reaction  field  of  the  ground  state,  which  is  not  in  equilibrium  with  the 
excited  state  charge  density.  The  reason  for  this  is  the  slow  relaxation  associated 
with  the  orientational  component  of  the  dielectric  response  [  37,38  ] ,  so  that  the 
excited  state  “sees”  the  polarization  of  the  environment  in  equilibrium  with  the 
ground  state.  The  electronic  part  of  the  dielectric  response  is  much  faster  and  is  in 
equilibrium  with  the  excited  state  density.  This  last  part  is  treated  here  as  a  per¬ 
turbation  (for  both  ground  and  excited  states),  which  usually  makes  little  difference 


Table  IV  Dependence  of  water  properties  on  solvent  model.* 


Solvent  model 

if  (kJ  mol  ')*’ 

Ap  (a.u.) 

Aa  (a.u.’)' 

Dielectric  at  van  der  Waals  distance 

-23 

+0.19 

-0.7 

Four  classical  waters,  reference  geometry 

+19 

Four  class,  waters,  ref.  geom  +  dielectric 

+18 

Dielectric  at  vdW  +  water  distance 

-2 

■v 

•  The  shifts  w.r.t.  the  vacuum  properties  are  shown. 

’’  HOMO  orbital  energy  shift;  this  is  Koopmans’  ionization  energy  shift. 

'  Calculated  by  second-order  perturbation  theory,  with  nonempincal  Unsold  approximation  [34] 
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to  the  interaction  energy.  Work  is  in  progress  to  treat  the  different  frequency  com¬ 
ponents  of  the  dielectric  response  in  a  more  sophisticated  way.  The  results  for  the 
calculated  transition  energies,  as  well  as  the  nuclear  and  electronic  contributions 
to  the  interaction  with  the  nonequilibrium  reaction  field  are  presented  in  Table  V. 
The  difference  between  continuum  approach  and  explicit  treatment  of  solvent  water 
molecules  in  the  first  (and  part  of  the  second)  shell  is  for  acetone  decidedly  more 
marked  than  for  water  in  water.  Although  the  calculated  excitation  energy  shifts 
are  almost  equal  in  both  approaches,  the  causes  are  entirely  different.  With  the 
explicit  water  treatment,  the  nuclei  are  stabilized  more  in  the  static  field  than  are 
the  electrons,  whereas  with  the  dielectric  response  field,  the  reverse  is  found. 

Discussion  and  Conclusions 

Dielectric  continuum  models  have  been  successful  in  calculating  the  interaction 
energy  between  solute  and  solvent  based  on  one  solute  molecule  in  a  dielectric.  It 
has  been  shown  that  the  results  of  the  continuum  models  compare  well  with  results 
obtained  by  models  that  include  a  more  explicit  description  of  the  solvent,  in  MC, 
MD,  and  free  energy  perturbation  (FEP)  calculations,  for  water  as  solvent.  Although 
the  results  may  be  comparable  when  interaction  energy  is  concerned,  the  origin  of 
the  interaction  energy  is  totally  different  in  the  continuum  and  more  explicit  models. 
Modeling  other  solvents  than  water  by  a  dielectric  is  a  rather  unexplored  area  of 
research  [39,40]. 

In  the  dielectric  continuum  approach,  the  interaction  is  solely  due  to  a  linear 
response  of  the  collective  solvent  molecules,  whereas  the  explicit  models  show  clear 
saturation  effects  of  the  first  solvent  layers,  a  fact  that  is  confirmed  experimentally. 
The  molecules  in  the  first  solvation  shell  of  wa‘er  in  water  are  strongly  hydrogen 
bonded  to  the  central  water  and  there  is  a  strong  dipole-dipole  interaction,  keeping 
the  orientation  of  the  molecules  rather  fixed.  This  effect  is  certainly  not  described 
by  the  dielectric  constant,  for  this  behavior  is  strongly  nonlinear. 

The  favorable  interactions  of  water  in  water  thus  find  their  root  in  very  speeific 
dipole-dipole  and  hydrogen  bonding  interactions,  with  electronic  induction  playing 
a  minor  role,  as  can  be  seen  from  Table  I.  The  success  of  the  continuum  model 
must  therefore  be  explained  by  a  fortuitous  choice  of  boundary  regarding  experi¬ 
mental  hydration  energies  only. 


Table  V.  n  v*  transition  energies  of  acetone  m  ractio  and  in  water*  and  interaction  energies  of 
nuclei  and  electrons  with  nonequilibrium  reaction  field  in  the  excited  state. 


Type  and  solvent  model 

u  (n  -»  IT*)  (cm  ') 

A£  (nuc)  (hartree) 

AE  (el)  (hartree) 

Vacuum 

25.321 

Dielectric  at  van  der  Waals  + 

20%  distance 

28.030 

+0.198 

-0.214 

1 7  classical  waters,  reference 

geometry 

28.066 

-0.183 

+0.157 

*  The  standard  sto4-3  IG  basis  was  used  throughout. 
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The  results  for  the  solvation  and  transfer  energies  of  benzene  in  various  solvents 
cleany  show  that  the  dielectric  model  is  not  easily  extendable  to  other  solvents. 
Retaining  the  van  der  Waals  surface  gives  problems  in  the  dispersion  term. 

In  an  attempt  to  extend  the  dielectric  continuum  approach  to  any  solvent,  one 
could  envisage  the  following  procedure  for  each  solvent;  Vary  the  solute-boundary 
distance  to  reproduce  the  experimental  solvation  energy,  and  derive  some  atomic 
radii  for  the  solute  atoms  in  that  solvent.  To  calculate  the  solvation  energy  of  other 
solutes  in  that  solvent,  use  those  solvent  dependent  atomic  radii  as  parameters. 
This  procedure  has  been  very  successful  for  water  [2,4,6],  and  may  be  for  other 
solvents.  Such  a  procedure  must,  however,  be  considered  as  engineering,  and  one 
must  be  very  reluctant  to  attach  any  physical  meaning  to  the  radii  and  to  calculated 
properties  other  than  the  energies. 

Considering  the  electronic  properties  of  water  as  they  change  from  vacuum  to 
solvent,  as  shown  in  Table  IV,  most  striking  are  the  different  HOMO  orbital  energy 
shifts  between  the  explicit  water  model  and  the  continuum  models.  Closer  inspection 
of  the  polarizability  shows  that,  although  the  overall  polarizability  is  not  very  sen¬ 
sitive  to  the  solvent  model,  the  individual  contributions  are.  as  could  be  seen  from 
the  oscillator  strengths.  Placing  the  dielectric  at  the  distance  advocated  by  us,  leaves 
the  properties  of  the  water  molecule  almost  unchanged. 

The  case  of  the  n  -*■  ir*  transition  of  acetone  (Table  V)  shows  that,  although 
overall  energies  may  correspond,  the  partitioning  of  different  contributions  may 
drastically  differ,  giving  rise  to  a  physically  different  interpretation  of  the  causes 
and  effects.  Here,  again,  seme  saturation  in  the  first  shell  sets  up  a  static  field  that 
influences  the  electronic  properties  of  the  ground  and  n  -*■  tt*  state.  This  effect  is 
clearly  reversed  by  a  dielectric  response  only  description. 

Although  the  dielectric  continuum  model  for  water  as  a  solvent  is  satisfactory 
for  hydration  free  energy  calculations,  it  does  not  accurately  describe  the  bulk  solvent 
influence  on  electronic  properties  of  the  solute.  For  this,  a  more  microscopically 
detailed  description  is  required,  to  account  for  nonlinear  effects  in  the  first  solvation 
shell  (s). 

Also,  the  same  procedure  for  modeling  water  is  not  applicable  when  modeling 
other  solvents,  necessitating  a  new  definition  of  the  solute-solvent  boundary  distance 
for  each  solvent.  Rather  than  such  an  ad  hoc  treatment,  we  advocate  the  more 
detailed  description  of  solvent  molecules  in  the  first  solvation  shells.  We  could 
short-ci'-cuit  all  of  our  arguments  for  this  approach  by  looking  at  Figure  1  again, 
and  assume  indeed  that  the  dielectric  constant  can  only  represent  a  homogeneous 
and  structureless  continuum.  This  will  put  directly  the  boundary  where  all  require¬ 
ments  are  fulfilled  at  once,  albeit  that  the  region  where  the  solvent  shows  structure 
must  be  taken  into  account  explicitly. 
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Abstract 

Many-body  perturbation  theory  is  applied  to  the  nitrogen  tnple  bond  for  bond  distances  ranging  from 
the  atomic  regime  to  about  0  6ao  shorter  than  equilibrium.  A  full-optimized  reaction  space  model  is 
used  to  compute  orbital  spaces  with  an  even-tempered  gaussian-type  basis  set  and  also  with  a  nominal 
Bagus-Gilbert  Slater-type  basis  set.  Conservation  of  orbital  angular  momentum  in  the  atomic  regime 
leads  to  perturbative  theory  for  Haitree-Fock  plus  proper  dissociation.  Angular  momentum  conservation 
can  also  be  enforced  with  a  scaled  Slater-Condon  parameter.  Third-order  dissociation  energies  and 
spectroscopic  constants  approach  limits  of  the  chosen  basis  sets.  ©  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

The  methods  of  quantum  field  theory  differ  from  those  of  first-quantized  theory. 
Quasiparticles  and  a  quasiparticle  vacuum  describe  the  single-particle  characteristics 
of  a  many-particle  system.  Many-particle  conservation  theorems  such  as  particle 
number  are  side  conditions  placed  on  observables  and  enter  as  Lagrange  multipliers 
such  as  chemical  potential,  Many-particle  interactions  are  defined  by  Feynman 
diagrams  (FDs).  fds  describe  scattering  events  among  the  quasiparticles.  Correlation 
is  defined  by  fds  that  furnish  an  exact  order-by-order  series  expansion  for  many- 
body  perturbation  theory  (MBPT).  Particles  and  holes  and  the  Hartree-Fock  (hf) 
vacuum  describe  normal  fermion  problems.  They  are  defined  by  a  single-particle 
Schrodinger  equation  with  a  potential  set  up  according  to  Poisson’s  equation  (Har- 
tree  potential )  and  correct  quantum  statistics  (exchange  interaction ) .  Binding  among 
particles  and  holes  is  described  by  paired  quasiparticles  and  the  Bardeen-Cooper- 
Schrieffer  ( BCS)  vacuum.  For  historical  reasons,  paired  quasiparticles  are  associated 
with  superfluid  and  superconducting  fermion  problems;  however,  paired  quasipar- 
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tides  are  also  needed  to  represent  chemical  bonds  in  molecules.  Paired  quasiparticles 
and  side  conditions  for  chemical  problems  may  be  based  on  chemical  intuition 
and  adiabatic  symmetry  correlation.  The  1 -particle  Schrodinger  equation  for  paired 
quasiparticles  includes  a  generalized  hf  potential  which  allows  fractionally  occupied 
1 -particle  levels,  2-particle  binding  interactions  between  particles  and  holes  (pairing 
interactions),  and  terms  from  the  side  conditions  (for  example,  chemical  potential ). 
The  potential  treats  particles  and  holes  associated  with  chemical  bonds  as  paired 
quasiparticles  and  all  others  as  normal  quasiparticles.  Quasiparticles,  quasiparticle 
energies,  and  Lagrange  multipliers  for  side  conditions  are  determined  self-consis- 
tently.  Many-body  corrections  are  FD  for  self-consistent  quasiparticles.  They  are 
computed  with  low-order  MBPT. 

Second-quantized  methods  for  particles  and  holes  can  be  readily  transposed  to 
«-particle  Hilbert  space  because  the  hf  vacuum  is  a  determinant  that  lies  within 
one  such  space.  However,  because  the  Bcs  vacuum  has  components  in  more  than 
one  such  space  (see  the  next  section),  FD  for  paired  quasiparticles  must  be  for¬ 
mulated  in  Fock  space.  As  is  typical  of  the  many-body  problem,  there  is  a  simple 
formulation  based  on  field-theoretic  methods. 

Field-theoretic  methods  for  paired  quasiparticles  were  applied  to  chemical  bonds 
in  a  series  of  papers.  The  Heitler-London  (HL)  model  of  bonding  and  side  conditions 
for  particle-number  conservation  and  proper  dissociation  were  used  to  set  up  the 
model  problems.  Spectroscopic  constants  were  reported  in  Paper  I  [1]  for  H2,  LiH, 
FH,  F2,  and  N2.  Formulae  for  FDs  were  derived  in  Paper  11  [2].  Paper  111  [3] 
reported  ground-state  energy  curves  for  the  four  single-bonded  diatomic  molecules 
H2,  LiH,  FH,  and  F2.  Paper  IV  [4]  analyzed  the  perturbative  corrections  which 
were  the  basis  for  the  energy  curves  reported  in  Paper  111. 

Papers  I-IV  were  concerned  mainly  with  particle-number  conservation.  An  ad¬ 
ditional  side  condition  is  required  for  description  of  multiple  chemical  bonds.  The 
foundations  for  the  present  work  have  been  established  in  earlier  work  [5]  in  which 
a  field-theoretic  model  Hamiltonian  had  been  developed  for  the  proper  dissociation 
of  multiple  bonds,  with  N2  used  as  an  illustration.  Additionally,  the  broad  back¬ 
ground  and  specific  theoretical  treatment  can  be  found  in  Paper  II  [2]  and  the 
detailed  description  and  analysis  of  correlation  energy  terms  can  be  found  in  Paper 
IV  [4]. 

The  present  work  reports  calculations  for  N2  where  the  side  condition  is  conser¬ 
vation  of  orbital  angular  momentum  along  the  transition  from  N2(A^'2^)  to  two 
N(^S)  atoms.  Additional  conditions  needed  to  describe  the  multiple  bonding  struc¬ 
ture  of  N2  are  described  in  the  next  section.  The  computational  methods  and  details 
are  presented  in  the  third  section.  The  perturbative  results  for  N2  and  a  discussion 
of  their  accuracy  are  given  in  the  fourth  section. 

Perturbation  Theory  for  the  N2  Ground  State 

The  foundations  for  the  present  work  have  been  established  in  earlier  work  [5] 
and  Paper  II  [2].  Additional  conditions  needed  to  describe  the  multiple  bonding 
structure  of  N2  are  described  in  this  section. 
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Quasipariiclc's 

Fractionally  occupied  fuil-optimizcd  rcaction-spacc  (fors)  orbitals  [6-8]  which 
become  degenerate  at  long  bond  distances  arc  paired  levels.  They  arc  in  l-to-1 
correspondence  with  filled  (hole)  and  empty  (particle)  levels  of  the  normal  ni' 
determinant. 

Paired  levels  for 


Hole 

Particle 

II 

Zu  —  3<rj/ 

X/i  —  1  ITxu 

II 

II 

Pairing  removes  degeneracy  at  long  bond  distances  by  diagonalizing  a  model  Ham- 
iltonian.  A  special  Bogoliubov-Valatin  (BV)  transformation  is  the  transformation 
to  the  diagonal  representation.  The  bv  transformation  coefficients  may  be  deter¬ 
mined  by.  solving  the  HL  problem  in  the  fors.  In  that  case,  and  with  1  standing 
for  th  and  2  standing  for  ia  for  i  -  x,  y,  z,  the  bv  transformation  is  the  same  as 
Eq.  (3)  of  Paper  IV.  Overlapping  orbitals,  normalization  conditions,  the  particle- 
number  conservation  condition,  and  phase  choices  for  the  bv  transformation  in 
the  FORS  are  the  .same  as  Eqs.  (4)-(7),  respectively,  of  Paper  IV. 

Equation  (4)  of  Paper  IV  in  the  fors  is  a  “pairing”  transformation  (9,  Eq. 
(3.42)].  Second-quantized  operators  for  overlapping  orbitals  are  defined  by  the 
same  transformation 

Cf,  =  t^rA  +  A. .  (2) 

and  satisfy  anticommutation  conditions  like  those  for  overlapping  atomic  orbitals 
llO,Eq.(9.3)]: 

CrA  +  Cf,cf,  =  5f,  St  =  2vi-l.  (3) 

Sf  is  the  inner  product  or  overlap  in  the  fors  defined  by  Eq.  (4)  of  Paper  IV  and 
is  a  type  of  “paired"  inner  product  [9,  Eq.  (3.44)],  ik  is  an  annihilation  operator 
for  the  FORS  orbital  <t>k.  At  long  bond  distances,  the  fors  becomes  the  atomic  HF 
space  for  2N(^S),  and  Cf,  and  Cf,  become  annihilation  operators  for  atomic  HF  2pf 
orbitals  on  the  atoms  N/  and  N,^,  respectively.  In  the  molecular  regime,  Cf,  and  Cf, 
are  annihilation  operators  for  paired  vestiges  of  atomic  2p  orbitals. 

Unperlurhed  Ground  Stale 

Equation  (9)  of  Paper  IV  becomes  a  product  of  operators  for  tors  subshclls 
which  correspond  tr  unpaired  holes  20^,  \au,  lou’. 

K  n  n 

p-k.ii  k’-l 


(4) 
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k(^)  stands  foi  spin  2  )•  Equation  (8)  of  Paper  IV  becomes 


r  1  1*/^ 

L“fi  + 


(5) 

(6) 

(7) 


Equations  (10)  and  (13)  of  Paper  IV  become 


\^X)  =  n  MfKI  >, 

t=x.y.: 

(8) 

)  =  0  for  filled  levels , 

(9) 

otk  1  ^jC  )  =  0  for  paired  levels  , 

(10) 

1  ^jC  )  =  0  for  empty  levels  , 

(11) 

while  Eqs.  (17)  and  (18)  become 

|HL>=  n  BfKI  >. 

(12) 

1  HL>  is  the  HL  Ansatz  in  the  fors.  It  is  a  sum  of  eight  closed-shell  determinants. 
|.^X)  is  the  corresponding  BCS  Ansatz  in  the  FORS.  It  is  a  sum  of  64  closed-shell 

determinants. 

Unperturbed  excited  states  are  defined  in  §2.4  of  Paper  11  and  §2.1.2  of  Paper 
IV.  Excited  states  with  manifestly  incorrect  particle-number  are  not  allowed.  Spu¬ 
rious  excited  states  are  eliminated. 


Model  Hamiltonian 

In  the  FORS,  Eq.  (23)  of  Paper  IV  generalizes  to 
H  =  H  - 

=  xr'(N  -  6)  +  |xr’(N  -  6)2 

=  Lr'  +  Lr’  +  x!.'’^’,  (13) 

£|AM1  =  +  Z2) 

=  £2'''^'  -I- 

H  is  the  ordinary  Hamiltonian.  The  L  operator  (L'^'^')  enforces  particle- 
number  (atomic-fragment  angular  momentum)  symmetry  for  the  fractionally  oc¬ 
cupied  shells.  N  is  the  number^perator  [10,  Eq.(8.7)]  for  the  fractionally  occupied 
shells.  At  long  bond  distances,  L/,  is  the  second-quantized  orbital  angular  momentum 
operator  for  the  2p  shell  on  nitrogen  jtom  p^xpressed  in  a  real  cartesian  basis  [10, 
Eqs.  (8.7)  and  (9.19)].  In  general,  L/  and  Z.,  are  vestiges  of  total  orbital  angular 
momentum  in  the  paired  bases  Cf,,  Cf,  [Eq.  (2)]. 
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Parameters 


1.2'’'^'  and  are  generalizations  of  the  operators  discussed  in  §2.2  of 
Paper  IV.  X  parameters  depend  on  the  fors  but  not  on  a  specific  orbital  basis  for 
the  FORS.  Equation  (35)  of  Paper  IV  is  used  to  calculate  Xo*’’^’.  The  gap  equation 
is  used  to  calculate  “chemical  potential”  fll,  Eq.  (62)].  As  functions  of  bond 
distance,  parameters  for  N2  appear  similar  to  those  for  F2  (see  Fig.  4  of  Paper  IV ). 

Formulae  for  normally  ordered  orbital  angular  momentum  operators  of  Eq.  ( 1 3 ) 
have  not  been  reported  previously: 

,  (14) 

Z.r’  =  xr'  2  2  2 

k-b.a  {-x,y,z  (15) 

I  ^  ^  •XT 

’  *  ft  m,  •/!  > 


f  [AM]  _  .lAM] 
1-7  -  A2 


2  2  2  (■  • 

k.k'‘h.a  (16) 


(ij  +  Vr)  =  2  2  2  hhl-hf^  2  2  M^ft%-  +  XftXfi.) 

k^bkO  \  k'»b,a 


-2  2  2  *?;.) 

\ 

k'~bja  fVf  / 

/-(2.0)_  u 

ftm.  “  "ft 

2  2 

k*~b^a 


(17) 

(18) 
(19) 


Normal  ordering  with  respect  to  [12,  §  1 1.4]  is  denoted  by  :  •  •  •  :,  where 
•  •  •  stands  for  a  product  of  second-quantized  operators,  and  Xt^m.  are  occupation 
and  pairing  numbers,  respectively,  for  the  fractionally  occupied  subshells  of  the 
FORS.  They  are  defined  as  in  §2.1.3  of  Paper  IV, 

The  FORS  at  infinity  connects  to  the  hf  model  of  the  atoms.  The  simplest  mo¬ 
lecular  model  that  properly  connects  to  the  atoms  is  used  to  enforce  angular  mo¬ 
mentum  conservation  for  the  atoms.  This  model  is  known  as  hfpd  for  hf  plus 
proper  dissociation  [13].  Enforcing  the  connection  yields  conditions  for  , 

Xo  ^  =  .E^HL  “  f^HFPD  ,  (20) 

and  the  unperturbed  energy, 

<HL|H|HL>  =  <,rX|H|^.£>  =  £HFPD-  (21) 

£hfpd  is  computed  with  the  hfpd  Ansatz  [  1 3,  Table  I  ]  in  the  FORS .  In  the  language 
of  FD,  hfpd  sums  to  every  order  all  diagrams  needed  to  enforce  angular  momentum 
symmetry  for  2N(^S)  relative  to  hl.  All  orders  are  needed  because  symmetries 
cannot  be  enforced  by  finite-order  perturbation  theory  [14,  p.  431]. 

Xo**^’  varies  between  at  long  bond  distances  and  (approximately)  zero,  where 

is  the  Slater-Condon  electron-repulsion  integral  for  the  2p  subshell  of  nitrogen 
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atom.  This  is  expected  because  orbital  angular  momentum  symmetry  is  quenched 
in  the  molecular  regime.  Association  of  with  may  be  treated  as  a  condition 
for  model  building: 

lim  4  •  (22) 

In  the  latter  case,  quenching  of  orbital  momentum  is  approximated  by  mul¬ 
tiplied  times  the  occupation  number  of  the  1  s-o  natural  orbital. 

X{,^«1(F^)  =  2«,,^F^  (23) 

(The  lira  natural  orbital  is  chosen  because  it  maintains  2p  character  without  ap¬ 
preciable  hybridization.) 

F^  can  be  assigned  its  value  at  F  -*■  oo  (F^)  for  all  R  values,  or  can  be  computed 
for  each  R  value  as  an  appropriately  weighted  average  of  integrals  involving  only 
fractionally  occupied  levels  ( F^ ) .  If  Fhl  is  given,  Eq.  ( 20 )  yields  two  corresponding 
approximations  to  Fhfpd  (Ff^  and  Ei\,  respectively).  Perturbative  corrections  can 
be  computed  for  Ef^  and  Epi- 

Perturbative  Corrections 

One-body  terms  of  the  model  Hamiltonian  are  identified  by  the  normally  ordered 
expansion  of  H  in  the  canonical  representation.  This  is  described  in  Paper  II.  The 
unperturbed  Hamiltonian  is  defined  by  Eq.  (39)  of  Paper  IV.  The  diagonalization 
condition  for  the  generalized  Fock  operator  (self-consistent  energy)  placed  on  Eq. 
(39)  of  Paper  IV  is  also  enforced  here.  This  diagonalization  condition  uniquely 
defines  a  canonical  FORS  representation.  It  does  not  mix filled,  fractionally  occupied, 
or  empty  spaces  defined  by  the  fors. 

The  perturbation  is  the  two-body  operator  in  the  canonical  representation.  It  is 
defined  in  Paper  II  and  §3  of  Paper  IV.  Nonbubble  diagrams  are  calculated  with 
the  formulae  of  Paper  II.  Formulae  for  bubble  diagrams  are  reported  elsewhere 
[15,  16]. 

Diagrammatic  overcount  for  hfpd  and  F^  models  is  the  same  as  for  hl.  Over¬ 
count  for  HL  is  described  in  Papers  II  and  IV  of  this  series,  and  is  the  same  for  Nj 
with  the  understanding  that  three  sets  of  paired  levels  must  be  counted,  fds  are 
then  counted  in  this  work  as  in  Papers  II  and  IV. 

Computational  Details 

Basis  Sets,  fors  and  Software 

Two  basis  sets  were  selected  to  test  the  model.  An  even-tempered  gaussian  con¬ 
traction  (ETGC)  ( 10s6p2dlf)/[4s3p2dlf]  [17-19]  and  a  polarized  (4s3pldlf) 
nominal  Bagus-Gilbert  basis  of  Slater-type  atomic  functions  (STF)  [20]  were  taken 
from  the  literature.  The  etgc  s  and  p  sets  were  augmented  with  the  most  diffuse 
primitives.  The  basis  sets  are  capable  of  approximating  the  dissociation  energy  of 
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nitrogen  to  within  about  0.5  eV  [21],  This  error  is  about  six  times  as  large  as  that 
for  the  total  energy  ofHe  in  a  comparable  basis  [22.  Table  l,p.  1 1].  Errors  associated 
with  the  basic  two-body  interaction  scale  upward  to  the  error  in  dissociation  energy 
for  Nj  because  errors  in  the  six  two-body  bonding  interactions  make  no  contributions 
to  N  atoms. 

Integrals,  4-index  transformations,  and  FORS  calculations  were  carried  out  with 
ALIS  [17]  and  ALCHEMY2  [23]  for  the  ETGC  and  STF  basis  sets,  respectively.  In 
each  case,  the  FORS  is  defined  by  completely  filled  shells  \(jg,  log,  Icu,  2<r„,  frac¬ 
tionally  occupied  shells  3<Tg,  3<ru,  l5r„,  Ing,  and  empty  shells  all  others  allowed  by 
the  basis. 

ALIS  was  supplemented  by  BIGGMOLl  [24]  f-orbital  integral  programs,  square- 
canonical  and  symmetry  supermatrix  ordering  programs,  and  symmetry-adapted 
self-consistent-field  and  4-index  transformation  code,  all  provided  by  Dr.  R.  C. 
Raffenetti.  ALCHEMY2  was  supplemented  by  software  for  a  real  transformation 
[25]  and  generalized  valence  bond  diagonalization  [26]  and  4-index  transformation 
[27]  codes.  Except  for  the  f-orbital  integral  code,  the  supplements  were  required 
for  preprocessing. 

Accuracy  of  FORS  results  is  established  by  comparison  with  exact  FORS  results 
[28,  numerical  MCSCF  in  Table  1].  Spectroscopic  constants  are  compared  in  Table 


Table  I.  fobs  and  perturbative  spectroscopic  constants  for  A'j(.V'2^).  hfto  was  computed  with  etgc. 
and  Fg  above  (below)  hfpd  were  computed  with  ETOCfSTF). 


Model 

D. 

(eV) 

R, 

(A) 

(cm"') 

(cm"') 

Br 

(cm'‘) 

(cm'') 

De 

(cm''  X  10") 

FORS 

ETGC 

8.79 

1.103 

2337 

14.1 

1.963 

0.017 

5.5 

STF 

8.69 

1.103 

2341 

14.1 

1.965 

0.017 

5.5 

limit* 

8.911 

1.102 

2353 

14.1 

f 

0 

F^ 

7.26 

1.098 

2337 

17.1 

1.983 

0.019 

5.7 

i 

r 

F% 

7.25 

1.097 

2344 

17.0 

1.985 

0.019 

5.7 

r 

d 

HFPD 

7.24 

1.096 

2363 

16.3 

1.989 

0.018 

5.6 

s 

e 

F^ 

7.19 

1.097 

2342 

17.0 

1  485 

0.019 

5.7 

t 

r 

F% 

7.16 

1.097 

2349 

16.9 

1.987 

0.019 

5.7 

s 

0 

F^ 

12.34 

1.074 

2610 

13.4 

2.072 

0.015 

5.2 

e 

r 

F% 

12.33 

1.074 

2616 

13.4 

2.074 

0.015 

5.2 

c 

d 

HFPD 

12.33 

1.072 

2633 

12.9 

2.078 

0.015 

5.2 

n 

c 

F^ 

12.19 

1.075 

2604 

13.2 

2.067 

0.015 

5.2 

d 

r 

F% 

12.16 

1.075 

2610 

13.2 

2.069 

0.015 

5.2 

t 

0 

F^ 

9.54 

1.083 

2457 

15.8 

2.039 

0.018 

5.6 

h 

r 

Fi 

9.53 

1.082 

2463 

15.7 

2.041 

0.018 

5.6 

i 

d 

HFPD 

9.53 

1.082 

2476 

I5.I 

2.043 

0.017 

5.5 

r 

c 

F^ 

9.33 

1.084 

2443 

16.0 

2.034 

0.018 

5.6 

d 

r 

Fl 

9.31 

1.083 

2450 

16.0 

2.036 

0.018 

5.6 

[281. 
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I.  The  unperturbed  ground  state  is  accurately  represented  in  the  etgc  and  stf 
bases.  The  unperturbed  excited  states  are  probably  equally  accurately  represented. 
Nevertheless,  small  errors  accumulate  in  the  dissociation  energy  because  of  the 
multiple  bond. 

FDs  calculated  through  third  order  in  the  stf  basis  with  closed-shell  single-de- 
ten  linant  code  [29-31]  require  about  one  IBM  3090  CPU  minute  for  each  value 
of  the  bond  distance,  fds  calculated  through  third  order  in  the  STF  (etgc)  basis 
with  general  block-driven  code  require  about  12  (26)  Convex  C220  CPU  s. 

Spectroscopic  Constants 

Dunham’s  method  [32]  was  used  to  calculate  spectroscopic  constants  [33]. 
Polynomial  of  degree  9  were  fitted.  The  grid  has  22  points  and  is  shown  in  Table 

II.  Step  sizes  around  the  minimum  intemuclear  distance  are  0.05flo-  Uncertainties 
in  equilibrium  bond  distance  due  to  fitting  the  calculated  energies  are  of  order 
lO.OOOSoo  [34  ■’''].  Dissociation  energies  (£>,)  were  computed  with  Eq.  (2)  of 
Paper  III. 


Results 


Perturbative  Calculations 

Perturbative  calculations  have  been  performed  corresponding  to  the  three  models 
described  in  the  second  section.  'Vhere  the  model  building  for  uses  eval¬ 
uated  at  the  separated  atom  limit,  the  results  are  designated  F^.  If  the  hfpd  value 
is  used  as  indicated  in  Eq.  (20),  the  results  are  designated  HFPD.  IfF^  is  evaluated 
at  each  R  value,  then  the  results  are  designated  F%.  The  calculations  for  F^  and 
Fk  were  performed  using  ETGC  and  STF  basis  sets.  The  HFPD  has  been  calculated 
using  ETGC.  The  values  can  be  compared  in  Table  11  (total  energies).  Figure  1 
(binding  energies),  and  Table  1  (spectroscopic  constants).  Agreement  is  excellent. 
Curves  for  model  fd  and  perturbative  corrections  for  N2  appear  overall  the  same 
as  those  shown  for  F2  on  Figures  5-7  of  Paper  IV. 

Accuracy  of  the  Perturbative  Results 

Second-order  accuracy  is  not  sufficient  over  the  range  of  spectroscopic  properties, 
as  demonstrated  in  Table  I.  In  Table  III,  spectroscopic  constants  are  reported  for 
models  of  Ni  near  limits  of  comparably  chosen  basis  sets.  By  third  erder,  De,  /?<., 
and  Wf  differ  by  about  0.1  eV,  -0.02  A  and  130  cm~',  respectively,  from  the  afore¬ 
mentioned  limits.  In  larger  basis  sets  such  as  those  of  Almldf  et  al.  [  2 1  ] ,  it  is  probable 
that  third-order  dissociation  energies  will  fall  within  0.1  eV  of  experiment.  (Un¬ 
fortunately,  we  cannot  preprocess  such  extensive  calculations  at  present.)  The  con¬ 
clusion  is  that  third-order  accuracy  is  useful  for  many  purposes.  In  addition,  the 
third-order  hfpd  value  for  £»<.  (in  Table  1)  is  comparable  to  the  Z),  obtained  from 
the  fourth-order  calculation  MP4  (in  Table  III),  where  the  mp4  is  derived  from  a 
single-determinantal  zeroth-order  expansion. 
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Table  II  hfpd  total  energies  and  energy  diflerences  of  f  ^  and  F\  models  relative  to  hfpd  (etgc 
basis).  To  compute  model  energies,  subtract  energy  differences  from  hfpd.  Units  are  hartrces. 


R(aof 

First  order 

Third  order* 

HFPD 

Fl 

HFPD 

F^ 

n 

1.40 

-108.15866 

-0.00098 

-0.00094 

-108.40314 

-0.00027 

-0.00025 

1.50 

-108.49987 

-0.00103 

-0.00103 

-108.73628 

-0.00032 

-0.00032 

1.60 

-108.72982 

-0.00104 

-0.00110 

-108.95915 

-0.00039 

-0.00041 

1.70 

-108.87992 

-0.00098 

-0.001 1 1 

-109.10300 

-0.00045 

-0.00051 

1.75 

-108.93240 

-0.00094 

-0.001 1 1 

-109.15263 

-0.00046 

-0.00055 

1.80 

-108.97288 

-0.00086 

-0.00109 

-109.19041 

-0.00046 

-0.00058 

1.85 

-109.00324 

-0.00076 

-0.00104 

-109.21821 

-0.00044 

-0.00060 

1.90 

-109.02509 

-0.00064 

-0.00098 

-109.23762 

-0.00039 

-0.00060 

1.95 

-109.03981 

-0.00049 

-0.00090 

-109.25000 

-0.00032 

-0.00058 

2.00 

-109.04859 

-0.00031 

-0.00079 

-109.25655 

-0.00021 

-0.00054 

2.05 

-109.05238 

-0  00008 

-0.00064 

-109.25812 

-0.00006 

-0.00045 

2.10 

-109.05216 

0.00012 

-0.00051 

-109.25581 

0.00009 

-0.00038 

2.15 

-109.04859 

0.00039 

-0.00033 

-109.25017 

0.00030 

-0.00025 

2.20 

-109.04237 

0.00070 

-0.0001 1 

-109.24195 

0.00055 

-0.00009 

2.25 

-109.03401 

0.00104 

0.00014 

-109.23160 

0.00085 

0.00011 

2.30 

-109.02401 

0.00143 

0.00043 

-109.21965 

0.00120 

0.00036 

2.35 

-109.01279 

0.00184 

0.00074 

-109.20652 

0.00157 

0.00063 

2.40 

-109.00069 

000230 

0.001 10 

-109.19254 

0.00200 

O.OOO^S 

2.50 

-108.97506 

0.00328 

0.00189 

-109.16325 

0.00295 

0.00170 

2.60 

-108.94897 

0.00439 

0.00281 

-109.13359 

0.00403 

0.00258 

2.70 

-108.92385 

0.00552 

0.00379 

-109.10498 

0.00518 

0.00355 

2.80 

-108.90056 

0.00664 

0.00479 

-109.07824 

0.00631 

0.00455 

2.90 

-108.87967 

0.00766 

0.00573 

-109.05392 

0.00736 

0.00551 

3.00 

-108.86144 

0.00846 

0.00652 

-109.03227 

0.00822 

0.00633 

3.10 

-108.84593 

0.00909 

0.00716 

-109.01343 

0.00888 

0.00699 

3.20 

-108.83300 

0.00945 

0.00758 

-108.99725 

0.00930 

0.00746 

3  30 

-108.82240 

0.00956 

0.00779 

-108.98351 

0.00946 

0.00771 

3.40 

-180.81383 

0.00943 

0.00778 

-108.97195 

0.0093" 

0.00774 

3.50 

-108.80701 

0.00905 

0.00757 

-108.96229 

0.00903 

0.00755 

3.60 

-108.80167 

0.00846 

0.00714 

-108.95425 

0.00845 

0.00713 

3.70 

-108.79754 

0.00768 

0.00653 

-108.94755 

0.00768 

0.00653 

3.80 

-108.79442 

0.00673 

0.00575 

-108.94197 

0.00675 

0.00576 

4.00 

-108.79038 

0.00461 

0.90390 

-108.93342 

0.00461 

0.00390 

4  50 

-108.78702 

-0.00015 

-0.0004-' 

-108.92129 

-0.00015 

-0.00044 

5.00 

-108.78666 

-0.00235 

-0.00246 

-108.91553 

-0.00234 

-0.00245 

5.50 

-108.78666 

-0.00269 

-0.00271 

-108.91240 

-0.00269 

-0.00271 

600 

-108.78665 

-0  00227 

-0.00226 

-108.91053 

-0.00227 

-0.00226 

6.50 

-108.78662 

-0.00168 

-0.00166 

-108.90936 

-0.00167 

-0.00166 

7.00 

-108.78659 

-0.00116 

-0.00115 

-108.90862 

-0.00116 

-0.00115 

7.50 

-108.78656 

-0.00078 

-0.00077 

-108.90812 

-0.00078 

-0.00076 

Second-order  shows  the  same  behavior. 

The  gnd  l.bOoo  s  R  s  S.OOoo  was  used  to  fit  the  energy  to  a  polynomial  for  Dunham  analysis. 
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Figure  1.  Experimental  and  ETGC  third-order  binding-energy  curves  for  N2(A''S^). 


The  enforcement  of  angular  momentum  conservation  for  Nz  in  Paper  I  was 
based  on  Eqs.  (14)  and  (23),  with  Eqs.  (15)  and  (16)  neglected.  At  third  order, 
this  approximation  yields  Z),  about  0.4  eV  above  the  limit  of  the  chosen  basis.  This 
accuracy  is  insufficient  for  most  purposes.  It  is  noteworthy  that  inclusion  of  the 
terms  in  Eqs.  (15)  and  (16)  enable  the  attainment  of  the  limit  in  the  chosen  basis 
set  from  the  perturbative  approach.  This  level  of  accuracy  extends  to  £),  as  well,  as 
seen  in  Table  III. 

R,  and  w,  at  third  order  differ  from  the  achievable  limits  a  bit  more  than  might 
have  been  guessed.  This  is  also  true  of  Fz  and  FH  in  Paper  III.  Perhaps  it  is  true 
whenever  second-order  corrections  are  “large.” 

HL  models  for  Fz  and  FH  are  the  same  as  hfpd  models.  Supplementary  config¬ 
urations  needed  for  a  more  e.\tensive  unperturbed  model  are  of  the  “split-shells” 
type.  “Split  shells”  for  Fz  and  FH  require  solving  non-HL  problems  to  determine 
BV  transformation  coefficients  in  the  FORS  (sec  §3.3-3.4  of  Paper  III  and  references 
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Table  III.  Comparison  of  third-order  spectroscopic  constants  for  N^X''Zf)  with  accurate  theoretical 

models  in  similar  basis  sets. 


Model 

D, 

(eV) 

Rr 

(A) 

OJ, 

(cm*') 

OJA, 

(cm"') 

Basis  set 

Contracted  ci* 

9.46 

l.lOt 

2345 

(1  Is7p2dlf)/[6s5p2dlfl 

mp4'’ 

9.60 

6-31  lG(2df,p) 

MRCI  +  O' 

9.31 

(13s8p6d4f)/[4s3p2dlf]  GC 

9.54 

1.083 

2457 

15.8 

ETGftpresent) 

ci-sdq" 

9.34-9.41' 

1.095 

2416 

12.0 

(6s4p3d2f )  STF 

F^ 

9.33 

1.084 

2443 

16.0 

STF(piesent) 

Expt^ 

9.91 

1.098 

2359 

14.3 

•[36]. 

"137]. 

'[21]. 

■>[38]. 

'  [39];  (5s4p3d)  stf  vci  +  first  and  second  orders. 
'[40]. 


therein),  whereas  angular  momentum  conservation  includes  available  “split  shells” 
in  N2.  Perhaps  this  explains  why  third  order  based  on  the  Bv  transformation  de¬ 
termined  by  the  HL  problem  is  generally  more  accurate  for  N2  than  for  either  F2 
or  FH. 
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Abstract 

Results  of  an  ab  initio  SCF  (4-31G)  study  of  the  potential  energy  surface  of  4-aminobutanol  are 
reported.  Four  different  intramolecular  hydrogen  bonds  are  present  in  the  various  local  minima:  N  •  •  ■ 
H — O.  N — H  •  •  •  O,  C4 — H  •  •  •  O.  N  •  •  •  H — Ci .  These  interactions  are  discussed  and  compared  with 
those  present  in  the  homologues  3-aminopropanol  and  2-aminoethanol.  ©  1993  John  Wiley  &  Sons.  Inc 


Introduction 

w-Amino-«-alkanoles  form  an  interesting  class  of  compounds,  because  the 
chemical  importance  of  its  members  covers  the  whole  range  of  the  traditional 
branches  of  chemistry.  2-Aminoethanol  has  a  biological  function  in  the  most  im¬ 
portant  phosphatides,  which  play  an  important  role  in  cell  membranes  [1-4],  of 
blood  [5,6],  brain  [7-9],  heart  [10],  and  liver  [1 !].  Both  2-aminoethpnol  and  3- 
aminopropanol  may  act  as  inhibitors  in  various  surface  [12,13]  and  enzyme  re¬ 
actions  [  14]  by  forming  complexes  with  one  of  the  hetero  atoms  N  and  O.  Complex 
formation  of  2-aminoethanol  and  3-aminopropanol  with  Cu(Il)  [15-17],  Ni(II) 
[18,19],  aluminum  oxides  [20,21],  Ti02  [22],  and  barium  aryl  oxides  [23]  has 
been  investigated  over  years.  Both  2-aminoethanol  and  3-aminopropanol  were  also 
studied  as  part  of  Schiff  base  complexes  [  1 9, 1 6  ] .  Except  for  studies  of  the  dehydration 
and  deamination  [24-27]  and  of  the  silver  complexes  [28]  of  all  amino  alcohols 
up  to  five  carbon  atoms,  4-aminobutanol  has  appeared  in  the  literature  in  the  last 
years  as  part  of  antitumor  dna  intercalators  [29]. 

We  became  interested  in  w-amino-«-alkanoles  as  part  of  an  ab  initio  investigation 
of  amino  acids  and  related  compounds.  2-Aminoethanol  has  been  the  target  of  a 
number  of  quantum  chemical  calculations  in  the  past  [  30-34  ] .  The  potential  energy 
surfaces  (pes)  of  3-aminopropanol  and  4-aminobutanol  were  characterized  in  our 
group.  The  results  for  3-aminopropanol  [35,36]  and  the  three  4-aminobutanol 
conformers  of  lowest  energy  have  been  reported  already  [34]..  It  is  the  purpose  of 
this  contribution  to  give  an  overview  over  all  local  minima  in  the  PES  of  4-anii- 
nobutanol,  to  discuss  selected  details,  and  to  compare  4-aminobutanol  with  its 
homologues  3-aminopropanol  and  2-aminoethanol. 
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The  4-3 IG  basis  set  [37]  and  the  RHF  formalism  [38]  were  used  in  all  calculations 
of  this  study  for  two  main  reasons.  First,  optimized  geometries  obtained  in  this 
manner  correlate  well  with  experimentally  determined  structures  for  2-aminoethanol 
and  3-aminopropanol.  Second,  the  same  method  and  basis  set  have  been  employed 
by  our  group  in  the  past;  hence  the  results  for  4-aminobutanol  can  be  compared 
with  those  of  past  work  at  one  consistent  level.  Calculations  were  performed  with 
the  program  G AMESS  [  39  ] .  All  local  minima  were  fully  optimized  to  a  rms  gradient 
less  than  1.6  X  10'“*  H/Bohr  or  H/ rad  and  verified  to  have  positive  eigenvalues  of 
the  Hessian  matrix  only. 


Local  Minima 

One  hundred  ten  symmetry  unique  local  minima  were  located  in  the  potential 
energy  surface  of  4-aminobutanol.  In  accordance  with  earlier  work,  a  positive  torsion 
angle  N — C — C — C  was  chosen  as  the  criterion  for  symmetry  uniqueness.  Figure 
1  displays  the  relative  energy  of  all  of  these  minima  as  a  function  of  the  dihedral 
angles  of  the  molecular  backbone.  Figure  2  shows  the  various  orientations  of  the 
amino  group.  These  displays  show  that  with  one  exception  three  standard  positions 
per  internal  rotation  may  be  distinguished.  This  allows  an  extended  version  of  the 
nomenclature  used  by  Radom  et  al.  in  previous  work  on  2-aminoethanol  [30]: 
each  local  minimum  will  be  denoted  by  a  five  letter  symbol  vlVXYz,  in  which  v 
describes  the  orientation  of  the  amino  group  via  the  torsion  angle  between  the  lone 
electron  pair  and  the  carbon  chain,  IV  gives  the  N — C — C — C  torsion  angle,  X 
gives  the  C — C — C — C  torsion  angle,  Y  indicates  the  C — C — C — O  torsion  angle, 
and  r  gives  the  C — C — O — H  torsion  angle.  G  and  g  denote  torsion  angles  around 
+60°,  G'  and  g'  denote  torsion  angles  around  -60°,  and  T  and  t  denote  torsion 
angles  around  180°.  The  C — C — O — H  angle  close  to  0°,  which  occurs  in  one 
conformer,  will  be  denoted  The  global  minimum,  e.g.,  for  which  the  dihedral 
angles  are  H,— N— C— C  =  59.2°,  H2— N— C— C  =  -171.4°,  N— C— C— C  = 
76.8°,  C— C— C— C  =  -71.5°,  C— C— C— O  =  73.3°,  and  H— O— C— C  = 
-61.6°,  is  thus  labeled  g'GG'Gg'. 

Table  I  lists  the  labels  of  all  symmetry  unique  local  minima  together  with  relative 
energies  (En-i  =  0  corresponds  to  -286.755020  a.u.),  vibrational  zero  point  energies, 
and  rotation  constants.  The  detailed  geometry  data  (bond  lengths,  valence,  and 
torsion  angles)  and  the  Cartesian  coordinates  are  available  from  the  authors  or  by 
electronic  mail  (F5351DAA@AW1UNI1 1.BITNET). 

Four  different  intramolecular  hydrogen  bonds,  N  •  •  •  H — O,  N — H  •  •  •  O, 
N  *  •  •  H — C| ,  and  C4 — H  •  •  •  O,  occur  in  a  number  of  4-aminobutanol  conformers. 
The  first  of  these,  N  •  •  •  H — O,  is  much  stronger  than  the  others.  It  is  present  in 
the  conformers  g'GG’Gg\  g'GG'G'g,  and  g'GGG's  (with  N  -  •  ’H  distances  of 
1.902, 2.057,  and  1.990  \,  respectively).  Some  aspects  of  this  interaction,  namely, 
its  influence  on  bond  lengths  and  vibration  frequencies,  have  been  discussed  pre¬ 
viously  [  34] .  The  N  •  •  •  H — O  hydrogen  bond  also  has  significant  influence  on  the 
energetics:  The  three  conformers,  which  contain  this  bond,  are  much  lower  in 
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Figure  1.  The  relative  energy  (kJ/moI)  of  all  symmetry  unique  local  minima  in  the  PES 
of  4-aminobutancl  as  a  function  of  the  dihedral  angles  of  the  molecular  backbone. 


energy  than  all  other  conformers  (cf.  Fig.  1).  This  stabilization  has  considerable 
effect  on  the  topology  of  the  pes  of  4-aminobutanol,  because  it  fixes  the  amino 
group  in  one  position:  No  local  minima  are  present,  which  differ  from  g'GG'Gg', 
g'GG'G'g,  and  g'GGG's  in  the  orientation  of  the  amino  group  only.  Furthermore, 
no  local  minima  exist,  which  differ  from  the  global  minimum  g'GG'Gg'  in  the 
orientation  of  the  hydroxy  group.  In  the  case  of  g'GGG's  only  one  conformer 
ig'GGG't)  with  a  different  orientation  of  the  hydroxy  group  is  formed. 

A  comparison  of  the  number  of  local  minima,  which  are  actually  present  in  the 
4-aminobutanol  pes,  with  the  number  of  conformations  that  results  from  the  struc¬ 
tural  principle  of  three  orientations  per  internal  rotation  (3^  =  243),  repeals  the 
dominant  nature  of  the  N  •  •  •  H — O  hydrogen  bond.  One  out  of  the  1 10  symmetry 
unique  local  minima  is  of  Cv  symmetry;  hence  a  total  of  2 1 9  local  minima  is  present 
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H2-N-C-C 


Figure  2.  The  plot  of  the  two  dihedral  angles  H — N — C — C  against  each  other  for  all 
symmetry  unique  local  minima  in  the  pes  of  4-aminobutanol  shows  that  three  major 
orientations  of  the  amino  group  occur. 


in  the  4-aminobutanol  pes.  Hence  a  total  of  24  conformers,  i.e.,  12  symmetry 
unique  conformers,  are  “missing.”  The  stability  of  g'GG'Gg',  g'GG'G'g,  and 
g'GGG's  explains  the  absence  of  nine  of  these  12  missing  symmetry  unique  con¬ 
formers.  The  remaining  three  conformers  are  gGG'Tg,  gGG'Tg',  and  gGG'Tt,  which 
are  not  formed  due  to  a  steric  hindering  of  the  type 


'H/Cl 


The  intramolecular  hydrogen  bond  N — H-  •  -  O  is  present  in  the  conformers 
gGG’Gt{H^  •  -Odistancc:  2.055  A),^G'Gg(2.061  A),  tGG'Gg {2.090 k),tGG'Gt 
(2.1 14  A),  gGG'G'g'  (2.178  A),  ^G'G't  (2.263  A),  gGGG'l  (2.355  A),  iGGG't 
(2.395  A),  and  gGGG'g'  (2.469  A).  The  N-  -  -H — C|  hydrogen  bond  is  present 
in  g'GG'G'g'  (N  •  •  •  H  distance  2.462  A),  g'LG'G't  (2.502  A),  g'GG'Tg'  (2.574 
A),  g'GG'Tg  (2.591  A),  and  in  g'GG'Tt  (2.616  A).  The  C4— H-  •  -O  hydrogen 
bond  is  present  in  g'GGG't{}\-  •  •  O  distance:  2.398  A),  tTGG'l  (2.409  A),  g'TGG't 
(2.426  A),  (2.436  A),  g'TGG'g' {2.4A1  k),gTGG't  (2.447  A),  and  gTGG'g' 

(2.470  A).  None  of  the  intramolecular  interactions  N — H-  •  -O,  N*  •  -H — C|, 
and  C4 — H  •  •  •  O  has  an  influence  on  energetics,  bond  lengths,  or  vibration  fre¬ 
quencies.  Analyses  of  the  electron  densities  along  these  interactions,  however,  prove 
that  each  of  them  is  associated  with  a  critical  point  of  the  electron  density  with  two 
negative  eigenvalues.  Such  critical  points  represent  chemical  bonds  [40];  the  clas¬ 
sification  of  these  interactions  as  hydrogen  bonds  is  therefore  correct. 

In  addition  to  these  intramolecular  hydrogen  bonds  two  conformers  contain 
interactions,  which  lead  to  five-membered  ring  structures:  C3 — H  •  •  •  O  in  g'GG'G'i 
(with  an  H*  •  -O  distance  of  2.515  A)  and  N*  •  -H — C2  in  g'GGG't  (with  an 
N  •  •  •  H  distance  of  2.607  A).  Inspection  of  the  electron  density  shows  that  these 
interactions  are  not  hydrogen  bonds  but  mainly  of  electrostatic  ori^n. 
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Reaction  Paths 


The  minima  g’GG'Gg',  g'GG'G'g,  and  g'GGG's,  which  are  stabilized  by  the 
intramolecular  N  •  •  •  H — O  hydrogen  bond,  and  their  mirror  images  are  directly 
interconnected  in  the  reaction  sequence 


/" 

/•C-CH 


HH  H 

p _ 


g’GG'Gg’  ^ 

/  H--N 


g’GG'G’g  H— C..H— OA 

i  \  C~! 

h^n»  / 


/C - C-..H 

H— 0.,.::-^_^  gG’GGg’ 

. . IjH  / 

H 


/C— C— H 
g’GGG's  H— C...U 

h^n 

H 


H 

-C  H 

H— O 


The  N  •  •  •  H — O  bond  is  preserved  along  this  reaction  sequence:  The  N  •  •  •  H 
distance  is  1.906  A  in  the  transition  state  g'GG'Gg'  ^  g'GG'G'g,  1.923  A 
in  the  transition  state  g'GG'G'g  ^  g'GGG's,  and  1.91 1  A  in  the  transition  state 
g'GG'Gg'  ^  gG'G  Gs.  As  a  consequence  of  this  bond  preservation,  some  of  the 
internal  rotations  are  coupled  in  these  reactions  as  shown  in  Figure  3.  The  energy 
profile  along  one  of  these  reaction,  namely,  g'GG'Gg'  ^  g'GG'G'g,  is  displayed 
in  Figure  4.  This  profile  is  noteworthy  because  it  contains  a  flat  region  near  the 
conformation  g'GG'Gs,  which  is  also  a  consequence  of  the  N  >  •  •  H — O  hydrogen 
bond.  In  contrast  to  similar  situations,  which  occur  in  the  PES  of  3-aminopropanol, 
the  point  of  inflexion,  which  is  associated  with  this  flat  region,  d<  not  have  zero 
gradients. 

Table  II  lists  the  potential  barriers  of  all  internal  rotations  of  the  global  minimum 
together  with  the  final  conformer  of  each  of  these  rotations. 


Discussion 

A  comparison  of  Figure  1  with  the  corresponding  plots  for  2-aminoethanol  and 
3-aminopropanoI  (Figs.  1  and  2  in  [35])  shows  increasing  scatter  in  the  series  2- 
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Table  I  Relative  and  vib'".iional  zero  point  energies  (kJ/mol)  and  rotation  constants  (GHz)  for  all 
symmetry  unique  local  minima  in  the  PES  of  4-aminobutanol.  The  absolute  energy  of  the  global  minimum 

is  -286.755020  a.u. 


Conformer 

Relative  energy 

Rotation  constants 

Zero  point  energy 

g'GG'G'g' 

20.235 

1.9823 

2.4500 

4.8861 

437.421 

g'GG'G'g 

6.606 

2.0484 

2.8453 

4.4951 

441.012 

g'GG'G't 

19.951 

1.9249 

2.3219 

5.0383 

437.353 

g'GG'Gg' 

0.000 

1.9519 

2.8492 

4.4927 

441.255 

gGG'Gt 

20.371 

1.9080 

2.7406 

4.5571 

438.178 

g'GG'Tg' 

22.217 

1.5115 

1.7859 

6.3962 

435.964 

g'GG'Tg 

22.587 

1.5173 

1.8051 

6.3466 

436.350 

g'GGTl 

20.860 

1.5187 

1.8027 

6.4301 

436.819 

g'GGG's 

6.876 

2.0774 

2.8578 

4.5231 

441.286 

g'GGG't 

21.684 

1.8100 

2.0465 

5.6946 

436.005 

g'GGGg' 

21.513 

1.8240 

1.8309 

6.2137 

436.073 

g'GGGg 

16.835 

1.9227 

1.9456 

5.8506 

437.575 

g'GGGt 

16.732 

1.9322 

1.9211 

5.9836 

437.287 

g'GGTg' 

22.348 

1.5096 

1.6256 

6.9352 

435.922 

g'GGTg 

20.419 

1.5181 

1.6502 

6.8161 

436.537 

g'GGTl 

22.020 

1.5249 

1  6438 

6.9395 

435.850 

g'GTG'g' 

16.149 

1.3731 

1.4767 

9.5897 

436.025 

g'GTG'g 

18.373 

1.3652 

1.4700 

9.5892 

435  893 

g'GTG't 

15.351 

1.3828 

1.4908 

9.6783 

436.062 

g'GTGg' 

20.151 

1.4877 

1.5953 

7.7394 

435.600 

g'GTGg 

15.995 

1.5057 

1.5879 

7.3455 

436.552 

g'GTGi 

16.761 

1.5181 

1.5801 

7.6013 

435.890 

g'CTTg' 

17.544 

1.2732 

1.3664 

9.2636 

435.768 

g'GTTg 

18.793 

1.2732 

1.3624 

9.3050 

435.722 

g'GTTi 

18.917 

1.2811 

1..3727 

9.3493 

435.013 

g'TG'G'g 

23.414 

1.4836 

1.5875 

7.1202 

435.978 

g'TG'Gg' 

25.733 

1.5460 

1.8254 

6.3449 

436.373 

g'TGG'g' 

23.842 

1.5376 

1.827 1 

6.2704 

436.395 

g’TGG'g 

28.232 

1.5225 

1.7738 

6.4675 

435.730 

g'TGG't 

23.207 

1.5370 

1.8152 

6.4227 

436.157 

g'TGGg' 

25.378 

1.4794 

1.5828 

7.1611 

435.526 

g'TGGg 

21.912 

1.5126 

1.6309 

6.9231 

436  434 

g'TGGt 

21.960 

1.5167 

1.6281 

7.0458 

436.377 

g'TGTg' 

23.590 

1.2383 

1.2985 

11.2167 

435.675 

g'TGTg 

24.202 

1.2364 

1.2919 

11.3292 

435.523 

g'TGTi 

23.236 

1.2412 

1.2991 

11.4938 

435.507 

g'TTG'g' 

18.877 

1.2815 

1.3792 

9.2827 

435.697 

g'TTG'l 

17.181 

1.2893 

1.3899 

9.4151 

435.819 

g'TTGg 

19.589 

1.2810 

1.3720 

9.3463 

435.474 

gGG'G'g' 

22.206 

2.0012 

2.6990 

4.5942 

437.746 

gGG'G'l 

25.092 

1.9989 

2.6602 

4.6676 

436.653 

gGG'Gg 

19.444 

1.9551 

2.8134 

4.4527 

437.855 

gGGG'g' 

22.516 

1.9889 

2.6451 

4.6244 

436.771 

gCJGG'l 

16.858 

2.0219 

2.6731 

4.6972 

437.667 

gGGGg' 

26  452 

1.7914 

1.8093 

6.3200 

436.331 

gGGGg 

23.482 

1.7873 

1.8058 

6.3971 

436.272 
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Table  I. 

(Continued) 

Conformer 

Relative  energy 

Rotation  constants 

Zero  point  energy 

gCGGi 

23.522 

1.7978 

1.8171 

6.4228 

436.278 

gGGTg' 

24.682 

1.5154 

6.8174 

436.114 

gGGTg 

26.344 

1.6255 

434.724 

gGGTi 

24.856 

1.5143 

1  6376 

6.9535 

435.912 

gGTG'g' 

20.463 

1.3662 

1.4766 

436.051 

gGTG'g 

21.398 

1.3566 

9.6826 

436.009 

gGTG't 

19.060 

1.3742 

1.4888 

9.7382 

436.035 

gGTGg' 

22.616 

7.9175 

435.667 

gGTGg 

20.991 

1.4964 

1.5326 

7.7031 

435.356 

gGTGt 

22.424 

1.4879 

1.5185 

8.0528 

434.969 

gGTTg' 

20.489 

1.2588 

1.3587 

9.5305 

435.495 

gGTTg 

21.230 

1.2586 

1.3515 

9.5708 

435.189 

gGTTl 

19.946 

1.2682 

1.3669 

9.5718 

435.687 

gTGG'g' 

22.217 

1.5401 

1.8486 

6.2973 

435.895 

gTGG’t 

21.172 

1.5394 

1.8397 

6.4165 

435.943 

gTGGg 

23.181 

1.4890 

1.5357 

7.2267 

435.737 

gTGGl 

22.687 

1.4990 

1.5892 

7.3396 

436.146 

gTGTg' 

22.,5I4 

1.2384 

1.3080 

11.2433 

435.836 

gTGTg 

23,370 

1.2355 

1.2994 

11.3460 

435.564 

gTGTt 

21.968 

1.2404 

1.3067 

11.5193 

435.547 

gTTG'g 

19.599 

1.2633 

1.3493 

9.6519 

436.009 

gTTG't 

18.890 

1.2885 

1.3760 

9.5597 

435.517 

gTTGg- 

19.416 

1.2670 

1.3690 

9.4677 

435.980 

gTTTg' 

19.080 

1.0977 

1.1360 

16.6300 

434.994 

griTg 

18.741 

1.0978 

1.1364 

16.6275 

435.276 

gTTTt 

17.948 

1.1042 

1.1452 

16.7998 

435.289 

iGG’G'g’ 

27.360 

1.8024 

2.0574 

5.3532 

435.801 

iGG'G't 

2‘'.794 

1.7671 

1.9739 

5.6258 

435.520 

iGG'Gg 

19.857 

1.9318 

2.7804 

4.4276 

437.685 

iGG'Gt 

22.062 

1.8964 

2.7180 

4.5176 

437.716 

tGG'Tg' 

29.020 

1.4813 

1.7414 

6.4150 

434.027 

tGG'Tg 

27.384 

1.5012 

1.7599 

6.3445 

434.881 

tGG'Tt 

27.799 

1.4891 

1.7551 

6.4587 

434.227 

iGGG'g' 

23.060 

1.9589 

2.5545 

4.6649 

436.359 

iGGG't 

16.005 

2.0061 

2.6569 

4.6631 

437.513 

iGGGg' 

24.525 

1.7120 

1.7402 

6.6377 

435.824 

iGGGg 

21.744 

1.7727 

1.7891 

6.4084 

435.872 

iGGGi 

20.899 

1.7158 

1.7490 

6.7842 

436.044 

iGGTg' 

22.766 

1.5000 

1.6284 

6.8122 

435.717 

iGGTg 

24.325 

1.4775 

1.5880 

7.0303 

435.690 

tCGTt 

22.201 

1.4934 

1.6053 

7.0299 

435.738 

iGTG'g' 

18.573 

1.3562 

1.4601 

9.5787 

435.343 

tGTG'g 

19.883 

1.3472 

1.4529 

9.6124 

435.175 

iGTG't 

17.378 

1.3668 

1.4761 

9.6428 

435.424 

iGTGg' 

19.899 

1.4694 

1.5034 

7.7458 

435.761 

tGTGg 

19.114 

1.4785 

1  5318 

7.5514 

435.667 

iGTGi 

17.294 

1.4971 

1.5560 

7.5573 

435.599 

iGTTg' 

19.400 

1.2573 

1.3506 

9.3091 

435.383 

{Continued) 
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Table  I.  (Continued) 


Conformer 

Relative  energy 

Rotation  constants 

Zero  point  energy 

iGTTg 

18.998 

1.2588 

1.3469 

9.3318 

435.407 

tGTTt 

18.02'’ 

1.2656 

1.3591 

9.3779 

435.440 

tTGG'g' 

22.115 

1.5412 

1.8417 

6.2151 

435.900 

tTGG'g 

28.484 

1.4978 

1.7809 

6.3938 

434.686 

iTGG't 

21.886 

1.5442 

1.8368 

6.3492 

435.712 

iTGGg' 

24.110 

1.4739 

1.5813 

7.1335 

435.462 

iTGGg 

20.797 

1.4958 

1.6112 

6.9751 

436.003 

iTGGt 

20.030 

1.5096 

1.6281 

6.9906 

436.143 

tTGTg' 

22.907 

1.2343 

1.3007 

11.1295 

435.298 

tTGTg 

23.102 

1.2321 

1.2932 

11.2092 

435.374 

iTGTl 

21.933 

1.2371 

1.3013 

11.3789 

435.259 

tTTG'g' 

17.515 

1.2721 

1.3636 

9.2731 

435.525 

tTTG'g 

19.087 

1.2577 

1.3488 

9.5059 

435.532 

tTTGt 

16.344 

1.2832 

1.3802 

9.3580 

435.574 

tTTTg 

18.182 

1.0921 

1.1276 

434.990 

tTTTt 

17.305 

1.0986 

1.1364 

16.6482 

435.131 

aminoethanol,  3-aniinopropanol,  and  4-aminobutanol,  which  is  paralleled  by  an 
increasing  energy  gap  between  the  global  minimum  and  the  bulk  of  conformers 
without  intramolecular  interactions.  The  increasing  energy  gap  and  the  corre¬ 
sponding  potential  barriers  prove  that  the  seven-membered  ring,  which  is  formed 
in  4-aminobutanol,  is  more  stable  than  the  six-membered  ring  of  3-aminopropanol 
and  the  five-membered  ring  of  2-aminoethanol.  This  increase  of  stability  with  ring 
size  is  in  good  agreement  with  the  effect  of  the  N  •  •  •  H — O  hydrogen  bond  on  the 
O — H  bond  length  and  vibration  frequency  [34.41,42],  Furthermore,  the  larger 
amount  of  scatter  indicates  that  the  conformers  of  4-aminobutanol  are  more  flexible 
than  those  of  3-aminopropanol  and  2-aminoethanol.  This  is  an  interesting  conclu¬ 
sion:  On  the  one  hand,  it  provides  an  explanation  for  the  increase  of  stability  with 
the  ring  size,  because  the  two  functional  groups  can  arrange  themselves  with  less 
hindering  by  the  rest  of  the  molecule.  On  the  other  hand,  however,  the  same  is 
valid  for  all  other  4-aminobutanol  conformers  too,  which  makes  the  greater  stability 
of  the  global  minimum  of  4-aminobutano!  even  more  remarkable. 

For  the  global  minima  of  the  homologues  2-aminoethanoI,  3-aminopropanol, 
and  4-aminobutanol  it  has  to  be  noted  that  for  none  of  them  the  potential  barrier 
for  H-bond  preserving  reactions  is  the  lowest  one.  In  any  case  there  is  one  internal 
rotation,  which  destroys  the  intramolecular  N  •  •  •  H — O  hydrogen  bond,  but  re¬ 
quires  less  activation  energy  than  those  internal  rotations  that  preserve  the  H-bond. 

Upon  comparing  the  various  intramolecular  hydrogen  bonds  in  4-aminobutanol 
and  3-aminopropanoi,  one  encounters  that  the  Q — H*  •  *0  and  N*  •  -H — C| 
interartions  in  4-aminobutanol  have  no  counterparts  in  3-aminopropanol  and  2- 
aminoethanol.  This  seems  to  be  not  so  much  a  consequence  of  the  greater  flexibility 
in  4-aminobutanol,  but  one  of  the  fact  that  these  interactions  form  six-membered 
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0-C-C-C  — 

Figure  4.  The  energy  profile  of  the  reaction  g'GG'Gg'  ==  g'GG'G’g  contains  a  flat  region 
due  to  the  N  •  •  •  H — O  hydrogen  bond.  (•)  Stationary  points  (local  minima  and  saddle 

points)  of  the  PES. 


rings  of  the  chair  type  in  4-aniinobutanol.  The  analogous  five  membered  rings 
in  3-aniinopropanol  would  require  drastic  deviations  from  the  “standard” 
C — C — C — O  and  C — C — C — N  orientations  of —60°,  60°,  and  180°. 
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Table  II  Final  geometries  and  potential  barriers  (kJ/mol)  of  the  various  reaction  paths  of 
the  most  stable  4-aminobutanol  conformer  ^'CG'G^ 


Internal  rotation 

Final  conformer 

Potential  barrier 

Decreasing  angle  C — C — C — C 

g'GTGg 

19.16 

Increasing  angle  C — C — 0 — H 

g'GG'G'g 

20.60 

Increasing  angle  N — C — C — C 

g'TG'Gg' 

29.32 

Increasing  angle  C — C — C — 0 

g'GG'Tg' 

31.60 

Decreasing  angle  C — C — 0 — H  1 

g'GGGi 

39.01 

Increasing  angle  C— C — C — C  J 
Decreasing  angle  C— C— C— 0 

g'GG'G'g' 

41.00 

Decreasing  angle  N — C — C — C 

gG'G'Gt 

26.06 
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Abstract 

Statistical  mechanics  Monte  Carlo  simulations  of  the  proton  transfer  in  [  H3N — H — NH3  ]  '^  in  aqueous 
solution  have  been  carried  out  using  the  combined  quantum  mechanical  and  molecular  mechanical 
AMl/np3P  potential.  As  in  the  gas  phase,  squeezing  and  stretching  motions  of  the  donor-acceptor 
distance  have  profound  effects  on  the  proton  transfer.  Solvation  increases  the  activation  free  energy 
of  the  proton  transfer  by  3  kcal/mol  over  that  in  the  gas  phase  The  results  suggest  that  the  combined 
QM/MM  potential  may  be  used  in  models  that  treat  the  proton  motion  quantum  mechanically  and  the 
solvent  dynamics  classically.  ©  1993  John  Wiley  &  Sons.  Inc. 

Introduction 

In  a  recent  study,  McCamtnon  and  coworkers  constructed  the  potential 
energy  surface  for  the  proton  transfer  reactions  of  [H3N  —  H  —  NH3]^  and 
[H3N — H — OH2]*  in  the  gas  phase  through  ab  initio  calculations  at  the  mp2/6- 
31G(d)  level  [1],  The  effort  was  a  prelude  to  modeling  the  quantum  mechanical 
tunneling  effect  in  proton  transfer  process  {2],  which  has  also  been  observed  in 
enzymatic  reactions  [3].  The  authors  found  that  the  barrier  heights  for  these  re¬ 
actions  are  sensitive  to  the  squeezing  and  stretching  motions  of  the  donor-acceptor 
fragments,  suggesting  that  a  dynamic  model  should  be  used  for  the  proton  transfer 
[1].  In  addit’cn,  the  analytical  potential  function  obtained  by  fitting  to  the  ab  initio 
data  provides  the  necessary  information  needed  in  the  quantum  mechanical  treat¬ 
ment  of  proton  transfer  in  proteins.  However,  a  full  description  of  the  proton  transfer 
process  requires  specific  consideration  of  the  environmental  solvent  effects  [4], 
Unfortunately,  most  theoretical  studies  to  date  have  been  limited  to  model  systems 
in  the  gas  phase  or  phenomenological  approaches  due  to  the  lack  of  proper  potential 
functions  for  solute-solvent  interactions  [5-7]. 

A  viable  approach  to  the  determination  of  the  potential  surface  for  the  proton 
transfer  in  solution  is  to  use  a  combined  quantum  mechanical  and  molecular  me¬ 
chanical  (QM/MM)  method  oy  treating  the  solute  quantum  mechanically  and  the 
solvent  molecules  classically  [4].  Consequently,  the  solute-solvent  interaction  as 
well  as  the  proton-transfer  potential  surface  is  evaluated  by  the  quantum  mechanical 
calculations  in  fluid  simulations.  The  quantum  tunneling  effect  is  then  obtained 
through  the  time-dependent  Schrodinger  equation  for  the  propagation  of  the  proton 
on  the  potential  surface  approximated  by  the  qm/mm  model.  This  idea  has  been 
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pioneered  by  many  researchers  [8-11].  Warshel  and  coworkers  used  the  empirical 
valence  bond  theory  to  study  the  hydrogen  transfer  reactions  in  proteins  including 
lysozyme,  trypsin,  and  lactate  dehydrogenase  [8],  while  Kollman  [9],  Karplus, 
and  coworkers  combined  ab  initio  and  semiempirical  molecular  orbital  theory  with 
the  classical  dynamics  method  [10].  Our  own  effort  on  the  use  of  the  hybrid 
QM/MM  potential  in  statistical  mechanics  Monte  Carlo  simulations  has  demon¬ 
strated  that  the  combined  semiempirical  AMl  theory  and  the  tip3P  model  for  water 
can  provide  adequate  description  of  the  solvation  of  biomolecular  systems  in 
aqueous  solution  [11].  Furthermore,  our  results  showed  that  the  solvent  polarization 
effects,  which  are  of  the  primary  concern  on  the  use  of  the  empirical  potentials, 
can  be  quantitatively  determined  a  priori  in  these  calculations  [12,13]. 

In  this  article,  we  examine  the  solvent  effect  on  the  Bom-Oppenheimer  potential 
surface  for  the  proton  transfer  of  [H3N — H — NHs]"*^  in  water  using  the  hybrid 
AM  1  /tip3P  potential  via  Monte  Carlo  simulations.  Comparison  with  the  gas  phase 
results  from  Ref.  [1]  will  be  made.  The  dynamics  and  contributions  to  the  rate 
constant  from  the  quantum  mechanical  nuclear  tunneling  will  be  presented  in  a 
future  publication.  In  the  following,  computational  details  will  be  first  given  followed 
by  results  and  discussion. 


Computational  Details 

In  the  present  study,  the  combined  QM/MM  potential  is  partitioned  into  a  quan¬ 
tum  mechanical  region  consisting  of  the  proton  donor  and  acceptor  molecules, 
[  H3N  —  H  —  NH3  ]  ^ ,  and  a  molecular  mechanical  region  of  the  solvent  molecules 
approximated  by  the  three-site  tip3P  model  for  water  [14].  Clearly,  in  order  to 
compute  the  energies  of  the  qm  solute  molecules  throughout  the  condensed-phase 
simulation,  a  computationally  efficient  method  must  be  used.  The  semiempirical 
Austin  Model  1  (  am  1 )  theory  developed  by  Dewar  and  coworkers  [  1 5  ]  is  employed 
to  form  the  am1/tip3P  force  field  [12].  Thus,  the  solute,  [H3N — H — NH3]‘^,  is 
represented  by  valence  electrons  and  nucleus  cores  in  the  Monte  Carlo  simulation. 
The  restricted  Hartree-Fock  (hf)  wave  function,  ^>,  is  used  with  a  single  Slater 
determinant  of  all  doubly  occupied  molecular  orbitals  (mo),  which  are  linear  com¬ 
binations  of  a  minimum  basis  set  [15].  The  total  effective  Hamiltonian  of  the 
system  is  given  by: 

=  H°  +  Hqmimm  +  Hmm  ,  (  1  ) 

where  H°  is  the  amI  Hamiltonian  for  the  solute  molecules,  is  the  molecular 
mechanical  solvent  energy,  and  is  the  solute-solvent  interaction  Hamil¬ 

tonian  defined  previously  [10-12].  Therefore,  the  total  potential  energy  in  the 
combined  qm/mm  force  field  is  computed  using  Eq.  (2), 

Etoi  ~  ~  Eqnt  F  Eqm/mm  "I"  •  (2) 

Here,  #  is  the  wave  function  of  the  solute  in  aqueous  solution,  Emm  is  the  MM  pair- 
interaction  energy  for  the  solvent  molecules  enumerated  with  the  empirical  tip3P 
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potential,  and  {Eqm  +  are  determined  through  the  Hartree-Fock  self-con¬ 

sistent-field  MO  calculations.  In  particular,  the  solute-solvent  interaction  energy  is 

soitae  Hater 

Eq„imn^  =  <4>  | |  «t>  +  S  2  4c„[(<T„//?„) -  (<T„/i?„)®]  ,  (3) 

t  s 

where  Hqm/mm  is  the  electronic  portion  of  the  solute-solvent  interaction  Hamil¬ 
tonian,  and  the  Lennard-Jones  terms  account  for  dispersion  interactions  between 
the  QM  and  MM  regions,  which  are  omitted  in  the  hybrid  QM/MM  approximation 
[10,12].  The  Lennard-Jones  parameters  are  taken  from  our  previous  work  [12]. 

The  potential  energy  surface  for  the  proton  transfer  in  [H3N  —  H — NHa]"^  is 
determined  as  a  function  of  the  intemuclear  distance  between  the  nitrogen  atoms, 
R,  and  the  position  of  the  proton,  x,  measured  from  the  geometrical  center  of  the 
two  nitrogens.  The  model  as  defined  in  Ref.  [1]  is  shown  in  Figure  1.  Note  that 
only  the  linear  arrangement  is  considered  here,  while  McCammon  and  coworkers 
also  investigated  the  off-axis  displacement  [1].  In  the  gas  phase,  the  C3„  structure 
is  the  minimum  configuration  at  a  fixed  R,  which  was  used  in  the  energy  calculations. 
However,  the  rotation  of  the  NH3  groups  along  the  N  —  N  axis  is  sampled  through 
the  dihedral  sampling  procedure  in  the  Monte  Carlo  calculations.  So.  the  model 
system  takes  a  C3  symmetry  in  general  in  the  computation  of  the  free  energies  of 
solvation  in  water. 

The  free  energy  of  solvation  was  computed  via  statistical  perturbation  theory 
[16]  using  the  MCQUB/BOSS  program  [17],  in  which  the  quantum  mechanical 
energies  were  evaluated  with  the  MOPAC  program  modified  to  include  the 
Hqm/mm  terms  [18].  The  calculations  were  performed  at  R  =  2.50  to  3.00  A  with 
an  increment  of  0.05  A.  For  each  value  of  R,  x  was  transformed  from  0.00  to  0.45 
A  using  the  “double-wide”  sampling  technique  [19]  with  Ax’  =  ±0.05  A  to  compute 
the  relative  free  energies  of  solvation  [Eq.  (4)]: 

AGfx,  —  ±2,  R)  =  -keTln  (4) 

where  k^  is  Boltzmann’s  constant,  (  )|  represents  the  ensemble  average  using  the 
Hamiltonian  Heff{x\,  R),  and  AE(X|  -*■  X2,  R)  is  the  energy  difference  between 
proton  positions  Xi  and  X2.  Finally,  the  relative  free  energy  between  two  adjacent 
lines  was  connected  through  an  additional  perturbation  of  R  at  x  -  0.00  A.  There¬ 
fore,  the  data  for  the  solvation  effect  on  the  proton  transfer  came  from  a  total  of 


K -  R  - H 


Figure  1.  Geometrical  definition  for  the  model  proton  bound  dimer. 
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60  simulations,  each  consisting  of  1  X  10*  configurations  for  equilibration  followed 
by  1.5  X  10*  configurations  of  averaging,  and  each  requiring  about  10  h  of  CPU 
time  on  an  IBM  RS/6000  Model  560  computer.  All  simulations  were  carried  out  in 
a  rectangular  box  (ca.  18.5  X  18.5  X  27.8  A^)  containing  321  water  molecules  plus 
[H3N — H — NH3]^.  The  isothermal-isobaric  ( npt  )  ensemble  at  25 °C  and  1  atm 
was  used  along  with  the  periodic  boundary  conditions.  A  spherical  cutoff  of  9.5  A 
was  adopted  for  solute-  solvent  and  solvent-solvent  interactions.  Standard  deviations 
were  estimated  by  computing  separate  averages  over  blocks  of  10*  configurations. 
The  accumulative  error  was  about  0.2  kcal/mol  from  the  center  of  the  free 
energy  map. 


Results  and  Discussion 

To  verify  the  performance  of  the  semiempirical  amI  method  for  the  proton 
transfer  in  [H3N — H  —  NH3]^,  the  computed  barrier  heights  at  fixed  values  of 
the  nitrogen  separation  are  compared  with  the  results  obtained  by  McCammon 
and  coworkers  using  the  MP2/6-31G(d)  method  (Table  I);  the  agreement  with 
the  ab  initio  data  is  good,  suggesting  that  the  am  1  method  may  be  used  to  study 
the  present  proton  transfer  reaction.  Tiie  MP2/6-31G(d)  optimizations  yield  an 
equilibrium  distance  of  R  =  2.72  A  [1],  which  is  close  to  the  AMl  value  of  2.74  A. 
At  R  =  2.75  A,  the  computed  barriers  are,  respectively,  2.6  and  3.3  kcal/mol  at 
the  mp2  and  AMl  levels,  while  squeezing  the  system  to  R  =  2.5  A  results  in  a  single 
minimum  of  symmetric  configuration,  which  has  an  energy  equal  to  the  barrier 
height  at  R  =  2.75  A.  The  barrier  height  is  raised  to  10.2  and  12.4  kcal/mol  for 
the  MP2/6-31G(d)  and  amI  models  upon  stretching  the  N  —  N  distance  to  R  = 
3.00  A.  The  potential  energy  surface  obtained  from  the  am  1  calculations  is  similar 
to  that  from  the  ab  initio  calculations,  and  is  displayed  in  Figure  2,  in  which  the 
energy  has  been  anchored  to  zero  at  the  global  minimum.  The  feature  of  the  com¬ 
bined  motion  of  the  proton  and  heavy-atom  positions  for  the  proton  transfer  along 
the  classical  path  observed  in  the  ab  initio  calculations  are  clearly  seen  in  Figure 
2.  The  transition  state  at  R  =  2.62  A  (x  =  0.00)  is  1.4  kcal/mol  above  the  global 
minimum,  a  result  in  good  agreement  with  the  MP2/6-31G(d)  result  (AE*  =  0.9 
kcal/mol  at  R  =  2.62  A)  [1]. 

The  free  energy  surface  for  the  proton  transfer  in  aqueous  solution  is  shown  in 
Figure  3.  The  solvent  effect  is  apparent  in  comparison  with  the  potential  surface 
in  the  gas  phase  displayed  in  Figure  2.  First,  the  solvation  effect  increases  the  barrier 


Tablb  1.  Computed  activation  energies  for  the  proton 
transfer  of  [H3N-H-NH3j‘'  in  the  gas  phase  (kcal/mol) 


R(N— N).  A  AHt(AMI)  AEt  {MP2/6-3IG*) 


2.50 

2.75 

3.00 


0 

3.3 

124 


0 

2.6 

10.2 
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Figure  2.  Potential  energy  contour  for  the  proton  transfer  in  (HjN  —  H  — NHj)'*  in  the 
gas  pnase.  Contour  levels  are  0.4  kcal/mol. 


Figure  3.  Potential  energy  contour  for  the  proton  transfer  in  [HjN  —  H  —  NH3]*  in 
aqueous  solution.  Contour  levels  are  1.0  kcal/mol. 
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of  the  proton  transfer  in  [H3N  —  H — NHj]^  by  ca.  3  kcal/mol.  Apparently,  at 
the  transition  state  where  the  transferring  proton  is  situated  in  the  middle  of  the 
donor  and  acceptor  molecules,  the  positive  charge  is  more  delocalized  than  the 
ion-dipole  complex.  Consequently,  the  transition  state  is  not  as  strongly  stabilized 
by  the  solvent  as  that  of  the  ion-dipole  structure.  The  results  emphasize  the  im¬ 
portance  of  specific  consideration  of  the  environmental  solvation  contributions  to 
the  rate  constant  in  theoretical  models  for  the  proton  transfer  in  enzymatic  systems. 
Secondly,  the  ion-dipole  complex  is  a  very  stable  species  in  the  gas  phase,  which 
requires  much  more  energy  to  dissociate  into  separate  ammonium  ion  and  ammonia 
monomers  than  that  needed  to  cross  the  barrier  of  the  proton  transfer  process  (the 
binding  energy  of  NHJ  and  NH3  is  about  27  kcal/mol)  [6].  In  aqueous  solution, 
however,  the  competition  between  the  intrinsic  binding  of  the  ion-dipole  complex 
and  the  solute-solvent  interaction  yields  a  delicacy  on  the  details  of  the  potential 
surface.  Figure  3  shows  that  the  ion-dipole  complex  is  only  about  1  kcal/mol  lower 
in  energy  than  the  species  with  a.  N — N  separation  of  2.95  A.  The  feature  is  similar 
to  that  observed  in  the  Sfj2  reaction  of  Cl'  +  CH3CI  in  water,  where  the  gas-phase 
ion-dipole  minimum  is  completely  smeared  out  in  aqueous  solution,  leading  to  a 
unimodal  reaction  profile  (20]. 

Structural  details  on  the  solute-solvent  interactions  are  further  characterized  by 
the  computed  radial  distribution  functions  (rdfs),  gi>(r),  which  give  the  probability 
of  finding  an  atom  of  type  y  a  distance  r  from  an  atom  of  type  x.  The  results  for 
the  simulation  at  R  =  2.75  A  and  x  =  0.30  A,  corresponding  to  the  minimum  in 
Figure  3,  are  given  in  Figures  4  through  6.  In  these  figures  the  first  atom  for  an  xy 
distribution,  refers  to  an  atom  of  the  solute  and  the  second  atom  is  either  the  oxygen 
or  the  hydrogen  of  water. 

Figu’-e  4  shows  the  NH3  hydrogen-water  oxygen  rdfs.  For  the  donor  molecule 
(NH4 ,  solid  curve),  hydrogen  bonding  interactions  are  clearly  revealed  by  the  first 
peak  at  1.9  A,  which  is  in  exact  agreement  with  a  previous  study  of  the  solvation 
of  methylammonium  ion  in  water  [21].  Integration  of  this  peak  to  2.5  A  yields  1 .4 
water  molecules  per  hydrogen.  Thus,  the  total  number  of  nearest-neighbor  water 


Figure  4.  Amino  hydrogen-water  oxygen  radial  distnbution  functions. 
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Figure  5.  Nitrogen-water  oxygen  radial  distribution  functions. 


molecules  for  the  donor  is  4.2,  which  may  be  compared  with  the  value  of  3.9  for 
methylammonium  ion  [21].  Hydrogen  bonding  interactions  between  the  acceptor 
ammonia  hydrogen  and  water  are  much  weaker  as  expected.  The  dashed  curve  in 
Figure  4  displayed  only  a  small  shoulder  in  the  H — O  rdf.  The  hydrogen-bond 
donating  ability  of  NH3  is  known  to  be  poor  from  ab  initio  calculations  and  gas- 
phase  microwave  experiments  [22].  The  N — O  rdfs  in  Figure  5  confirm  the  pattern 
of  hydrogen  bonding  interactions  noted  above  with  a  sharp  first  peak  for  the  donor 
monomer  (solid  curve)  and  a  broad  distribution  for  the  acceptor  nitrogen 
(dashed  curve). 

Finally,  the  rdfs  for  the  transferring  proton  with  water  are  shown  in  Figure  6. 
Due  to  steric  congestions  from  the  donor  and  acceptor  NH3  groups  in 
[H3N  —  H  —  NH3]^,  the  migrating  proton  is  not  well  hydrated  by  the  solvent 


H-X  Radial  Distribution  Functions 


Figure  6.  Radial  distribution  functions  for  the  migrating  proton  with  water  oxygen  ( solid 
curve)  and  hydrogen  (dashed  curve). 
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molecules.  Thus,  the  peaks  occur  at  larger  distances  than  those  shown  in  Figure  4. 
Furthermore,  the  broad,  structureless  distribution  for  the  H  —  H  rdf  indicates  that 
the  water  hydrogens  have  considerable  orientational  freedom  in  forming  hydrogen 
bonds  with  the  solute. 


Conclusion 

We  have  presented  the  results  from  Monte  C^rlo  simulations  of  the  proton  transfer 
in  [H3N — H — NHa]"^  in  water  using  the  combined  qm/mm-am1/tip3P  method. 
The  solvent  effect  was  found  to  significantly  increase  the  classical  activation  free 
energy  over  that  in  the  gas  phase.  The  barrier  to  proton  transfer  is  sensitive  to  the 
distance  between  the  donor  and  acceptor  nitrogen  atoms.  Although  the  quantum- 
mechanical  nuclear  tunneling  was  not  specifically  considered,  the  present  st"dy 
shows  that  the  hybrid  QM/MM-AMl  /tip3P  potential  can  provide  adequate  descrip¬ 
tion  of  the  solute-solvent  interaction  for  the  proton  transfer  process  in  solution, 
and  the  method  may  be  used  in  a  model  that  treats  the  proton  motion  quantum 
mechanically  and  the  solvent  dynamics  classically. 
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Abstract 

The  electronic  and  geometrical  structures  of  exchange-coupled  open-shell  systems  such  as 
bis{  methylene )  methane  and  the  chromium-methylene  cation  were  investigated  by  the  molecular  orbital 
and  its  extended  methods;  rhf,  uhf,  unocas,  rmp,  ump,  pump,  apump,  uccsd(t),  uno-ci  and 
QCiSD(T).  The  fully  optimized  geometries  of  these  species  by  the  energy  gradients  of  UMPn  solutions  are 
consistent  with  the  expenmental  values.  The  total  energies  of  the  lowest  and  highest  spin  states  were 
calculated  in  order  to  elucidate  the  effective  exchange  integrals  in  the  Heisenberg  model  for  methylene 
and  bis( methylene) methane.  The  spin  projection  is  necessary  for  the  UHF-based  wavefunctions  in  the 
lowest  spin  state  of  the  species.  The  binding  energies  for  the  chromium  cation-methylene  double  bond 
were  calculated  by  both  spin-unprojected  and  -projected  methods.  It  was  found  that  the  binding  energy 
between  Cr*  and  ’CHj  by  apuhf  QCI5D(T)  is  in  accord  with  the  experimental  value.  ©  1993  John  Wiley 
&  Sons.  Inc. 


Introduction 

Recently  electronic  and  geometrical  structures  of  unstable  molecules  and  their 
clusters  have  received  particular  interest,  since  computational  methods  for  stable 
closed-shell  molecules  and  concerted  reactions  are  rather  established.  One  of  the 
remarkable  features  of  unstable  molecules  such  as  diradical  and  polyradicals  is  that 
the  restricted  Hartree-Fock  (RHF)  and  open-shell  rhf  (orhf)  solutions  for  these 
systems  suffer,  respectively,  from  the  so-called  triplet  instability  and/or  spin-flip 
(SF)  instability  [1].  Then  the  spin  and  electron  correlations  are  nondynamically 
important,  indicating  the  necessity  of  beyond  Hartree-Fock  theories  and  compu¬ 
tations  [2],  As  a  continuation  of  the  previous  work  [1]  (hereafter  it  is  referred  to 
as  the  Part  I),  we  here  summarize  our  recent  theoretical  approaches  to  unstable 
chemical  bonds  involved  in  exchange-coupled  open-shell  systems. 

We  first  summarize  theoretical  backgrounds  in  the  next  section,  where  advanced 
MO  techniques  are  examined  in  relation  to  chemical  pictures  and  models  for  unstable 
molecules  [1],  Both  spin-restricted  and  unrestricted  post-Hartree-Fock  methods 
are  briefly  reviewed  in  the  next  section.  In  order  to  explain  practical  aspects  in  ab 
initio  computations,  we  investigate  the  electronic  structures  of  methylene  and 
bis( methylene) methane  in  the  third  and  fourth  sections.  The  effective  exchange 
integrals  for  these  species  in  the  Heisenberg  model  are  calculated  by  the  total  energy 
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difference  between  the  lowest-spin  (ls)  and  highest-spin  (hs)  states.  The  direct 
exchange-coupling  energies  between  chromium  cation  and  triplet  methylene  are 
examined  by  several  computational  methods  in  the  fifth  section. 


Theoretical  Backgrounds 
Instability  of  Chemical  Bonds  in  Open-Shell  Systems 

The  ab  initio  restricted  Hartree-Fock  (rhf)  molecular  orbital  (mo)  method  [3] 
has  been  utilized  for  elucidation  of  electronic  structures  of  closed-shell  organic 
molecules  such  as  homopolar  and  polar  nonradical  species.  In  order  to  eludidate 
geometrical  parameters  such  as  bond  lengths  and  angles,  together  with  electronic 
structures,  a  variety  of  ab  initio  rhf  and  RHF-based  post  Hartree-Fock  calculations 
have  already  been  carried  out  for  organic  molecules  and  organic  reaction  inter¬ 
mediates  [4]. 

However,  the  intrinsic  deficiency  of  the  rhf  MO  method  is  that  it  cannot  describe 
a  dissociation  process  of  a  covalent  bond  even  in  a  qualitative  sense.  An  approach 
to  this  problem  within  the  Hartree-Fock  (HF)  MO  framework  is  to  use  the  unre¬ 
stricted  HF  (UHF)  theory  [5].  The  bifurcation  of  the  rhf  MOS  into  the  dods  mos 
occurs  at  the  triplet  (T)  instability  threshold  where  the  lowest  eigenvalue  { Xo  =  0) 
of  the  T-instability  matrix  is  zero  [3,6-8].  This  means  chemically  that  a  molecule 
under  consideration  has  a  singlet  diradical  character  [9].  Generally,  the  uhf  mo 
method  offers  a  qualitatively  correct  orbital  picture  and  orbital  correlation  diagram 
for  the  dissociation  of  a  covalent  bond,  being  applicable  to  diradical  and  polyradical 
species  [  10] .  However,  the  UHF-based  wavefunctions  for  singlet  diradicals  and  low- 
spin  species  suffer  from  the  spin  contaminations  arising  from  high-spin  states  [S], 
leading  to  spin-projection  or  spin-restricted  procedures  described  below  [11-13]. 

UNO-CASa  and  casscf  Methods 

A  spin-restricted  method  for  the  elucidation  of  electronic  structures  of  unstable 
molecules,  particularly  of  open-shell  species  such  as  diradical  and  polyradial  species, 
is  the  multiconfiguration  self-consistent-field  (  mcscf  )  method  [  2  ] .  As  an  extension 
of  the  MO-theoretical  approach  to  unstable  ncolecules,  we  have  investigated  the 
utility  of  approximate  natural  orbitals  and  their  occupation  numbers  determined 
by  the  uhf  calculations  for  selections  of  active  orbitals  and  for  rapid  convergence 
of  the  multireference  (MR)  configuration  interaction  (ci)  and  MCSCF  calculations 
[14,15].  Recently,  Pulay  and  his  coworkers  [16,17]  have  also  demonstrated  that 
the  UHF  solutions  can  be  used  for  determining  approximate  natural  orbitals  that 
are  starting  trial  orbitals  for  effective  mcscf  calculations  of  complex  open-shell 
species. 

In  order  to  elucidate  the  open-shell  characters,  the  uhf  solution  is  expressed  by 
the  corresponding  molecular  orbitals  (CMOS)  [5,18]: 


^'(uhf)  =  |X,^X2^  •  •  •  XsW\  , 


(1) 
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where  X,  and  ri,  are  CMOs,  which  are  related  to  the  UHF  natural  orbitals  (UHF 
NO  =  UNO)^^, 

^Ho-i  “  COS  CO  iplio-i  "1"  sin  co  (pm+i  (HO  =  HOMO,  LU  =  LUMO) ,  (2a) 

’?Ho-(  =  cos  CO  <pno-i  ~  sin  co  ^plu+i  >  (2b) 

where  co  is  the  orbital  mixing  parameter.  The  localized  natural  orbitals  (LNO)  anc. 
magnetic  orbitals  are  defined  as  the  corresponding  MOs  at  the  strong  correlation 
iimitco  =  45°[19]: 

Xho-. (LNO)  =  (^HO-.  +  V’LU+, )/^  (HO  =  HOMO,  LU  =  LUMO)  ,  (2c) 

»?ho-i(lno)  =  (v’ho-i  “  <Plu-h)I^  ■  (2d) 

The  LNOs  are  useful  for  the  valence-bond  (VB)-like  explanations  of  the  UNO  com¬ 
plete  active  space  (CAS)  configuration  interaction  (ci)  wavefunctions  [1,14]. 

The  natural  orbital  pairs  (v>ho-i  ,  ^lu+<  )  in  Eq.  (2)  are  usually  the  bonding  and 
antibonding  MOs,  and  these  are  spatially  symmetry-adapted  orbitals  in  contrast  to 
generalized  valence  bond  (gvb)  natural  orbitals  [1],  which  are  ohen  bond  orbitals. 
The  occupation  numbers  (/i^)  of  these  nos  are  given  by  the  orbital  overlap  between 
the  corresponding  orbital  pairs  (Xho-i  and  tjho-i)  as 

»jho-,(uhf)  =  1  +  Ti,  nLu+,(UHF)  =  1  -  r, ,  (3a) 

where 

Tij  —  J'  X,Tij  dr  =  5,j,  Til  —  Ti  .  (4) 

Judging  from  the  magnitude  of  the  occupation  numbers,  we  can  select  the  active 
natural  orbitals  for  the  full  valence  (FV )  ci  or  CAS  Cl  calculations  for  the  ground 
and  lower  excited  states  [15,20,21].  When  the  UNO  and  cas(fv)  a  vector  are 
utilized  as  trial  orbitals  [  1 5  ]  for  the  more  refined  FV  -type  MCSCF  calculations  [15] 
for  a  state  under  consideration,  the  procedure  is  often  called  as  uno-Casscf  or 
UNO-CAS  [18]. 

The  interrelationship  between  the  UHF,  spin-projected  UHF  (puhf)  and  UNO 
CASSCF  methods  is  clear.  As  an  example,  let  us  consider  the  case  of  diradical  species 
in  which  the  corresponding  orbitals  are  essentially  equivalent  to  the  closed-shell 
MOS  ip  except  for  a  diradical  pair  [1,22,23]: 

X,  =77,  (/ =  0,1,2,...,  HO -i).  (5) 

The  singlet  uhf  orbital  configuration  |  Xhotiho  I  for  diradicals  does  not  express  the 
pure  singlet  state.  The  spin  projection  of  the  configuration  into  the  singlet  state 
(PUHF)  is  necessary  for  a  quantitative  purpose: 

Sk(PUHF)  =  Iv’i^V’2^{Xho77ho  +  tihoXho}  I  (6a) 

=  cos^  '  •  •¥>ho^hoI 

—  sin^  <r>\*P\'P\<P'^2  •  *  '^lu^PluI  >  (6b) 
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where  to  is  defined  by  Eq.  (2).  The  PUHF  wavefunction  is  the  2X2  multiconfi- 
guration  wavefunction  constructed  from  the  ground  and  doubly  excited  state 
functions. 

The  diradical  character  is  defined  by  the  weight  ( H'j)  of  the  doubly  excited 
configuration  in  the  mcscf  theory  and  it  is  formally  expressed  by  the  orbital  overlap 
T,  in  the  case  of  the  puhf  theory  [1,5,23,24]: 

y,=2W,=  \-  2T,/(\  +  T^).  (7a) 

The  diradical  characters  by  the  PUHF  method  are  0%  and  100%  for  closed-shell  and 
pure  diradical  states,  respectively.  The  occupation  numbers  of  UNO  for  polyradicals 
in  the  puhf  state  are  generally  given  by 

«ho-,(puhf)  =  (nHo-.(UHF)]V(l  +7'?).  (3b) 

«lu+,(puhf)  =  [«lu+,(uhf)]V(I  +  7'?)-  (3c) 

Therefore,  the  occupation  number  of  an  antibonding  UNO  is  nothing  but  the  diradi¬ 
cal  character  for  a  chemical  bond  /: 

«lu+,(puhf)  ~  3’lu+i  •  (7b) 

Generally,  the  uno  casscf  method  should  involve  at  least  all  the  active  pairs  of 
UNOs,  which  are  responsible  for  di-  and  polyradical  states.  Therefore,  the  UNO  Cas 
wavefunction  includes  the  nondynamical  part  of  electron  correlation  correction. 

Single  Reference  (SR)  Post-Harlree-Fock  Methods 

The  R(u)HF  and  uno-casscf  wavefunctions  do  not  involve  the  so-called  dy¬ 
namical  electron  correlation  effect.  Several  post  Hartree-Fock  approaches  have 
been  succesfully  used  for  simple  and  efficient  computations  of  electron  correlation 
energies  [24,25].  For  closed-shell  molecules  and  singlet  species  without  strong  di¬ 
radical  characters,  the  reference  wavefunction  is  usually  for  the  RHF  type: 

’I'(RHF2r)  =  ^(RHF)  -F  ^(2(  correction) ,  (8) 

where  the  X's  mean  the  post  Hartree-Fock  methods:  nth-order  Moller-Plesset 
perturbation  (MPn),  coupled-cluster  double  (ccD).  cc  single  double  (ccsd),  ccd 
plus  single  and  triple  purturbation  (CCDST4),  quadratic  SD  configuration  interaction 
(QCISD),  and  QCISD  plus  triple  excitation  [qcisd(t)],  etc.  [25].  The  RHF-based 
wavefunctions  are  quite  acceptable  wavefunctions  for  closed-shell  species  and  singlet 
species  with  weak  diradical  characters.  However,  they  often  over( under)  estimate 
the  correlation  correction  for  singlet  species  with  strong  diradical  characters,  in¬ 
dicating  the  breakdown  of  the  RHF-based  approaches  [26], 

For  open-shell  molecules  and  singlet  species  with  strong  diradical  characters,  the 
reference  wavefunction  is  of  the  uhf  type: 

^(uhfA')  =  '*'(UHF)  +  ^(JT correction) ,  (9) 

where  the  X's  are  the  same  as  those  in  Eq.  (8).  The  UHF-based  wavefunctions  are 
acceptable  wavefunctions  for  doublet  radicals,  triplet  diradicals,  and  high-spin 
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polyradicals  at  or  close  to  the  equilibrium  geometries.  For  singlet  diradicals  and 
low-spin  polyradicals,  however,  they  are  often  heavily  contaminated  by  higher  spin 
states.  The  MP,  cc,  and  QCI  theories  do  not  remove  these  contaminations  involved 
in  the  UHF  reference  function  in  Eq.  (9). 

Schlegel  [12]  has  proposed  a  spin  anihilation  procedure  for  the  UMPn  (n  =  2- 
4)  wavefunctions  based  on  Ldwdin’s  projection  operator  O  [27]: 

0=  n  {S'--JiJ+l)}/{S{S+l)-J{J+\)},  (10) 

1 

where  S  is  given  by  the  difference  between  up  ( A^„)  and  down  (Ni,)  spins,  i.e.,  S  = 
j(A'u  —  Nh)  and  where  the  cases  of  /  =  1  and  1  =  2  correspond,  respectively,  to 
Schlegel  [12]  and  Handy  [13]  projections.  Generally  speaking,  the  spin  projection 
is  necessary  for  all  the  split  pairs  (Tj<  1.0)  in  Eq.  (2).  The  incomplete  projection 
often  suffers  from  the  so-called  size  inconsistent  problem:  The  size  inconsistency 
means  that  the  total  energy  of  a  cluster  consisted  of  n  molecules  is  not  equal  to  n 
times  the  energy  of  one  molecule  even  at  the  dissociation  limit. 

Approximate  but  Size-Consistent  Spin  Projection  Procedure 

A  size-consistent  but  approximate  spin  projection  procedure  was  also  proposed 
[  1 0, 1 1  ].  Let  us  first  consider  the  diradical  case  given  by  Eq.  ( 6 ) .  If  the  contamination 
of  the  singlet  wavefunction  is  assumed  to  be  arisen  only  from  the  triplet  state  in 
the  case  of  the  singlet  diradical,  then  the  singlet  UHF-based  wavefunctions  can  be 
written  as 

'^(uhfX)  =  C,  '4'(puhfA')+  C,  ^4>(puhf  A') .  (11) 

where  ‘'4>(PUHF  X)  denotes  the  UHF X  wavefunction  for  the  pure  q  spin  state  and 
C\  and  C,  are  the  coefficients  of  the  pure  singlet  and  triplet  components,  respectively. 
Because  of  the  second  term  in  Eq.  (11),  the  'Sk(UHF  X)  wavefunction  will  have 
an  expectation  value'  (S*)  of  the  total  spin  operator  different  from  zero.  There*  jre, 
the  coefficient  of  the  triplet  component  can  be  calculated  by 

cr  =  '<s->(uHFAr)/^<S‘>(uHF;ir).  (i2a) 

The  total  energy  of  the  pure  singlet  state  is  approximately  given  by 

'£(APUHFAf)  =  '£(uhfA')  +  /X  ['£(1  HFAf)  -  ^£(UHFAr)] ,  (13a) 

where  APUHF  X  denotes  the  approximate  puhf  X  and  "  £(Z )  denotes  the  total 
energy  for  the  H’  spin  state  by  the  method  Z.  The  correction  factor  /  is  given  by 

/=  cr/(i  -cr)  =  '<s2>(uhfZ)/[XS‘>(uhfZ)-  '(s'XuhfZ)]. 

(14a) 

Equation  ( 13a)  is  applicable  to  polyradicals  [1 1,28]  where  the /factor  is  approxi¬ 
mately  given  by 

/=  ['^(S^XuhfA-)  -5(5+  1)]/[“s<S=>(uhfA')-  '“XS^XuhfZ)]  , 

(14b) 
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where  LS  and  HS  mean  the  lowest  spin  and  highest  spin  states,  respectively.  Since 
the  computation  of  ”  (S^)(uhf  .^f)-value  is  not  easy,  it  is  estimated  by  the  ex¬ 
trapolation  from  the  UMP2  value 

‘‘'(S^>(uhf,Y)  =  ”'<S^>(ump2)  +  g  X  [”'(S*>(ump2)  -  ”'(S‘>(UHF)]  , 

(15) 

g=  [”'£(UHF2f)  -  ‘'£(UMP2)]/t’*£(UMP2)-  ”£:(UHF)].  (16) 

The  approximate  spin  projection  procedure  is  applicable  to  all  UHF-based  post 
Hartree-Fock  wavefunctions  in  the  strong  correlation  regime. 

Mullireference  (MR)  Post  Hartree-Fock  Methods 

The  multireference  (MR)  methods  are  inevitable  for  very  accurate  descriptions 
of  unstable  molecules  in  the  intermediate  correlation  regime  and  excited  states  in 
general  [1].  In  principle,  the  MR  zero-order  wavefunction  can  be  constructed  by 
the  UHF  Cl  within  active  UHF  MOs,  which  are  responsible  for  diradical  and  poly¬ 
radicals  [15].  For  example,  two  occupied  and  two  unoccupied  MOs  are  selected  as 
active  UHF  orbitals  for  diradicals;  the  number  of  active  MOs  are  twice  as  compared 
with  the  spin-restricted  approach.  The  MR  UHFCI  and  related  methods  are  useful 
for  heavy  atom  systems,  where  spin-dependent  interactions  such  as  the  spin-orbit 
interaction  are  particularly  important. 

In  order  to  reduce  the  number  of  active  orbitals,  we  have  proposed  the  use  of 
symmetry-adapted  uhf  no  for  MR  post  hf  calculations  in  the  case  of  spin-free 
molecular  systems  [1,14,15],  where  the  UNOCASCI  wavefunction  is  taken  as  the 
zero-order  wavefunction: 

^'(MR  A")  =  ^(UNO-CASCI)  +  correction) ,  (17) 

where  X  is  given  in  Eq.  (8).  If  the  coupled-cluster  (CC)  single  excitation  operator 
is  employed  for  X,  the  resulting  solution  is  equivalent  to  the  UNO  casscf  wave- 
function  as  discussed  previously  [15].  The  UNO  CASCi  reference  is  also  used  for  the 
UNO  many-body  perturbation  (Pt«)  and  UNO  coupled-cluster  single  and  double 
(CCSD)  method  [15].  The  second-order  uno-pt  method  can  be  applicable  to  rel¬ 
atively  larger  systems  for  which  uno-casscf  and  MR  so  Ci  calculations  are  not 
practical.  Recently  several  partitioning  schemes  of  effective  Hamiltonians  are  pro¬ 
posed  for  efficient  ft  computations  [29,30].  Figure  1  illustrates  computational 
schemes  for  the  ground  and  excited  states  of  unstable  molecules. 


Singlet-Triplet  Energy  Separation  of  Methylene 

Methylene  is  usually  investigated  as  a  calibration  of  several  levels  of  computations 
[31].  The  RHF  solution  of  methylene  is  triplet-unstable  [5,9],  being  reorganized 
into  the  singlet  UHF  solution,  where  the  split  HOMOs  for  the  up  and  down  spins  are 
given  by 
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X  -  MP,  CC(S)D.QCISD(T) 

Figure  1.  Schematic  illustrations  of  theoretical  approaches  to  unstable  molecules  such 
as  singlet  diradicals  and  polyradicals.  The  constructions  or  appropriate  uhf  solutions  are 
particularly  important  in  this  approach.  To  this  end,  uhf  mos  for  fragments  are  useful  as 
starting  orbitals  for  SCF  cycles.  The  natural  orbital  analysis  of  the  uhf  solutions  is  also 
important  to  perform  the  spin-restricted  post  HF  calculations. 


Xho  =  COS  w  cr  +  sin  ft)  TT ,  (2c) 

Vho  ~  cos  ft)  (T  —  sin  ft)  tr .  (2d) 

Figure  2(A)  illustrates  the  shapes  of  molecular  orbitals  in  the  'rhf,  'uhf,  and 
^UHF  solutions.  The  orbital  overlap  Fho  between  the  split  MOs  in  the  'uhf solution 
is  0.537  at  the  singlet  optimized  geometry  (see  below),  and  therefore  the  diradical 
character  y  is  only  17%.  The  puhf  solution  for  the  singlet  state  is  expressed  by  the 
2X2  UNO  Cl  form  as 

^(PUHF)  =  I { Xho^  +  nnoXHo} I  =COS^ft)|(7o|  -  sin^  ft)|7rir|  .  (6c) 

Therefore,  apuhf  post  HF  wavefunctions  such  as  apuhf  ccsd  are  approximately 
equal  to  those  of  two-reference  post  hf  methods  in  Eq.  ( 17). 

The  post  HF  calculations  were  carried  out  for  the  calculated  geometries  [31]: 
(A)  /?(C — H)  =  1.0832  A  and  <HCH  =  132.3°  for  the  triplet  methylene  and  (B) 
R{C — H)  =  1.1 1 18  A  and  <HCH  =  102.4°  for  the  singlet  methylene.  Table  I 
summarizes  the  total  energies  by  the  rhf-  and  UHF-based  6-3 IG*  calculations  and 
the  adiabatic  energy  differences  between  the  singlet  (S)  and  triplet  (T)  states. 

From  Table  I,  the  following  should  be  noted:  ( 1 )  The  singlet  uhf  6-31 IG** 
solution  is  more  stable  by  1 1.6  kcal/mol  than  the  rhf  solution  in  accord  with  the 
triplet  instability  of  the  rhf  solution.  (2)  The  spin  projected  uhf  (apuhf)  solution 
is  less  stable  by  3.7  kcal/mol  than  the  uhf  solution  because  of  the  removement  of 
the  ground-triplet  contaminant.  (3)  The  <S^)-values  for  singlet  UHF-based  methods 
are  about  0.7,  indicating  the  necessity  of  the  spin  projection,  whereas  they  are  about 
2.0  for  the  triplet  UHF-based  methods.  (4)  The  energy  correction  for  uhfqcisd(t) 
by  the  approximate  spin  projection  is  0.9  kcal/mol.  The  energy  difference  between 
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(A) 


singlet(S) 


tripletcn 


(’RHF) 


(’UHF)  (^UHF) 


Figure  2.  ( A )  Energy  levels  are  diradical  orbital  configurations  for  methylene.  ( B)  energy 
levels  of  bis(  methylene)  methane  constructed  of  triplet  methylenes  via  the  through-bond 
coupling  and  the  optimized  geometry  by  the  ump2  6-31G*  method.  (C)  Exchange-coupling 
between  chromium  cation  and  triplet  methylene  and  the  partial  optimized  geometry  by 

the  UMp4  method. 


the  RHFQCISD(T)  and  apuhfqcisd(t)  is  only  0.5  kcal/mol.  This  means  that  rhf 
qcisd(t)  works  well  in  the  case  of  weak  singlet  diradicals.  (5)  The  S-T  separation 
is  calculated  to  be  29  kcal/mol  by  the  energy  difference  between  singlet  rhf  and 
triplet  UHF  energies,  whereas  it  is  21  kcal/mol  by  the  energy  difference  between 
'apuhf  and  ^uhf.  (6)  The  S-T  gap  is  12.6  kcal/mol  by  the  difference  between 
'rhf-  and  ^UHF  QCISD(t)  total  energies,  whereas  it  is  12.7  kcal/mol  by  the  total 
energy  difference  between  'apuhf  and  ^uhfqcisd(t). 

The  S-T  separation  at  the  QCISD(T )  level  is  the  same  as  that  of  extensive  casscf 
MBPT  calculations  [30].  However,  it  is  still  larger  by  0.7  kcal/mol  than  the  full  ci 
value  [31]. 
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Table  I.  Energy  gaps  bei  .een  ns  and  ls  states  calculated  for  methylene* 
by  several  computational  methods.'’ 


Method 

A/:' 

(kcal/mol) 

RHF 

-38.869754 

* 

(29.2) 

UHE 

-38.905961 

-38.933894 

17.7 

APUHF 

-38.882616 

-38.933894 

21.4 

PUHF(/=  1)“ 

-38.8921 19 

-38.936661 

19.4 

PUHF  (/  =  2) 

-38.887879 

-38.936655 

20.5 

rmp2 

-38.985493 

« 

(18.0) 

ump2 

-39.009446 

“39.033121 

14.0 

APUMP2 

-38.987899 

-39.033121 

16.0 

PU,MP2  (/  =  1 )“ 

-38.995325 

-39.0.34977 

15.1 

PUMP2  (/  =  2) 

-38.990961 

-39.034971 

16.3 

RMP4 

-39.010397 

• 

(13.8) 

UMP4 

-39.031977 

-39.053149 

12.3 

APUMP4 

-39.012708 

-39.053149 

13..5 

PUMP4  (/  =  !)<’ 

-39.017608 

-  39.054290 

13.0 

PUMP4  (/  =  2) 

-39.013148 

-39.054284 

14.2 

RHF  CCD 

-39.011107 

* 

(13.4) 

UHF CCD 

-39.035334 

-39.051619 

12.0 

APUCC 

-39.020513 

-39.051619 

1.3.0 

RHFCCDST4 

-39.015589 

* 

(12.5) 

UHFCCDST4 

-39.041659 

-39.054460 

11.6 

APUHF  CCDST4 

-39.030009 

-39.054460 

12.4 

RHFQCISD 

-39.011953 

• 

(13.4) 

UHF  C?CISD 

-39.037480 

-39.052614 

12.0 

APUHF  QCISD 

-39.023706 

-39.052614 

13.0 

RHFQCISD(T) 

-39015763 

* 

(12.6) 

UHFQC1SD(T) 

-39.045098 

-39.054735 

1 1.8 

APUHF  QCISD(T) 

-39,036327 

-39.054735 

12.7 

•  /?(CH)  =  1.0832  A;  ^HCH  =  132.3°. 
*'6-31 1 G**  basis  set. 

'  Af  =  E'(ls)  -  E(hs). 

■’  Level  of  spin  projection. 


Through-Bond  Interaction  between  Triplet  Methylenes 

As  shown  previously  [32],  the  merit  of  the  apuhf  qcisd(t)  is  applicable  to 
relatively  larger  clusters  of  open-shell  species.  As  a  simple  example  of  tetraradicals, 
here  we  examine  bis  (methylene)  methane  2  (HC — CH2 — CH)  which  is  formally 
formed  by  the  through-bond  coupling  of  two  triplet  methylenes  via  the  methylene 
bridge.  There  are  two  singlet,  three  triplet,  and  one  quintet  states  for  2  as  illustrated 
in  Figure  2(B).  The  lower-lying  singlet  (S|),  triplet  (T,),  and  quintet  (Q)  states 
for  2  arise  from  the  exchange  coupling  between  the  ground  triplet  methylene  units 
as  illustrated  in  Figure  2(  B).  The  UHF  wavefunctions  for  the  Q  state  are  approxi¬ 
mately  given  by  the  symmetry  adapted  uhf  nos  as 
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^(Q)  =  ,  (18) 

where  uhf  no  are  given  by  the  bonding  and  iding  c  and  tt  MOs  constructed 
of  the  linear  combination  of  the  a  and  jr  radical  orbitals  of  methylenes  a  and  b: 

'P±i  =  (<r.v±  ff)')/V2,  v>±2  =  (’rA-±  7r)  )/V2  ,  (19) 

where  X ( 7)  denies  the  site  a  or  b.  Therefore  the  corresponding  MOs  for  the  singlet 
UHF  solution  |Xfl|oX/,?/,|  are  approximately  given  by 

Xx  =  cos  a)|  v?)  +  sin  W|  ^-1  =  cos(wi  —  ir/4)  +  cos(w|  +  7r/4)  ffy  ,  (20a) 

^x  =  cos  toi  (p2  +  sin  «2  <p-2  =  cos(a)2  ~  2r/4)  :rx  +  COs(aj2  +  2r/4)  Try  .  (20b) 

The  natural  orbital  (NO)  analysis  of  the  single:  UHF  6-3 IG*  solution  for  2  shows 
that  the  orbital  overlaps  for  <r-  and  rr-type  corresponding  MOs  are  0.0019  and  0.0331, 
respectively:  namely,  wi  =  0)2  =  7r/4,  and  therefore  Xa  s  (7a.  fa  =  X*  =  cr*  and 
=  TT/,.  The  corresponding  MOs  are  essentially  equivalent  to  the  a  and  ir  radical 
MOs  of  methylenes  a  and  b,  respectively. 

Full  geometry  optimizations  of  the  quintet  state  of  2  have  been  carried  out  by 
using  the  energy  gradient  of  the  UMP2  6-3 IG*  wavefunction.  Figure  2(B)  shows 
the  fully  optimized  geometry  of  2.  The  total  energies  of  the  lowest  spin  (LS  =  S|) 
and  highest  spin  (HS  =  Q)  states  of  2  at  the  optimized  geometry  were  calculated 
by  several  post  HF  methods,  and  the  refinements  of  the  total  energies  for  the  LS 
state  were  conducted  by  the  spin  projection  procedure  in  the  second  section.  The 
energy  gaps  between  the  LS  and  HS  states  were  determined  from  these  total  energies, 
being  equivalent  to  6Jah  values  in  the  Heisenberg  Hamiltonian  as  illustrated  in 
Figure  2(B): 

Hhb  =  “  Z  ^JahSa’Sl,  ,  (21a) 

Jah  =  [‘'"fuHFA  -  "®£;uHFa]/[”'<S'>(UHF^)  -  ‘-^<S'>(UHF  )]  ,  (21b) 

where  S(  denotes  the  spin  at  the  site  c{=aoT  b). 

Table  II  summarizes  the  calculated  results.  From  this  table,  the  following  con¬ 
clusions  can  be  drawn:  ( 1 )  The  LS-HS  energy  differences  are  positive  by  all  com¬ 
putational  methods.  Then  bis(methylene)methane  with  the  C2i.  symmetry  is  the 
HS  ground-state  molecule.  (2)  The  energy  differences  by  the  spin-unprojected  UHF 
CCD(UCCD),  QCISD,  qcisd(t),  and  ump4  methods  are  quite  similar.  (3)  The  <S^> 
values  for  the  singlet  post  uhf  wavefunctions  are  about  2.0,  showing  the  necessity 
of  the  spin  projection  to  obtain  the  pure  singlet  state  wavefunctions.  ( 4 )  The  energy 
differences  by  UMP«,  QCISD,  and  QCISD(T)  increase  by  40-50%  after  the  approximate 
spin  projection  of  the  LS  (singlet)  solution.  (5)  The  energy  difference  by  the  UHF- 
based  post  hf  methods  are  close  to  those  of  the  spin-symmetry  adapted  post  HF 
calculations:  namely,  uno-casscf  by  the  use  of  the  four-active  orbitals  and  four- 
electrons  { 4,4 }  and  MRSDCI  based  on  uno-CASSCF  { 4,4 } .  (6)  The  energy  difference 
after  Schlegel’s  spin  projection,  pumpk  (/  =  i ),  becomes  abnormally  large,  indicating 
that  the  single  anihilation  procedure  breaks  down  in  this  system,  and  Handy's 
double  projection  scheme  [13],  PUMPn  (/  =  2),  also  leads  to  the  overestimation  of 
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Table  II.  Energy  gaps  between  hs  and  ls  states  calculated  for  CHCH2CH" 
by  several  computational  methods.'’ 


Method 

A£' 
(cm  ') 

RHF 

-115.606953 

* 

« 

UHF 

-115.726823 

-115.731980 

1132 

APUHF 

-115.724216 

-115.731980 

1704 

PUHF  (/  =  O'* 

-115.076390 

-115.736351 

144810 

PUHF  (/  =  2) 

-115.714942 

-115.736316 

4690 

PUHF  (/  =  3) 

-115.730817 

-:  14.736316 

1207 

CASSCF  {4,4} 

-115.632879 

-115.637639 

1045 

rmp2 

-115.928650 

* 

• 

UMP2 

-116.011102 

-116.016101 

1097 

APUMP2 

-116.008583 

-116.016101 

1650 

pump2(/=  I)" 

-1 15.360641 

-116  019048 

144469 

PUMP2  (/  =  2) 

-115.999185 

-116.019017 

4352 

pump2  (/  =  3) 

-116.015094 

-116.019017 

861 

MRSDCI 

-1 15.938403 

-115  994575 

1355 

rmp4 

-115.984004 

• 

« 

UMP4 

-116.057428 

-116.061969 

996 

APUMP4 

-116.055141 

-116.061969 

1499 

PUMP4(/=  I)*" 

-115.406948 

-116.063810 

144130 

PtiMP4  (/  =  2) 

-116.045485 

-116  063782 

4015 

PUMP4  (/  =  3) 

-116.061418 

-116  063782 

519 

MRSDCI* 

-115.969039 

-115.975804 

1485 

RHF CCD 

-115.978510 

• 

* 

UHF CCD 

-116.049360 

-116.053758 

965 

APUCCD 

-116.047115 

-116.053758 

1452 

RHF  CCDST4 

-115.994211 

• 

« 

UHFCCDST4 

-116.060536 

-116  064864 

950 

APUHF  CCDST4 

-116.058356 

-116.064864 

1429 

RHF  QCISD 

-115.982453 

« 

* 

UHFQCISD 

-116.053440 

- 1 16.057786 

954 

APUHF  QCISD 

-116.051251 

-116.057786 

1435 

RHFQCISD(T) 

-115.994910 

• 

* 

UHFQCISD(T) 

-116.061562 

-116.065875 

946 

APUHF  QCISD(T) 

-116.059390 

-116.065875 

1424 

•  At  the  optimized  geometry  for  quintet  state. 
’’6-310*  basis  set 
'  ^E  =  £(i.s)  -  E(hs). 

*’ '  of  spin  projection. 

'  .  Davidson  correction. 


the  energy  gaps,  as  compared  with  those  of  QCISD  and  MR  SD  Cl,  because  of  the 
multiple  bond  splittings.  The  triple  projection,  PUMPn  (/  =  3),  rather  underestimates 
the  energy  gaps  because  of  the  overstabilization  of  the  ls  state. 

From  these  conclusions,  the  ls-hs  energy  gaps  by  the  UHF-based  post  hf  methods 
(for  example,  uhfqcisd)  followed  by  the  approximate  spin  projection  are  almost 
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equivalent  to  those  of  the  spin-restricted  post  HF  methods  such  as  MR  SD  Cl// 
CASSCF.  Judging  from  the  numerical  data  in  Table  II,  the  apump2  method  is  prac¬ 
tically  useful  for  semiquantitative  calculations  of  the  LS-HS  gaps  [/a/,  in  Eq.  (21b)] 
in  larger  systems. 


Chromium-Methylene  Complex 

Carter  and  Goddard  [33]  have  shown  that  the  high-spin  (^Bi )  state  of  the  chro¬ 
mium-methylene  ion  3  is  calculated  to  be  more  stable  than  the  low-spin  CBi  )  state 
when  the  open-shell  rhf  method  is  employed.  Their  perfect-pairing  (pp)  generalized 
valence  bond  (GVB)  calculations  have  also  failed  to  reproduce  the  greater  stability 
of  the  ''5i  state.  Thus,  the  nature  of  the  naked  chromium-methylene  bond  appears 
to  be  much  different  from  that  of  the  corresponding  carbene  complexe  (CO)5Cr  = 
CXX'  satisfying  the  18-electron  rule,  for  which  the  closed-shell  rhf  description  is 
reasonable.  In  other  words,  the  <r-  and  7r-bonds  between  Cr"^  and  ^CH2(^5i )  in  the 
‘*B,  state  are  not  so  strong  as  described  by  the  closed-shell  bonding  pairs  [33], 
showing  the  triplet-type  instability  [1]. 

The  highest-spin  ( HS)  UHF  solution  for  the  ^Bi  state  is  given  by  the  bonding  and 
antibonding  a  and  tt  natural  orbitals  (NOs)  as 

'i'CB,)^  ,  (22) 

where  K  denotes  the  high-spin  d-electron  part  {dzydx^  -  dxy).  The  spin-polarized 
corresponding  <7-  and  ir-orbitals  are  given  by  the  orbital  mixings  as 

X<,  =  cos  o)  ¥*<,  +  sin  w  <p^,  ri„  =  cos  o}tp„-  sin  w  ,  (6e) 

X,  =  cos  wp,  +  sin  (u  =  cos  w  -  sin  w  .  (6f) 

The  intermediate  {^B\)  and  lowest-spin  (^Bi )  UHF  solutions  are  therefore  given  by 

'ieB,)=  \Kx,ru<Pr^A.  ^{*B,)=  \Kx,rrcX,T.\  ■  (23) 

These  uhf  solutions  were  obtained  by  the  use  of  the  basis  sets:  Tatewaki-Huzinaga 
MlDl-1  supplemented  by  the  4p-AO  with  the  same  exponent  as  that  for  the  4s-ao 
and  by  the  Hay’s  diffuse  d-orbital  (a  =  0.0912)  for  Cr  and  MlDl-1  plus  diffuse 
orbital  for  C  and  4-3 IG  for  H.  The  orbital  overlap  T„  for  the  (T-corresponding  MOs 
in  the  *B\  state  is  0.952,  whereas  it  is  0.302  for  the  ir-corresponding  MOs;  note  that 
T,  is  0.342  by  gvb  pp.  The  tr-bond  is  regarded  as  a  stable  covalent  bond.  On  the 
other  hand,  the  rr-bond  is  a  diradical  pair  with  the  moderate  diradical  character 
(45%).  The‘’5i  state  is  calculated  to  be  more  stable  by  15.1  kcal/mol  than  the*fii 
state  even  at  the  uhf  level,  in  contrast  to  the  gvb  pp  result  [34],  The  *B\~^B\ 
energy  gap  becomes  19.0  kcal/mol  after  the  approximate  spin  projection  (  apuhf) 
by  the  use  of  the  **5]  uhf  energy.  The  latter  value  is  in  accord  witii  the  mcscf 
value  (17.7  kcal/mol)  [35]. 

Figure  2(C)  illustrates  the  energy  optimized  geometry  of  (CrCH2)^  by  ump4. 
The  binding  energy  between  Cr^  and  ^CH2  has  been  calculated  by  the  UHF-based 
post  HF  methods  by  adding  the  /  (a  =  0.9).  d  (a  =  0.6),  and  p  (a  =  0.164) 
polarization  functions  for  Cr,  C,  and  H,  respectively.  Table  III  summarizes  the 
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Table  III.  Total  energies  of  chromium-methylene  caption  and  methylene 
and  binding  energy  between  chromium  cation  and  triplet  methylene  by  post- 
Hartree-Fock  methods. 


Method 

(ClCHiF 

CHz 

BE* 

(kcal/mol) 

UHF 

-1081.590694 

-38.732592 

10.8 

APUHF 

-1081.596224 

-38.732592 

14.3 

PUHF(/=  I) 

-1081.588557 

-38.735148 

7.89 

PUHF  (/  -  2) 

-1081.588764 

-38.735143 

8.02 

PUHF  (/  =  3) 

-1081.588754 

-38.735143 

8.02 

PUHF  (/  =  4) 

-1081.588754 

-38.735143 

8.02 

UNOCAS  {7,7} 

-1081.614535 

-38.735148 

25.8 

ump2 

-1081.783325 

-38.822338 

36.6 

APUMP2 

-1081.791261 

'38.822338 

44.6 

PUMP2(/=  1) 

-1081.781178 

-38.823986 

34.2 

PUMP2  (/  =  2) 

-1081.781385 

-38.823981 

34.4 

PUMP2  (/  =  3) 

-1081.781375 

-38.823981 

34.3 

pump2  (/  =  4) 

-1081.781375 

-38.823981 

34.3 

ump4 

-1081.811054 

-38.838798 

39.5 

APUMP4 

-1081.819342 

-38  838798 

44.7 

PUMP4  (/  =  I) 

-1081.808911 

-38.839782 

37.6 

PUMP4  (/  =  2) 

-1081.809118 

-38.839777 

37.7 

PUMP4  (/  =  3) 

-1081.809107 

-18  839777 

37.7 

PUMP4  (/  =  4) 

-1081 809107 

-38.839777 

37.7 

UHF  CCSD 

-1081.814276 

-38.838552 

42.3 

APUHF  CCSD 

-1081.822728 

-38.838552 

47.6 

UHFQCISD 

-1081.817769 

-38.838585 

44.3 

APUHF  QCISD 

-1081.826810 

-38.838585 

50.0 

CCSD(T) 

-1081.823399 

-38.839856 

46.5 

APUHF  CCSD(T) 

-1081.832696 

-38.839856 

52.4 

uhfqcisd(T) 

-1081.825574 

-38.839871 

47.8 

APUHF  QC1SD(T) 

-1081.835054 

-38.839871 

53.8 

•  Binding  energy. 


total  energies  of  the  chromium-methylene  cation  and  triplet  methylene  and  the 
calculated  binding  energies. 

From  Table  III,  the  following  conclusions  may  be  drawn:  ( 1 )  The  binding  energies 
(bes)  are  1 1  and  14  kcal/mol  by  uhf  and  apuhf,  respectively,  whereas  it  is  26 
kcal/mol  by  uno-CASSCF  { 7,7 } .  (2)  The  (S^)-values  for  the  quartet  UHF-based 
wavefunctions  are  about  4.7,  being  larger  than  the  exact  value  (3.75).  (3)  The  be- 
values  are  37  and  40  kcal/mol  by  ump2  and  ump4,  respectively.  The  binding 
energy  is  45  kcal/mol  by  apump4,  indicating  that  the  spin  projection  effect  is 
significant,  as  expected  from  the  ( )-value.  ( 4 )  The  BE-values  are  47  and  48  kcal / 
mol  by  UHFCCSD(T)  and  qcisd(t),  respectively.  After  the  approximate  spin  pro¬ 
jection,  they  are  52  and  54  kcal/mol,  respectively. 

The  binding  energy  by  apump4  is  similar  to  the  gvb  ci  value  (44  kcal/mol) 
[33].  The  BE'value  by  apuhf  qcisd(t)  is  close  to  the  experimental  value  (53.6 
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kcal/mol )  by  Armcntrout  ct  al,  [36],  The  present  result  indicates  the  importance 
of  both  dynamical  correlation  and  spin-projection  corrections  for  unstable 
dTT-pTT  bonds. 


Concluding  Remarks 

The  spin  projection  is  necessary  for  uiiF-bascd  post  Hartrcc-Fock  wavcfunclions 
[  37  ]  if  systems  under  consideration  have  unstable  chemical  bonds  with  significant 
diradical  and  polyradical  characters.  The  approximately  spin-projected  UHF 
( Ai»iJi  It )  QCISI)( T)  and  CCSD(T)  methods  arc  applicable  to  qualitative  computations 
of  the  i  s-tis  energy  gaps  in  exchange-coupled  open-shell  systems  and  naked  tran¬ 
sition-metal  carbon  bonds.  The  apuhf  mp2  (apump2)  method  is  useful  for  qual¬ 
itative  purpo.se  [II].  The  uno-ca.sscf  method  is  applicable  to  rather  complex  sys¬ 
tems  such  as  organo  transition-metal  systems. 

The  uiiF-ba.sed  post  Jif  methods  arc  particularly  useful  for  theoretical  investi¬ 
gations  of  exchange-coupled  organic  systems  such  as  clusters  of  triplet  carbenes 
[38],  high-spin  oligomers,  clusters  of  stable  nitroxidcs,  etc.,  for  which  the  spin 
polarization  clfects  play  important  roles  in  spin  alignments  (39-42).  These  systems 
are  received  irrcnt  interest  from  the  view  point  of  molecular  magnetism,  for  ex¬ 
ample,  organic  ferromagnet  (43.44).  The  present  computational  .schemes  arc  used 
for  molecular  design  of  organic  magnetic  materials  (45). 
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Abstract 

We  investigate  the  intramolecular  dynamics  of  the  ground  state  and  the  first  three  singlet  '2  excited 
states  of  boron  hydride  (BH)  using  the  split-operator  method.  Ab  initio  calculations  show  that  these 
states  have  strong  nonadiabatic  couplings,  resulting  in  a  complex  topology  of  avoided  crossing  regions 
and  double  well  potentials.  We  attempt  to  find  a  method  to  populate  the  second  minimum  of  B  ''S* 
enabling  experimental  observation  of  its  vibrational  states.  ©  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

The  first  four  ‘S*  states  of  BH  have  been  studied  spectroscopically  by  various 
authors  [1],  For  each  of  the  states,  at  least  one  vibrational  level  has  been  observed. 
There  are  several  ab  initio  studies  of  BH  [  2  ] .  These  ab  initio  calculations  show  that 
the  '2  *  potential  curve  has  a  double  minimum.  The  vibrational  levels  which  are 
localized  in  the  outer  minimum,  however,  have  not  yet  been  observed  experimen¬ 
tally.  In  this  contribution  we  are  looking  for  a  method  to  populate  vibrational  levels 
in  the  second  minimum  of  the  B  state.  The  frequencies  associated  with  tran¬ 
sitions  from  these  states  are  sufficiently  different  from  those  originating  from  the 
inner  minimum  that  they  can  be  easily  discriminated. 

The  next  section  presents  the  fft  split-operator  method  and  the  Hamiltonian 
used  in  our  calculations.  In  the  third  section,  we  give  short  description  of  the  elec¬ 
tronic  states  involved  and  the  method  used  to  calculate  them.  We  will  discuss  our 
results  in  the  fourth  section.  It  appears  that  a  direct  excitation  process  cannot  create 
the  required  population  of  the  outer  minimum.  The  procedure  we  investigate  con¬ 
sists  of  three  steps.  In  the  first  one  we  create  an  electronic  excitation.  The  fifth 
section  contains  the  conclusions  to  be  drawn  from  this  contribution. 

Time  Propagation 

We  consider  a  one-dimensional  model  for  BH,  neglecting  effects  of  rotation. 
Four  electronic  states  of  the  diatom  are  included:  the  X  '2'*^  electronic  ground  state 
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and  three  (closely)  coupled  excited  electronic  states,  5  ‘2^,  C  ‘2^,  and  £  '2'^  in 
order  of  increasing  dissociation  energy.  We  study  the  interaction  of  the  vibronic 
degrees  of  freedom  BH  with  an  ultraviolet  laser  and  a  strong  infrared  laser  (applied 
both  separately  and  simultaneously).  In  what  follows,  we  will  use  atomic  units 
unless  stated  explicitly. 

The  time  evolution  associated  with  the  nuclear  motion  is  obtained  as  a  solution 
of  the  time-dependent  Schrodinger  equation  (tdse): 

-  -  - 

=  +  +  (1) 

where  'i'  is  a  four-component  wave  function,  one  for  each  of  the  four  electronic 
states  mentioned  above, 

4 

<«i 

Each  'I',  represents  the  nuclear  motion  on  the  associated  electronic  potential.  Pro¬ 
jecting  on  the  electronic  states  ij,  leads  to  the  set  of  four  coupled  equations 

‘  ^  =  2  (Hij  +  fiijEaiO  cos{ut))\l/j ,  (3) 

where  X  represents  all  the  electronic  coordinates  and  R  is  the  intemuclear  distance. 
The  different  matrix  elements  are  given  by 

H.j{^R,-^'j-U,iR)  +  2UR)-^  +  g,j{R),  (5) 

H,  =  Hj,  (6) 

The  various  potentials  and  coupling  are  given  by  the  formulae 

g,(R)  =  {l/2m)(v,{R)\d^/dR^\vj{R))x  ,  (7) 

f,j{R)  =  {l/2m)(r,,{R)\d/dR\vj{R))x,  (8) 

I/y(i?)  =  <*;,(£)|^dln//?)>x,  (9) 

with  m  the  reduced  mass  of  the  molecule  and  ii,j  the  matrix  elements  of  the  dipole 
operator.  (  )x  means  integration  over  X. 

The  equation  will  be  solved  numerically  on  a  one-dimensional  grid  with  the 
initial  condition  that  one  component  i/j  is  an  adiabatic  vibrational  state  x)  and  the 
other  components  are  zero.  The  vibrational  eigenstates  are  obtained  using  a  shooting 
method  [3]  based  on  the  Numerov  algorithm. 

The  propagation  is  done  using  the  fft  split-operator  method  (for  comparison 
with  other  methods,  see  Refs.  [4]).  The  key  concept  in  the  split-operator  technique 
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is  to  approximate  the  evolution  operator  over  a  small  time  interval  as  a  product  of 
exponentials 

U{t,  /  4-  c)  =  T  exp^-/ 

as  cxp^-(^  rjexp^-t  F(.y)J  exp|-t^  rj  ,  (II) 

which  is  correct  up  to  second  order  in  e  [5].  This  way,  one  can  use  a  different 
electronic  basis  set  for  the  kinetic  energy  operator  and  the  potential  energy  operator, 
each  one  optimal  for  the  exponentiation  of  the  respective  operator.  While  for  the 
ease  of  two  electronic  states  special  formulae  exist  [6,7],  the  exponentiation  of  the 
matrix  operators  in  general  is  most  effectively  done  by  straightforward  diagonal- 
ization  [8,9], 

The  Born-Oppenheimer  or  adiabatic  basis  is  the  most  natural  to  use.  However, 
the  nonvanishing  derivative  coupling  terms ftj  inhibit  exponentiation  since  there  is 
no  transformation  on  nuclear  coordinates  which  can  diagonalize  a  matrix  containing 
these  terms.  It  is  also  known  tnat  the  functions  gn  and  ftj  are  sharply  peaked  when 
evaluated  in  the  Born-Oppenheimer  basis.  In  order  to  correctly  represent  them, 
one  cither  has  to  use  very  dense  or  nonequidistant  grids.  Neither  of  both  alternatives 
is  computationally  efficient  [10],  To  solve  these  problems,  one  can  introduce  an 
alternative  set  of  electronic  functions,  a  so-called  diabatic  basis.  In  this  basis*  the 
multiplicative  coupling  functions  vary  smoothly  with  R  and  the  derivative  couplings 
arc  negligible.  Without  the  derivative  couplings,  the  kinetic  energy  operator  can  be 
exponentiated  by  Fourier  transformation  to  the  momentum  representation  where 
it  becomes  a  multiplicative  operator.  In  our  calculations  we  use  the  quasidiabatic 
basis  as  defined  by  Cimiraglia  et  al.  [1 1,12]. 

The  BH  Potentials 

The  calculations  in  this  paper  are  performed  using  ab  initio  potentials.  Other 
authors  have  used  ab  initio  potentials,  c.g.,  to  study  the  dissociation  dynamics  of 
Cl  NO  [13],  FNO  [14,15],  and  H2O  [16].  The  potentials,  coupling  functions  and 
dipole  moments,  were  described  in  Ref,  [12].  As  can  be  seen  in  Figure  1,  the  first 
excited  state  appears  to  have  a  double  minimum.  It  dissociates  into  B(^S,  3s)  and 
H(^S)  but  acquires  a  strong  ionic  character  in  the  outermost  well.  The  second 
excited  state  exhibits  an  avoided  crossing  and  a  double  minimum.  It  dissociates 
into  B(^D,  2s2p^)  and  H(^S),  In  the  outer  minimum  its  character  is  a  mixture  of 
B^H“  and  B(^P,  3p)  while  in  the  interior  region  it  has  the  same  character  as  in 
the  dissociation  region.  The  third  excited  state  undergoes  two  avoided  crossings, 
one  with  the  ionic  state  and  one  with  the  B(^P,  3p)  state. 

As  mentioned  in  the  previous  section,  our  calculations  are  performed  in  the 
quasidiabatic  basis.  The  first  step  in  the  construction  of  this  basis  is  the  calculation 
of  a  zeroth  order  adiabatic  basis.  The  adiabatic  basis  was  obtained  by  a  SCF-CI 
calculations  at  some  20  internuclear  distances,  starting  from  1.2  a.u.  A  gaussian 
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Figure  1 .  Adiabatic  potentials  for  the  first  four  'S'*  states  of  BH.  Values  in  atomic  units. 


atomic  basis  set  was  used,  similar  to  that  of  Jaszunski  et  al.  [17].  Some  diffuse  s 
and  p  functions  were  added  to  correctly  represent  higher  Rydberg  terms  of  the  B 
atom  and  the  H'  anion.  Electron  correlation  was  treated  by  the  CIPSI  multireference 
perturbation  algorithm  [18,19].  Spline  interpolation  was  used  to  obtain  the  cal¬ 
culated  functions  at  intermediate  values  of  R.  For  further  details  we  refer  to  Ref. 
[20],  The  zeroth-order  diabatic  basis  is  obtained  by  unitary  transformation.  The 
transformation  is  such  that  it  maximizes  the  overlap  of  the  zeroth-order  diabatic 


Figure  2.  Diabatic  potentials  for  the  first  four  '2*  states  of  BH.  Values  in  atomic  units. 
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functions  with  a  set  of  independent  diabatic  reference  functions.  Quasidegenerate 
perturbation  theory  is  then  applied  to  obtain  the  final  diabatic  basis  set.  For  a 
detailed  description  we  refer  to  Refs.  [11,12]. 

In  comparing  Figures  1  and  2,  respectively,  one  notices  the  rather  large  differences 
between  adiabatic  and  diabatic  potentials.  The  diabatic  ground  state  minimum  is 
less  deep  and  less  tight  than  the  corresponding  diabatic  one.  The  order  of  the  second 
and  the  third  state  is  interchanged  between  the  two  representations.  The  rather 
large  interaction  energies  ( Fig.  3),  in  particular  /lu,  are  consistent  with  this. 

The  dipole  moments  and  the  dipole  transition  matrix  elements  n,j  are  also  ab 
initio  results.  The  presence  of  the  ionic  contribution  in  the  electronic  wavefunction 
is  clearly  visible  in  these  quantities.  In  the  regions  where  the  electronic  states  are 
characterized  as  ionic,  the  dipole  moment  is  large  and  varies  almost  linearly  with 
distance. 


Results  and  Discussion 

In  what  follows,  we  investigate  several  possibilities  to  populate  the  second  min¬ 
imum  of  B  '2^  by  combination  of  electromagnetic  and  nonadiabatic  interactions. 
Experimentally  the  population  of  vibrational  states  in  the  outer  well  should  be 
observable.  We  assume  that  the  electric  field  is  parallel  to  the  molecular  axis  and 
that  the  electric  dipole  approximation  is  valid.  Two  types  of  fields  are  used  through¬ 
out  the  calculations:  monochromatic  fields  (instantaneously  switched  on)  and 
gaussian  pulses.  The  parameters  of  the  field  are  chosen  using  physical  arguments. 
We  did  not  use  any  technique  to  optimise  the  field  as,  e.g.,  developed  by  Tannor 
and  Rice  [21,22]  and  also  by  Rabitz  and  coworkers  [23-26]. 


Figure  3.  Off-diagonal  matrix  elements  of  the  Hamiltonian  m  the  diabatic  basis.  Values 

m  atomic  units. 
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It  appears  that  one  cannot  directly  excite  a  vibrational  state  of  the  outer  minimum. 
The  Franck-Condon  factors  are  too  small:  there  is  no  appreciable  overlap  between 
the  respective  wavefunctions  due  to  the  difference  in  position  of  the  minima 
(dinner  2.3  a.u.,  /?outcr  5.8  a.u.).  Only  the  vibrational  states  of  B  which 
are  significantly  localized  in  the  inner  well,  can  be  excited  directly  from  the  ground 
state.  To  create  population  in  both  wells,  we  will  have  to  consider  highly  excited 
vibrational  states  of  The  oscillator  strengths  /[27]  of  the  corresponding 

transitions  show,  however,  that  these  states  are  also  not  accessible  by  direct  em 
excitation  from  the  ■  ibronic  ground  state  (/ as  10''“). 

An  alternative  pa*nway  of  occupying  B  '2^  is  through  excitation  to  the  other 
electronic  states  and  subsequent  nonadiabatic  population  transfer.  From  our  cal¬ 
culations  we  see  that  excitations  to  vibrational  states  of  C'2^  are  effective  in  pop¬ 
ulating  B  '2^  appreciably,  starting  from  v  =  A.  Again  the  oscillator  strengths  [27] 
do  not  permit  direct  transitions.  They  vary  in  magnitude  from  10“’  to  10“*.  The 
only  remaining  possibility  is  an  electronic  excitation  to  E  '2^.  Oscillator  strengths 
leave  us  no  choice  but  to  make  a  transition  to  the  vibrational  ground  state  (/ « 
10“^  for  this  transition).  We  performed  a  first  calculation,  modeling  an  instanta¬ 
neous  monochromatic  transition  from  .T  '2'^  to  £  '2'^,  by  taking  tp  =  (0,  0,  0, 
xS)  as  initial  condition.  The  calculation  does  not  include  an  explicit  interaction 
with  an  external  field.  Figure  4  shows  the  time  evolution  of  the  occupation  prob¬ 
ability  of  the  electronic  states.  One  can  see  that  the  maximum  of  PiiO  is  about 
20%  of  the  initial  occupation  probability  of  £  '2"^.  It  is  also  clear  that  there  is  a 
large  recurrence  of  the  system  to  its  initial  state  after  approximately  Ips.  To  approach 
more  realistic  experimental  conditions,  which  will  use  a  pulsed  laser  to  perform 
this  excitation,  we  performed  a  second  calculation.  Here  we  used  \p  =  (X?,  0, 0, 0) 
as  an  initial  condition,  and  we  included  explicitly  the  interaction  with  the  external 
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Figure  4.  Occupation  probability  of  electronic  states  as  a  funaion  of  time.  No  interaction 

with  an  external  held. 


523 


'2*  STATES  OF  BORON  HYDRIDE 

field  (w  =  0.28  a.u.).  The  pulse  we  use  is  a  Gaussian  with  an  fwhm  of  50  fs  and  a 
peak  intensity  of  0.4  GW/cm^.  The  pulse  is  centered  at  /  =  100  fs.  Figure  5  shows 
the  occupation  probabilities  of  the  electronic  states  as  a  function  of  time.  Clearly 
the  behavior  is  similar  to  the  previous  calculation,  apart  from  a  time  delay  and  a 
scale  factor  reflecting  the  oscillator  strength  for  the  transition.  Also  PaH)  starts  from 
zero  and  grows  to  its  maximum  during  the  time  the  field  is  active  instead  of  starting 
from  its  maximum  value  as  in  Figure  4.  The  order  of  magnitude  of  F2  ('«10“‘*) 
seems  small  but  is  nevertheless  sufficient  for  spectroscopic  purposes.  So  far,  we 
only  studied  the  time  evolution  of  the  total  occupation  probability  of  the  electronic 
states.  By  projecting  ^2(0  on  the  adiabatic  vibrational  states  of  B  'S^,  we  learn 
that  ^2  ««  X2®.  The  populations  of  the  other  vibrational  states  are  at  least  2  orders 
of  magnitude  smaller,  xi*  is  not  localized  in  one  of  the  two  wells  exclusively.  Our 
purpose  is,  however,  to  create  a  nonnegligible  population  of  some  vibrational  states 
located  in  the  outer  well  only. 

Next  we  will  investigate  the  adiabatic  vibrational  deexcitation  of  the  B  '2"^  state 
through  an  applied  IR  field.  Collisional  deexcitation  is  not  a  relevant  process  in 
view  of  the  time  scales  involved.  The  highest  vibrational  state  which  is  located 
clearly  in  the  outer  well  is  the  xj®  state.  Transitions  to  this  state  create  the  required 
population.  Therefore,  we  focus  momentarily  on  X2®  to  optimize  our  field  param¬ 
eters.  The  selected  transition  can  be  induced  with  and  without  the  use  of  intermediate 
levels.  Also  the  number  of  intermediate  levels  can  be  varied:  The  transition  frequency 
can  be  chosen  to  be  resonant  to  a  single  photon  transition  (ft  =  £,6  -  £10)  or  to 
an  n-photon  process  [ft  =  (£16  -  £io)/n].  Testing  various  possibilities,  we  found 
that  the  2-photon  process  was  more  effective  than  the  3-  or  6-photon  process.  This 
we  investigated  using  monochromatic  radiation.  Figure  6  shows  the  evolution  of 
the  occupation  probability  of  xi*  and  xi®  under  the  application  of  a  gaussian  laser 


Figure  5.  Occupation  probability  of  electronic  states  as  a  function  of  time.  Interaction 
with  gaussian  laser  pulse. 
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Figure  6.  Time  evolution  of  the  occupation  probabilities  of  vibrational  states  using  a 

gaussian  pulse. 


pulse  with  an  fwhm  of  500  fs,  a  peak  intensity  of  40  GW/cm^  centered  at  t  = 
750  fs.  The  main  difference  with  the  monochromatic  results  is  a  decrease  of  the 
maximum  of  the  curve  by  about  10%  and  a  decrease  in  the  global  period.  The 
maximum  in  the  curve  still  reaches  about  70%  of  the  initial  occupation  of  xi*, 
which  is  sufficiently  high  for  the  purpose  of  observation. 

Until  now  we  have  studied  the  effect  of  excitation,  deexcitation  and  nonadiabatic 
coupling  of  the  electronic  states  separately.  Here  we  will  present  the  results  of  a 
combined  calculation.  The  initial  condition  is  ^  =  (X?,  0,  0,  0).  We  used  two 
gaussian  pulses.  The  uv  pulse  has  the  same  characteristics  as  in  the  calculation 
presented  previously.  The  IR  deexcitation  pulse  (w  =  0.0045  a.u.)  has  an  fwhm  of 
400  fs  and  is  centered  at  r  =  750  fs.  From  Figure  7  we  see  that  we  have  succeeded 
in  populating  xj®  to  *=  10'^.  Moreover,  we  can  see  that  after  the  IR  pulse  has  died 
out,  the  population  of  xi®  stays  constant  while  the  population  of  xi*  starts  to 
oscillate  again  due  to  the  nonadiabatic  interactions  with  the  other  electronic  states. 
The  population  of  X2°  is  4%  of  the  total  population  of  the  outer  minimum  on  the 
average.  The  interaction  with  the  IR  field  also  causes  the  total  population  of  the 
outer  minimum  to  be  increased  by  about  a  factor  of  2  (Fig.  8),  reaching  a  time 
averaged  value  of  1.2  X  10“'*.  From  these  two  facts  we  may  conclude  that  it  should 
be  possible  to  observe  the  vibrational  states  of  the  second  minimum  experimentally. 


Conclusions 

Using  ab  initio  potentials,  coupling  functions  and  dipole  functions  we  have  ex¬ 
amined  the  intramolecular  dynamics  BH  of  the  first  four  electronic  states. 
Looking  for  a  procedure  to  populate  the  second  minimum  of  '2^,  we  found  that 
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Figure  7.  Time  evolution  of  the  occupation  probabilities  of  vibrational  states  using  a 
gaussian  uv  (excitation)  pulse  and  a  gaussian  IR  pulse  (deexcitation). 


it  is  not  possible  to  perform  a  one-step  excitation  to  the  outer  minimum.  We  pro¬ 
posed  the  following  procedure:  excitation  by  a  uv  laser  pulse  to  the  £  '2^  state. 
The  nonadiabatic  interactions  cause  the  B '  2  *  to  be  populated  in  a  high  vibrational 
state.  The  second  step  in  our  procedure  is  a  vibrational  deexcitation  by  an  infrared 
laser  pulse.  We  succeed  in  increasing  the  total  population  of  the  outer  well  and 
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Figure  8.  Time  evolution  of  the  occupation  probabilities  of  the  outer  minimum  using  a 
gaussian  uv  (excitation)  pulse  with  and  without  a  gaussian  ir  pulse  (deexcitation). 
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creating  a  stable  population  of  vibrational  states  which  are  predominantly  localized 
in  it.  From  the  results  of  the  combined  calculation,  however,  we  can  see  that  there 
is  still  much  to  be  done  on  the  optimisation  of  the  parameters  (e.g.,  width  of  the 
IR  pulse,  maximum  intensities  of  and  tim<*  delay  between  the  pulses). 
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Abstract 

A  variation  of  the  usual  semiclassical  short  wavelength  (eikonal )  method  was  recently  applied  to  model 
calculations  of  electronically  diabatic  atom-atom  collisions.  Microreversibility  was  computat  onally  im¬ 
posed  on  the  dynamics  by  following  trajectories  initiating  in  the  ground  and  excited  states  simultaneously. 
When  improved  transition  probabilities  for  several  two-state  systems  were  obtained  in  this  mcnner,  a 
multistate  application  was  selected.  Self-consistent  eikonal /averaged  effective  potential  (sce/aep)  oiate 
to  state  calculations  are  performed  for  rotationally  inelastic  HeH2  scattering  at  total  energies  of .  I  and 
.9  eV.  using  an  (8,2)  basis  (7  states).  Phase  shifted  amplitudes  are  introduced  that  add  constraints  to 
the  coupled  differential  equations  and  reduce  cpu  time.  Definite  parity  partial  and  total  cross  sections 
compare  favorably  to  the  previous  semiclassical  coupled  states  (scs)  results  of  Billing  ( 1978)  as  well  as 
to  both  the  quantum  adp j.  and  CC  results  of  Shimoni  and  Koun  ( 1977 ).  ©  1993  John  Wiley  &  Sons.  Inc. 

Introduction 

For  several  decades  the  difficulty  of  applying  purely  quantum  dynamical  methods 
to  all  but  a  handful  of  chemical  systems  has  inspired  the  development  of  various 
semiclassical  and  quasiclassical  approaches  [  1  ] .  Among  the  more  serious  semiclass- 
ical  attempts  is  an  eikonal  formalism  presented  with  some  clarity  by  Micha  in  1983 
[2],  This  method,  with  refinements,  has  been  applied  to  polyatomic  photodisso¬ 
ciation  [3],  electronically  diabatic  gas-surface  scattering  with  [4]  and  without  [5] 
surface  motion,  rotationally  and  electronically  inelastic  scattering  of  NO  from  an 
Ag(  111)  surface  [6],  atom-atom  elastic  scattering  angular  distributions  [7],  and 
atom-atom  collision  electronic  transitions  [8]. 

Near  quantitative  agreement  with  formally  exact  methods  was  achieved  in  the 
electronical!'  active  atom-atom  investigation  by  strictly  imposing  microreversibility 
on  the  semiclassical  calculations.  The  improvement  was  initially  demonstrated  with 
three  different  electronically  diabatic  two-state  models.  It  was  accomplished  by 
running  the  excitation  and  de-excitation  calculations  simultaneously  along  a  near 
classical  trajectory  formed  by  combining  the  two  processes.  Because  a  major  ad¬ 
vantage  of  the  eikonal  method  is  that  it  can  be  extended  without  any  fundamental 
changes  to  accommodate  a  theoretically  unlimited  number  of  degrees  of  freedom, 
the  natural  next  step  is  to  apply  the  improved  version  to  generic  multistate  systems. 
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A  select  case  is  rotationally  inelastic  atom-diatomic  scattering.  This  is  the  simplest 
case  where  the  matrix  elements  are  well  known  [9],  having  been  exactly  derived 
from  the  laws  of  vector  addition  [10].  It  has  a  further  advantage  that  the  energy 
levels  are  closely  spaced,  a  condition  generally  favoring  semiclassical  treatments. 

As  far  back  as  1977  Billing  performed  similar  calculations  for  He — at  collision 
energies  below  the  threshold  for  vibrational  excitation  [  1 1  ] .  His  efforts  even  included 
an  ad  hoc  means  for  dealing  with  microreversibility.  The  initial  velocity  was  taken 
to  be  the  mean  of  the  velocities  for  the  initial  and  final  states.  While  this  approach 
does  not  achieve  genuine  microreversibility  as  the  forward  and  reverse  probabilities 
are  not  identical,  it  does  come  close.  It  is  highly  effective  and,  along  with  related 
variations,  has  been  found  useful  [12].  Using  the  same  potential  and  general  de¬ 
velopment  as  Billing  allows  us  to  directly  compare  results. 

Billing  endorsed  what  he  called  the  semiclassical  coupled  states  method  (scs) 
based  on  three  assumptions: 

(i)  decoupling  of  the  rotational  projection,  m,  states; 

(ii)  classical  treatment  of  translation; 

(iii)  an  effective  potential  and  velocity  is  used. 

The  eikonal  treatment  presented  here  departs  from  these  assumptions  only  in  that 
it  uses  an  improved  effective  potential  and  no  effective  velocity.  The  advantage  is 
that  microreversibility  is  precisely  achieved  while  eliminating  choices  based  on 
intuition.  Also,  the  formalism  is  developed  in  a  way  that  allows  uniform  application, 
avoiding  the  need  for  reinterpretation  with  each  new  problem. 

Assumption  (i)  refers  simply  to  the  coupled  states  (CS)  approximation  itself..  We 
do  not  take  a  position  on  that  here,  since  the  purpose  is  to  explore  an  improved 
semiclassical  approach  to  dynamics  which  could  be  applied  in  either  a  coupled 
states  or  close  coupling  (cc)  context.  The  literature  is  replete  with  discussions  of 
CS  calculations  [13].  While  the  savings  in  cpu  time  is  expected  to  be  less  dramatic, 
some  savings  may  be  obtained  by  using  a  variable  time  step  differential  equation 
solver  and  semiclassical  solutions  in  place  of  exact  solutions  in  a  CC  calculation. 
This  is  left  for  future  investigation. 

Assumption  ( ii )  is  the  minimalist  semiclassical  approximation.  Quantization  of 
translational  motion  is  least  likely  to  change  the  results  in  a  practical  sense.  Eikonal 
treatments  generally  include  this  feature,  as  do  most  semiclassical  methods. 

Assumption  ( iii )  considers  two  approximations,  the  effective  potential  and  ef¬ 
fective  velocity.  An  effective  potential  is  used  to  self-consistently  couple  quantum 
coefficients  and  the  classical  motion.  This  provides  a  method  for  step-wise  generation 
of  the  coefficients  from  initial  conditions  and  the  progress  of  the  modified  (near-) 
classical  trajectory  that  proceeds  from  the  modified  (effective)  potential.  While 
there  may  be  some  variation  in  the  expressions  used  for  the  coefficients,  and  dif¬ 
ferences  in  the  differential  equations  also  may  appear,  the  effective  potential  is 
seminal  to  both  SCS  and  Self-Consistent  Eikonal  (SCE)  calculations.  The  present 
work  uses  the  recently  developed  averaged  effective  potential  (aep)  in  place  of  the 
usual  effective  potential  (ep)  used  in  scE  calculations. 


EIKONAL  TREATMENT  OF  He— Hj  SCATTERING 


529 


The  effective  velocity  is  another  matter.  In  the  scs  approach,  the  initial  velocity 
is  chosen  to  be  the  average  of  the  initial  and  final  state  velocities,  which  is  imposed 
as  an  initial  condition  of  the  differential  equations.  In  SCE  calculations  the  initial 
potential  energy  subtracted  from  the  total  energy  gives  the  initial  kinetic  energy, 
which  has  the  initial  velocity.  No  effective  velocity  is  imposed. 

This  articl'"  'S  divided  into  seven  sections.  Following  the  introduction,  the  semi- 
classical  fom  .jism  originally  employed  in  Billing’s  work  is  presented  and  the  cs 
equations  and  some  simple  insights  are  given.  Eikonal  formalism  and  the  modifi¬ 
cations  to  Billing’s  approach  that  were  used  in  the  present  calculations  are  then 
discussed  followed  by  parameters  and  various  other  details  of  the  potential.  Some 
results  are  shown  and  discussed,  including  alternative  eikonal  equations  that  reduce 
the  consumption  of  cpu  time.  The  final  section  presents  conclusions,  general  dis¬ 
cussion,  and  plans  for  future  applications. 


Semiclassical  Equations  (Billing) 

Following  Billing  [11],  using  assumptions  (i),  (ii),  and  (iii)  gives  the  following 
set  of  equations  for  the  amplitudes  a„ 


ihda=  Z  ^.n-exp 


(£a  -  Ea’)t 


fla' 


(2-1) 


where  represents  the  vibrational /rotational  eigenstate,  and  a  is  the  collection 
of  quantum  numbers  (i),y,  m). 

The  potential  matrix  elements  are  expressed 

=  (1^)"'  2  <«l(^-  ONt;'>/*,[£(/)]^</:/00|y0><//:/7i'0|ym> 

(2-2) 

where  R  is  the  A  to  5C  (He  to  H2)  separation,  r  the  BC  (H2)  bond  length,  and  f 
the  equilibrium  value  of  r.  The  first  bracket  on  the  right  describes  vibrational  tran¬ 
sitions  and  the  other  two  are  Clebsh-Gordon  coefficients,  for  rotations.  The/i,  (/?) 
were  obtained  by  expanding  the  Krauss-Mies  potential  surface  [14]: 

(r-r-yP, (cosy)  (2-3) 

(.ft 


where  is  a  Legendre  polynomial  and  y  is  the  angle  between  R  and  r. 

The  effective  potential  combines  transition  amplitudes  and  potential  elements: 


Feflr(£,  /)  =  Z  exp 


(2-4) 


Because  Vca  is  real  and  determines  the  relative  motion,  where  is  the 

reduced  mass  of  ^4  +  BC  (He  +  H2),  P  is  the  relative  momentum,  /  is  the  orbital 
angular  momentum  quantum  number  and 
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R  =  dH,ff/dR  =  ti-'P, 


P=-dfhfr/dR  =  -^  + 


H 


nR 


3  ’ 


(/  +  '/2)^  is  the  Langer  modification  for  /(/  +  I )  [15],  and 


(2-5) 


^eff  -  f  0 

2n 

=  {\/2ui)(Pl  +  Pl)+  K,r(R,0, 


where 


P^  = 


'  +  2 


R‘ 


(2-6) 


Billing  uses  an  effective  velocity  (u)  to  determine  the  initial  value  of  P, 

-^(tinmal)  ^  P l?(tinitial)  ”  » 

V  =  (2Uln)''\ 


where 


U=^-{2E-E,j-  E,y  +  2[(£  -  £,,)(£  -  E.y)]'^-}  .  (2-7) 

This  is  the  semiclassical  approach  presented  by  Billing  for  rotationally  inelastic 
He — H2  in  1 977.  In  a  1 976  article  [  1 6  ]  he  gave  simpler  expressions  for  the  coefficient 
derivatives  and  effective  potential  that  are  consistent  with  those  presented  by  this 
author.  The  more  involved  equations  were  probably  developed  for  computational 
efficiency,  which  this  author  handles  in  a  slightly  different  manner.  The  effective 
velocity  does  seem  somewhat  problematic.  If  there  is  to  be  an  effective  velocity 
there  will  be  an  effective  energy  {U)  of  some  sort  so  the  first  two  lines  in  (2-7) 
follow  naturally.  The  particular  form  given  to  U  in  the  third  line  of  (2-7)  was 
determined  in  some  sense  by  intuition.  Under  different  circumstances  one  might 
devise  a  different  choice  for  U  that  gave  better  results,  leaving  the  overall  concept 
less  grounded  in  generality.  Other  effective  velocity  treatments  have  similar 
problems. 

The  notion,  moreover,  that  microreversibility  should  lead  to  some  kind  of  average 
velocity  is  genuinely  insightful.  The  trajectory  ought  not  only  to  commence  with 
that  description,  but  conclude  that  way  as  well.  While  this  approach  ensures  that 
things  start  out  right,  there  is  nothing  in  the  calculatic”’’:  design  to  compel  average 
trajectory  behavior  at  the  detector.  In  spite  of  this,  the  amazing  stability  of  these 
kinds  of  semiclassical  calculations  allows  the  effective  velocity  method  to  produce 
vastly  more  useful  results  than  the  entirely  unmodified  one.  This  is  particularly 
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remarkable  in  light  of  the  fact  that  the  initial  amplitudes  are  in  no  way  adjusted  to 
conform  to  the  radically  compromised  initial  velocity.  The  extended  Hamilton¬ 
like  equations  [(2-1 )  and  (2-5)]  are  symmetrically  balanced  so  as  to  gravitate  away 
from  any  strong  deviations.  In  what  follows,  a  very  small  number  of  additional 
modifications  give  a  significantly  enhanced  account  of  semiclassical  scattering. 


Coupled  States  Equations  (ADP/^) 

One  adaptation  of  the  coupled  states  approach  is  the  ADPj:  method  ofShimoni 
and  Kouri  [17].  Each  total  angular  momentum  (J)  cross  section  is  taken  as  the 
sum  of  even  and  odd  parity  contributions, 

+  <ri,d(y-»/)-  (3-1) 


The  even  parity  cross  section  is 


^  even 


^{2J+  1)  ^  1 

^i/(2y  +  1 )  x-o  Jm  +  1 


\5„ 


and  the  odd  parity  cross  section  is 


<Todd 


<2J+  1)  ^ 

kl{2j+l)  x4wt. 


1  ~  ^X.O  IS  I  ■> 

Jm 


AL  =  2;  (3-2) 


AL  =  2.  (3-3) 


where 


2^ 

klj  =  -^  (£■  -  E..,),  j,„  =  min(;,7').  and;.,,  =  max(;,/) . 

n 

While  a  thorough  discussion  of  these  equations  would  be  quite  lengthy,  the  general 
intent  is  readily  encapsulated: 

Parity  is  familiar  from  quantum  mechanics.  Transitions  between  parity  states 
may  or  may  not  be  allowed.  It  is  useful  therefore,  in  determining  state-to-state  cross 
sections,  to  take  parity  into  account. 

The  square  modulus  of  the  amplitude,  |  b„„'  -  a„„- \  is  the  transition  probability. 
Cross  sections  are  intuitive  physical  representations  of  the  likelihood  of  a  transition. 
They  are  proportional  to  the  probability. 

The  appearance  of  vr  results  from  integration  over  the  azimuthal  angle.  Intuitively, 
the  cross  section  is  the  area  of  a  circle.  That  area  is  given  by  ir  times  the  square  of 
the  radius  in  which  collisions  are  determined  to  have  occurred. 

Cross  sections  are  inversely  proportional  to  the  square  of  initial  relative  momen¬ 
tum  ( in  this  case  the  wave  number  ^,.,)-  This  is  clear  because  the  slower  the  projectile, 
the  longer  it  spends  in  the  interaction  region,  the  greater  the  chance  for  the  target 
to  be  influenced,  and  the  larger  the  cross  section.  The  converse  applies  to  faster 
projectiles. 

Each  J  state  is  ( 27  +  1 )  degenerate.  Each  of  these  degenerate  states  has  the  same 
energy,  but  may  have  a  different  orientation  (spatial  representation)  and/or  com- 
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bination  of  orbital  (/)  and  rotational  {j)  components  from  the  rest.  The  factor  (2J 
+  1 )  is  included  to  weight  the  cross  sections  accordingly. 

The  precise  mathematics  that  places  (2j  +  1 )  in  the  denominator  is  beyond  the 
scope  of  this  discussion,  but  the  idea  is  to  keep  from  over  counting  the  j  degeneracy. 
Remember,  J  and  j  are  related,  each  j  state  leads  to  a  different  J  state,  so  the  j  states 
are  already  included  once  in  the  J  states.  Some  of  them  explicitly  reappear  in  the 
sum  over  X  (degenerate  j  states  are  m  states;  m  states  contributing  to  the  CS  rep¬ 
resentation  are  labeled  X).  The  j  (and  /)  degeneracy  is  also  taken  into  account  in 
the  expression  for  K„„-  [eq.  (2-2)].  Apparently  if  not  placed  in  the  denominator, 
it  would  appear  once  too  often. 

The  sum  over  X  relates  directly  to  the  coupled  states  approximation.  In  close 
coupling  calculations  all  possible  m  -*■  m'  transitions  are  explicitly  taken  into  ac¬ 
count.  This  means  that  different  solutions  are  required  for  every  set  of  (m,  m') 
quantum  numbers,  all  (27ma,  +  1)^/2  of  them  [18]!  j  -*■  f  cross  sections  are 
obtained  by  adding  over  all  of  these  m  -*■  m'  contributions.  The  coupled  states 
approach  postulates  that  the  details  of  the  m  -►  m'  transitions  are  relatively  un¬ 
important  in  looking  at  j  -►  f  transitions.  Setting  the  total  magnetic  quantum 
number  to  zero,  M  =  m  +  m'  =  Q,  decouples  the  m  states  and  eliminates  these 
details.  Nullifying  M  means  that  m  =  -m’  and  by  convention  X  =  |m|  =  |m'|. 
There  are  only  (2 yma*  +  1  )/2  values  of  X  and  only  that  many  solutions  are  used 
(recall  that  K„„'  =  When  cs  calculations  are  appropriate  they  save  tremendous 
amounts  of  computer  time. 

The  sum  over  alternate  values  of  L  amounts  to  a  pre-averaging  or  smoothing 
technique.  Before  it  was  developed  individual  transition  probabilities  were  directly 
summed  over  X.  The  problem  was  that  these  tended  to  vary  irregularly  leading  to 
unstable  results.  For  <rtv„(<7^d)  there  are  +  1  Um)  values  of  L  included  in  the 
AL  sum.  Inclusion  of  the  factor  I/Um  +  1 )  for  even  parity  states  and  ( 1  -  5)/Ja/ 
for  odd  parity  states  simply  produces  local  (in  L  or  J)  averages  of  subsets  of  prob¬ 
ability  elements.  Taking  an  even  parity  example  with,  sayjm  =  5  and  7=6,  there 
is  an  average  over  6  alternate  probability  elements  ranging  from  L  =  1  to  L  =  11 
instead  of  the  single  L  =  6  element.  After  the  summation  over  alternate  values  of 
L  is  performed,  every  entry  presents  an  average  rather  than  a  single  datum.  This 
clearly  does  stabilize. 

Finally,  the  identical  coupled  states  equations  are  applied  in  either  the  exact  or 
semiclassical  context.  If  the  amplitudes  (a„„')  are  obtained  by  exact  methods  one 
has  the  usual  cs  calculation;  if  they  are  obtained  semiclassically  one  has  an  SCS 
calculation.  That  is  the  entire  difference. 

Virtually  limitless  volumes  of  scientific  literature  have  been  devoted  to  the  coupled 
states  approximation  and  its  application.  The  interested  reader  is  enthusiastically 
referred  to  these.f 

Eikonai  Formalism 

Integrating  over  the  electronic  variables  leaves  an  R-dependent  form  of  Schroe- 
dinger’s  equation. 


t  Kouri's  chapter  on  approximate  rotational  methods,  Ref.  (18).  lists  156  articles. 
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2m 


+  \{R)-E 


m)  =  0 , 


(4-1) 


where  V  refers  to  the  classical  potential  and  R  is  the  interparticle  separation.  The 
eikonal  wave  function  is  defined  in  two  steps.  First,  separate  \p  into  exponential 
and  pre-exponential  factors, 


\p  =  X(R)  exp 
Then  associate  S  with  the  classical  action, 

5(f?)  =  J  P{R)dR, 


so 


P  =  dStdR  . 

Putting  eqs.  (4-2)  and  (4-3)  in  eq.  (4-1 )  gives 

_  Al  h—  — 

2m  dR^  ‘  m  dR 

.  h  dP  P^  „  ^  „ 

2m  dR  2m 


(4-2) 


(4-3) 


(4-4) 


A  simplified  form  of  this  equation  may  be  obtained  by  introducing  diagonal  matrices 
lF(F,/«)and/:(i?); 


W,„(P,  R)  =  V,„(/?)  +  /Uf/?)  -  K(/?) , 


(K)r'r  —  ^FT 


2m) 


d^Xr 

dR^ 


U(i?)  =  :^^  1  . 
2m  dR 


(4-5) 


Equation  (4-4)  becomes 


m  I  dR 


X(R)  =  0 


(4-6) 


which  is  still  exact. 

The  effective  potential,  Fqu(P,  R),  is  obtained  exactly  by  left  multiplying  eq.  (4- 
4)  by  x^,  add’ng  the  adjoint  of  the  resulting  expression  to  itself,  and  solving  for 
E.  Since  E  is  the  total  energy  and  P^I2m  is  the  kinetic  energy  the  remaining  term 
is  the  effective  potential: 
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^  ^  R)  , 


W.  m  -  (X-X)-  [x*vx  +  X  -  X- 1) 

(d^X*  +5^X\1 

2m\dR-  ■ 


In  practice  one  uses  a  short  wavelength  approximation.  As  the  momentum  in¬ 
creases  at  higher  energies  the  wavelength,  which  is  inversely  proportional  to  the 
momentum,  decreases  and 

IdHldR^  <  \dxldR(Plh)\  «  |X(P/ft)^|  .  (4-8) 

Applying  these  inequalities  to  eq.  (4-6)  gives 

+  x(/?)  =  o. 

m  I  dR 

y^{R)  =  V,{R)  \  +  /U(/?), 

F,(i?)  =  (X^X)-'X'^V(/?)X  (4-9) 

Since  R  =  R{t)  the  function  X(i?)  may  be  expressed  as  some  function  of  time.  The 
row  matrix  A(t)  is  defined  to  satisfy 


xlR(t)]  =  A(/)e'"‘ 


Also  V(R)  =  K[f?(/)],  so  placing  eq.  (4-10)  in  eq.  (4-9)  gives 


y|  + V[/?(0]  A(/)  =  0. 


(4-10) 


(4-11) 


The  initial  value  of  Aj(t)  =  1  in  the  initial  channel  and  zero  in  all  others.  Since  V 
is  Hermetian 

d{A*k)/dt  =  0 


A^A  =  1 


(4-12) 


at  all  times.  A  is  the  general  probability  amplitude  in  the  eikonal  formalism.  If  V 
corresponds  to  the  rotationally  inelastic  potential,  i.e.. 


Vll  =  Vaa'  +  5„„'£'a  , 

then  A  corresponds  to  a  from  above  [see  eqs.  (2-1 )  and  (2-4)], 


(4-13) 


A  =  a, 


(4-14) 
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so 

ihda  =  2  iKa’  +  &a,,’E„)aa' .  (4-15) 

a' 


Notice  since 


x-^vx 

x^x 


A^VA 
A -"A 


=  A^VA . 


the  effective  potential  for  rotationally  inelastic  scattering  becomes 

Kit=  Z  a*y„„'a„' . 

aa' 


(4-16) 


(4-17) 


These  coefficient  and  effective  potential  equations,  sans  the  complex  exponential 
factors,  conserve  energy  precisely  and  are  more  directly  evaluated. 

The  discussion  up  to  now  concerned  the  SCE/ep  formalism  (Self-Consistent  Ei- 
konal / Effective  Potential).  The  SCE/aep  equations  (Self-Consistent  Eikonal/ Av¬ 
eraged  Effective  Potential )  were  developed  to  impose  microreversibility  on  the  cal¬ 
culations.  To  do  this,  the  excitation  and  de-excitation  calculations  are  combined. 
The  potential  governing  this  new  calculation,  the  aep,  is  the  mean  of  the  usual 
effective  potential  for  the  forward  reaction  and  the  usual  effective  potential  for  the 
reverse  reaction  at  time  /, 


,Va  . 

*aa  ^  ••ftnr 


V  /  —  ' 

*^a  a 


•o  a**a  a 


p=  ^  ""  (4-18) 

Notice  that  there  are  now  two  complete  sets  of  coefficients,  a„„’  and  a„'„.  One  set 
starts  with  an  amplitude  of  unity  in  the  a  state  and  zero  in  all  others  while  the 
other  set  starts  with  unity  amplitude  in  the  a'  state  and  zero  in  the  rest.  In  the 
diabatic  representation,  where  this  work  is  done,  the  high  symmetry  of  the  sce/ 
AEP  equations  allows  the  calculations  to  be  performed  in  about  the  same  cpu  time 
as  SCE/EP  calculations,  despite  the  increased  number  of  differential  equations.  The 
improved  quality  of  results  is  remarkable.  scE/  aep  calculations  demonstrate  semi- 
quantitative  to  quantitative  agreement  with  calculations  based  on  exact  methods. 

The  Potential 

When  Billing  expanded  the  Krauss-Mies  potential  as  in  eq.  (2-3),  the  following 
fki's  were  obtained  [11]: 
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foo  =  aCe  “0^,  foi  =  Ce  -  c) , 

fix  =  Ce~‘'°'^(abaiR  +  cd) ,  fio  =  abCe~'''>'^ , 

f22  =  a\abR^  +  uxcdl^  , 

where 

a  =1.10041,  ^  =  0.1825,  C  =  198.378  eV.  , 

c  =  0.9855  A"',  i/  =  0.27506,  ao  =  3.518A-', 


and 


a,  =  1.145  A"^ 


In  general  one  might  evaluate 


Ec.j  =  hoOi 


(i^  +  ^)--v,(n  +  ^)  +Ej 


(5-1) 


where  «,  and  are  Morse  parameters  for  the  diatomic  and  Ej  is  the  rotational 
state  spacing.  In  the  present  work  correct  spectroscopic  values  were  preferred  [19). 

Since  collision  energies  were  taken  below  the  vibrational  threshold  (and  also 
vibrational  transitions  were  not  considered )  it  was  feasible  to  evaluate  the  vibrational 
factors.  <n|(r  -  r)'|i;'),  see  eq.  (2-2),  within  a  rigid  rotor  approximation.  This  is 
the  limit  where  vibrational  and  rotational  motions  are  decoupled.  Billing’s  hydrogen 
molecule  values  for  the  matrix  elements  defined  [20] 


= 


<t>°(u)4>'iiu){ulry  du  , 


u  =  r-F,  (5-2) 

are  given  in  Table  1.  The  0?  are  eigenfunctions  in  the  vibrational  equation 


fm  = 


-7- T^  +  U(r) 
2m  or 


0?  =  E,4>° 


(5-3) 


Table).  ,</?;’ for  Hi  119). 


/  =  0 

1  =  1 

/  =  2 

1.00 

0.0304 

0.0152 

0.0 

0.119 

0.0122 

1.00 

0.0944 

0.0527 

0 

0 

1 


0 

I 

I 
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where  m  is  the  reduced  mass  of  H2  and  u(r)  is  the  intramolecular  (Morse  H2) 
potential.  The  mathematics  for  evaluating  A/J,'-!.  is  described  in  Ref.  [21].  It  is 
helpful  to  keep  in  mind  that  the  need  be  multiplied  by  F  to  recover  the  length 
in  nonreduced  units.  In  the  vibrating  rotor  model  the  AfJ,'i  represent  unperturbed 
states  which  are  subsequently  modified  with  correction  (perturbation)  terms.  This 
is  necessary  to  describe  vibrational  transitions  and  some  purely  rotational  transitions, 
exs.,  large  basis  set  calculations,  large  \j,  collision  energy  above  vibrational  thresh¬ 
old. 

Clebsh-Gordon  coefficients  convert  to  3  j  symbols, 

f^'  =  l)-'/'0,72W,m2U-A/>,  (5-4) 

\mi  m2  M) 


where  M  =  +  m2.  Subroutines  for  evaluating  3  j  symbols  are  fairly  common. 

The  potential  elements  are  now 

V„AR)  =  (2j'+\)1m[‘IfUR) 

a 


A-(-l) 


JJ  k  A^'  k  A 

\0  0  0/\-A  0  X/ 


(5-5) 


Figure  1  shows  the  diagonal  elements,  -I-  used  in  these  HeH:  calculations. 
For  an  (8,2)  basis,  that  is,  j  equals  0,  2, 4,  6,  and  8  in  the  ground  vibrational  state 
(v  =  0)  and  j  equals  0  and  2  for  the  first  excited  vibrational  state  (r  =  1 ),  there  are 
seven  diagonal  elements.  Quite  noticeable  is  the  near  degeneracy  of  the  (iy‘=)  08 
and  10  surfaces.  This  follows  because  the  spectroscopic  states  are  separated  by  only 
8.3  cm"'  (Table  11). 

Since  coupling  between  rotational  states  disappears  for  1  Ay  1  >  2  the  combination 
of  diagonal  and  off-diagonal  elements  within  a  vibrational  level  form  a  tri-diagonal 
matrix.  In  addition  there  is  nonzero  coupling  between  both  like  and  adjacent  ro¬ 
tational  states  in  different  vibrational  levels.  Figure  2  shows,  for  exam,  te,  nonzero 
coupling  between  the  02  state  and  the  adjacent,  00  and  04,  states  and  the  02  state 
and  the  vibrationally  excited,  10  and  12,  elements.  For  an  (8,2)  basis  there  are  22 
off-diagonal  nonzero  potential  elements.  Of  course,  since  only  1 1  of 

these  are  unique.  There  are  19  different  potential  elements  for  this  particular  basis, 
which  happens  to  be  quite  small. 


Results 

Rotationally  inelastic  SCE  scattering  calculations  were  performed  on  the  HeH2 
system  in  the  body  fixed  frame  at  total  energies  of .  1  and  .9  eV.  ( neglecting  ground- 
state  vibrational  energy)  using  an  (8,2)  basis.  The  standard  choice  function 
RjJ)  =  ^ was  used,  as  it  was  for  the  original  semi-classical  calculations  performed 
by  Billing  [11],  The  selection  of  choice  function  is  a  very  intricate  matter,  but 
apparently  within  the  adp formalism  there  is  little  effect  other  than  a  small  shift 
of  along  the  J  axis.  Since  the  purpose  was  to  test  the  applicability  of  the  scE/ 
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2.00  2.75  3.50  4.25  5.00 

R  (Bohr) 

Figure  I.  Diagonal  He— Hj  potential  elements,  +  E,,  for  an  (8,2)  basis. 
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Table  II.  Molecular  constants  and 
vibrotational  eigenstates  of  Hj  [19], 


tn  0.504  a.u. 
f 0.7438  A 
w, 4416.7  cm"* 
wpt,  127.4  cm"’ 

Spectroscopic  slates 


t? 

J 

(cm"') 

0 

0 

2176.5 

0 

2 

2530.9 

0 

4 

3345.3 

0 

6 

4591.4 

0 

8 

6229.4 

1 

0 

6337.7 

1 

2 

6674.4 

AEP  method  to  rotationally  inelastic  collisions,  previously  developed  parameters 
and  constants  were  left  unchanged. 

An  Adams-Basforth-Moulton  predictor  corrector  type  differential  equation  solver 
with  documented  reliability,  DE  [21],  was  used  with  both  error  parameters  set  to 
1  X  10~’.  These  could  have  been  adjusted  to  as  large  as  10"^  or  10"*  without  much 
effect,  but  these  particular  calculations  are  not  very  time  consuming  so  extra  care 


0.50  1. 1 2  1.75  2.38  3.00 

R(Bohr) 

Figure  2.  Off-diagonal  He  -H:  potential  elements,  V„„.  for  an  (8,2)  basis.  Coupling  to 

the  C2  state. 
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could  be  afforded  the  testing  stage.  A  tnicro-Vax  II  was  selected  because  code  de¬ 
velopment  is  more  facile  than  with  more  massive  computers  and,  again,  the  model 
calculations  are  abbreviated.  Experience  has  enabled  the  addition  of  a  number  of 
nice  touches  to  the  code,  but  it  has  never  been  fully  optimized.  Future  large-scale 
applications  will  necessitate  optimization. 

Interestingly,  when  the  eikonal  expressions  for  the  time  dependent  amplitudes 
were  developed,  eqs.  (4-15).  removal  of  the  exponentials  [  see  eqs.  ( 2-  i )  ]  amounted 
to  a  reduction  in  constraints  on  the  differential  equations,  resulting  in  computations 
that  require  considerably  more  cpu  time.  More  useful  differential  equations  start 
with  the  eikonal  expressions  and  define  new  amplitudes.  c„, 

where 

/?)  +  £„.  (6-1) 
Substituting  for  and  solving  for  dcjdi  in  eq.  (4-15)  gives 

2  (6-2) 

One  immediate  difference  is  the  elimination  of  the  self-dependent  terms,  those  in 
which  the  variation  in  Ca  depends  on  c^.  Inclusion  of  these  terms  allows  for  more 
play  in  the  differential  equations.  Apparently,  false  variation  of  the  derivatives  can 
be  consistent  with  compensating  false  variations  of  entire  sets  of  the  self-dependent 
coefficients.  Decreasing  the  step  size  eliminates  the  problem,  but  increases  the  cpu 
requirement.  Of  course  additional  differential  equations  are  required  for  the  action 
expressions, 

J  (fF„  -  H;.)  dt  =  j  V„„  dt-  \  V„,„.  dt  +  (E„  -  E„.)t .  (6-3) 

but  even  with  these  included,  considerable  computer  time  is  saved  (see  Table  Ill). 
Because 


Table  HI.  Integral  cross  sections  at  .9  eV. 


VJ  vf 

Integra!  cross  sections  (1 1]  (bohr^) 

Relative  cpu  time 
SCE/AEP  eqs 

SCE/AEP 

(SCS) 

cc 

ADP/j 

16-2) 

(4-15) 

00 -*02 

II  94 

(12.35) 

:  1.342 

12.001 

1.4 

8.0 

02-*04 

3.62 

(3.75) 

3.739 

3.596 

3.6 

11.1 

00-*04 

0.81 

(0.83) 

0.806 

0.800 

1.0* 

2.8 

388  s  actual  cpu  time  on  a  VAX-3100  under  VMS  5.2. 
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r 


(6-4) 


probabilities  are  not  affected. 

Although  the  numerical  experiments  performed  by  Billing  were,  by  and  large, 
reproduced  here,  nothing  more  than  a  platform  for  making  reaoonable  comparisons 
was  intended.  Some  of  the  constants  and  ps  rameters  used  in  the  SCE  code  came 
from  outside  the  body  of  the  1977  article;  Billing  undoubtedly  carried  a  different 
number  of  significant  figures  in  many  cases;  he  solved  his  differential  equations  by 
different  means;  and  finally,  he  used  the  SCS  formalism.  Our  primary  objective  was 
(like  his)  favorable  comparison  with  exact  results.  It  is  nevertheless  comforting  to 
see,  in  Figure  3,  how  well  the  first  successful  SCE/EP  calculation  of  this  study  agreed 
with  the  unmodified  SCS  results.  It  is  not  possible  to  predict  from  this  that  the  SCE 
and  SCS  results  will  always  agree,  but  it  is  fair  to  say  that  the  approach  to  the 
problem  has  been  correctly  translated  to  the  eikonal  formalism. 

Figure  4  shows  the  same  scattering  event  as  Figure  3,  but  here  SCE/aep  results 
are  compared  directly  to  exact  adp  and  CC  results.  In  a  practical  sense  agreement 
between  the  three  is  quite  good.  Both  the  semiclassical  and  exact  adp  results  are 
a  trifle  low  at  high  J  but  this  is  expected.  Before  the  adp  j;  method,  CS  calculations 
generally  failed  completely  in  this  region.  For  jmm  =  0  there  is  no  odd  parity  con¬ 
tribution  to  the  cross  section,  that  is,  a  =  (Teven  [  see  eq.  ( 3-3 )  | .  Figure  5  demonstrates, 
for  j  =  2 -*■  j'  =  A  dX  E  =  .9  eV.,  the  same  quality  agreement  for  an  odd  parity 
calculation  as  found  in  the  even  parity  case  in  Figure  4.  The  middle  curve  in  Figure 
6  shows,  for  the  same  transition  as  in  Figure  5,  agreement  between  the  sce/aep 
cross  section  summed  over  parity,  a  =  (Teven  +  iTodd*  and  CC  results  [17].  Only  a 
few  points  were  tabulated  in  the  literature,  but  those  are  convincing. 


SCE/EP 

unmodified  SCS 


J  (Total  Angular  Momentum) 


Figure  .t  <r-'(0  -»  2)  at  /;■  =  .1  eV 


542 


COHEN  AND  FAMINI 


J  (Total  Angular  Momentum) 

Figure  4.  <t''(0 2)  at  £  =  .1  eV. 


Figure  6  is  interesting  for  two  other  reasons  as  well.  First,  Billing’s  article  includes 
a  similar  figure,  but  for  a  series  of  SCS  calculations.  Because  his  results  are  not 
tabulated  only  a  qualitative  comparison  of  the  two  figures  is  possible.  Even  so, 
agreement  between  the  sets  of  results  is  apparent.  Second,  the  results  themselves 
have  informative  trends.  The  middle  curve  is  for  the  same  magnitude  transition  as 
the  upper  curve,  \j—f\  =  2,  but  initiates  in  an  excited  (y  =  2),  rather  than  the 


J  (Total  Angular  Momentum) 


Figure  S.  aiM(.2  -*■  4)  at  £  =  .9  eV. 
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SCE/AEP(00->04) 

SCE/AEP(02->04) 

SCE/AEP(00->02) 

+  cc‘^ 


J  (Total  Angular  Momentum) 

Figure  6.  a-'(0j-*-0 j')  at  £  =  .9  eV. 


ground  (j  =  0),  state;  hence  the  smaller  cross  section.  The  bottom  curve  initiates 
in  the  ground  state  but  has  a  larger  transition,  Aj  =  4;  it  has  the  smallest  cross 
section. 

Table  III  presents  further  evidence  of  the  effectiveness  of  the  SCE/aep  calculations. 
It  compares  the  areas  under  the  curves  in  Figure  6,  the  total  (  <t)  as  opposed  to  the 
partial  (ff^)  cross  sections,  with  exact  adpJ^  and  cc  results.  Agreement  is  more 
than  satisfactory.  The  SCS  results  (shown  in  parenthesis)  are  also  quite  good.:^ 

Columns  5  and  6  in  Table  III  show  the  relative  computer  times  required  for  the 
SCE/AEP  calculations  performed  using  eqs.  (6-2)  and  (4-15),  respectively.  As  dis¬ 
cussed  above,  calculations  from  eqs.  (6-2)  are  considerably  faster.  The  00  -*■  02 
calculation  takes  longer  than  the  00  -*•  04  calculation  because  it  is  carried  to  higher 
J.  The  02  04  calculation  takes  much  longer  than  the  others  because  it  basically 

has  to  be  repeated  for  every  X  from  0  to  {jmm  =)  2  [eqs.  (3-2)  and  (3-3)].  Exact 
ADPj^  and  SCE/AEP  ADP calculations  have  not  yet  been  performed  on  the  same 
computer  so  we  are  unable  to  report  on  their  relative  epu  times.  The  savings  afforded 
by  semiclassical  calculations  does  not  seem  to  be  a  major  concern  in  the  literature 
as  it  is  not  mentioned.  Shimoni  and  Kouri  [17]  took  2.5  h  of  epu  time  for  the  cc 
calculations  and  9  min  for  the  exact  adp^i  calculations  on  a  Univac  1 108.  That’s 
a  relative  speed  of  16.7  to  1. 


t  This  raises  a  delicate  point.  Our  present  purpose  is  not  to  critique  or  detract  from  the  fine  work  that 
has  preceded  us.  SCS  calculations  are  generally  most  serviceable  as  are  a  number  of  other  semiclassical 
approaches  The  essential  eikonal  concept  is  simple,  direct,  and  flexible.  With  rigorous  and  creative 
development  it  affords  additional  insight  and  expanded  opportunities  for  application. 
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Conclusion 

An  improved  eikonal  calculation  was  performed  on  rotationally  inelastic  HeH2 
scattering  at .  1  and  .9  eV.  We  have  shown  how  semiclassically  obtained  amplitudes 
are  incorporated  into  definite  parity  adp coupled  states  partial  and  total  cross 
sections,  gave  a  nonrigorous  view  of  the  eikonal  formalism,  explained  how  the 
averaged  effective  potential  (aep)  imposes  microreversibility  on  state-to-state  cal¬ 
culations,  and  related  the  equations  to  rotationally  inelastic  scattering.  We  have 
also  shown  how  additional  constraints  could  be  placed  on  the  calculation  through 
the  use  of  phase-shifted  amplitudes,  thereby  reducing  cpu  time  requirements  several 
fold  and  that  the  calculations  had  in  fact  been  performed  correctly  by  favorably 
comparing  the  cross  sections  obtained  with  published  semiclassical  coupled  states, 
exact  coupled  states,  and  close  coupling  results. 

This  research  has  been  entirely  successful  in  every  respect  save  one.  One  can 
envision  the  same  sort  of  semiclassical  calculation  performed  so  that  microrever¬ 
sibility  is  imposed,  not  only  on  the  state-to-state  transition  being  examined,  but  all 
transitions  simultaneously.  Of  course,  this  would  demand  the  use  of  N  complete 
sets  of  coefficients  rather  than  just  2  (where  N  is  the  number  of  states  included  in 
the  basis). 

While  it  is  unfortunate  that  attempts  at  this  sort  of  calculation  have  not,  to  date, 
been  successful,  the  situation  is  not  entirely  black  and  white.  First,  they  would 
require  considerably  more  computer  time,  very  possibly  with  only  slightly  improved 
results.  On  the  other  hand,  they  may  prove  to  converge  with  smaller  basis  sets. 
Even  if  they  do,  however,  the  decrease  in  basis  will  almost  certainly  not  outweigh 
the  increase  in  cpu  time.  Second,  the  design  of  this  calculation  carries  its  own 
conditions.  Each  set  of  coefficients  starts  with  the  probability  in  a  different  state. 
The  resulting  calculation  necessarily  solves  for  all  possible  transitions,  only  sub¬ 
sequently  providing  information  regarding  particular  transitions.  Such  “super  cal¬ 
culations”  are  a  lovely  thought,  but  they  present  an  undesired  increase  in  the  com¬ 
plexity  of  the  problem.  Since  there  is  no  obvious  way  to  perform  panicular  state- 
to-state  calculations  whose  coefficients  satisfy  microreversibility  between  all  states, 
the  simple  two  state  microreversibility  imposed  on  the  HeHj  calculations  above 
represents  an  acceptable  compromise. 

With  all  this  in  mind,  it  is  nonetheless  interesting  to  think  about  “super  calcu¬ 
lations”.  What  is  causing  the  calculations  attempted  so  far  to  fail?  The  answer  may 
lie  in  an  extension  of  the  falsely  compensating  setwise  variation  of  the  coefficients 
discussed  in  the  preceding  section.  In  that  case  the  solution  would  lie  in  imposing 
additional  conditions  to  restrain  random  variation  in  the  coupled  differential  equa¬ 
tions.  This  is  a  thought  for  future  consideration. 

Another  possibility  is  to  somehow  string  together  a  series  of  simple  two  state 
microreversible  calculations  to  lead  to  an  improved  result — to  theoretically  work 
backward  from  the  desired  transition  so  that  when  actually  performing  the  calcu¬ 
lations  in  sequence  the  final  calculation  somehow  comprises  an  average  over  av¬ 
erages.  The  details  haven’t  been  worked  out  yet,  but  parallel  computing  techniques 
could  make  this  sort  of  approach  very  effective. 
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The  prognosis  for  eikonal  applications  is  excellent.  Programs  are  already  being 
developed  that  account  for  vibrational  as  well  as  rotational  transitions.  Postcollisional 
reorientation  effects  are  also  being  examined.  Eventually  the  focus  will  shift  to 
larger  systems. 
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Abstract 

To  assess  the  flexibility  of  a  molecule  (and  to  establish  how  it  is  affected  by  external  conditions),  one 
can  analyze  the  nature  of  the  configurations  encountered  along  molecular  dynamics  (  md)  trajectories. 
If  the  molecule  is  initially  at  a  rigid  conformation,  then  it  will  mostly  undergo  deformations  which 
conserve  the  initial  fold  or  shape.  That  is,  a  rigid  molecule  will  exhibit  very  small  shape  fluctuations 
about  the  initial  shape,  whereas  large  fluctuations  will  be  charactenstic  of fle.xihle  molecules.  Establishing 
the  persistence  of  molecular  shape  features  along  md  trajectones  is  a  valuable  piece  of  information  in 
the  analysis  of  dynamic  phase  transitions  in  large  biomolecules,  including  protein  folding  and  polymer 
melting.  In  this  work,  we  discuss  a  methodology  to  monitor  macromolecular  shape  along  dynamic 
trajectones.  The  procedure  uses  desenptors  which  convey  some  global  shape  features,  including  the 
compactness  and  degree  (and  complexity )  of  entanglement  in  a  backbone.  The  method  is  used  to  follow 
shape  fluctuations  in  chain  molecules  (carboxylic  acids)  and  cyclic  molecules  (cycloalkanes),  with  6  up 
to  12  carbon  atoms.  The  role  of  vanable  temperature  and  number  of  atoms  on  these  fluctuations  is 
analyzed.  The  results  indicate  a  relatively  constant  average  shape  in  the  cyclic  molecules,  although  the 
amplitude  of  the  shape  fluctuations  increases  with  the  temperature.  In  contrast,  the  shape  of  the  chain 
molecules  is  affected  largely  at  high  temperature,  where  folding  or  “melting"  becomes  dominant.  At  low 
temperature,  the  chains  are  very  ngid.  In  the  high  temperature  regime,  the  shape  fluctuations  in  chains 
are  large  and  very  similar  to  those  in  cycles  In  other  words,  the  distinction  between  configurations  of 
cycles  and  chains  starts  to  blur  at  high  temperature,  thus  suggesting  that  chains  are  mostly  folded  into 
turns.  The  results  would  appear  not  to  depend  very  strongly  on  the  functional  group  at  the  end  of  the 
chain.  S'.  1993  John  Wiley  &  Sons.  Inc 


Introduction 

Computer  simulation  of  molecular  dynamics  (md)  [1]  permits  the  analysis  of 
microscopic  molecular  phenomena  within  the  picosecond  domain.  Experimental 
data  on  nuclear  motions  from  neutron  scattering  [2-6],  nuclear  magnetic  resonance 
[7-9],  and  Mdssbauer  spectroscopy  [3,10,11]  suggest  that  md  provides  reliable 
information  on  the  details  of  the  structural  reorganizations  in  large  biomolecules 
[12],  The  knowledge  of  amplitudes  and  frequencies  of  individual  and  collective 
nuclear  motions  is  essential  towards  the  understanding  of  processes  such  as  protein 
folding  [13-19]  and  various  other  configurational  transitions  [3-6,20-23], 

The  characteristics  of  the  configurational  “phases”  found  in  macromolecules  are 
more  related  to  the  fluctuations  in  the  overall  molecular  shape  than  to  the  detailed 
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changes  in  individual  nuclear  positions  [20-22].  Distinct  “phases”  can  be  char¬ 
acterized  by  the  dominant  presence  of  either  stretched  or  entangled  configurations, 
as,  for  instance,  helices  or  random  coils  [22],  unfolded  or  folded  structures  [13- 
19],  and  compact  or  swollen  configurations  [24] .  Similarly,  configurational  phases 
can  be  described  in  terms  of  the  rigidity  and  flexibility  of  molecular  chains  or 
clusters,  as  it  is  the  case  of  melting  in  crystalline  polymers  [20,21],  or  the  dynamic 
liquid /glass  transitions  in  the  hydration  shells  of  some  enzymes  [3,11,25].  The 
recognition  of  these  configurational  changes  requires  the  analysis  of  folding  features 
as  a  function  of  time  and  external  variables  (e.g.,  temperature,  pH).  Therefore,  it 
is  necessary  to  use  adequate  molecular  “order  parameters,”  whose  values  along  MD 
trajectories  may  indicate  the  persistence  of  certain  shapes.  Some  commonly  used 
geometrical  parameters,  such  as  r.m.s.  deviations,  provide  a  limited  shape  description 
which  is  relative  to  a  reference  configuration  and  not  very  discriminating.  Folding 
is  a  large-scale  (global)  property  which  is  not  well  conveyed  by  the  local  details  in 
the  nuclear  geometry.  In  this  work,  we  discuss  and  apply  a  new  approach  which 
quantifies  global  molecular  shapes  along  a  MD  trajectory.  Especially  important  for 
the  analysis  of  configurational  phase  transitions,  the  persistence  of  shape  features 
over  time  and  the  fluctuations  in  the  shape  is  estimated. 

Our  procedure  introduces  a  descriptor  of  folding,  the  probability  of  overcrossings 
in  the  rigid  3D  placements  of  a  molecular  backbone  [26,27].  The  methodology 
extends  a  previous  idea  of  using  the  molecular  geometry  to  derive  topological  char¬ 
acterizations  of  protein  motifs  [  28,29  ] .  The  value  of  the  descriptor  gives  a  measure 
of  compactness  in  a  fold;  the  fluctuations  in  this  value  characterize  the  flexibility 
of  the  molecular  backbone  with  respect  to  changes  in  its  fold.  Earlier  applications 
to  the  study  of  rigidity  in  hydrocarbons  [26,27]  and  some  a-helices  [30],  suggest 
that  the  technique  is  useful  in  assessing  structural  stability.  In  this  work,  we  extend 
the  study  to  a  series  of  chain  and  cyclic  molecules,  and  explore  the  effect  of  tem¬ 
perature  and  different  connectivity  on  the  molecular  shapes  found  along  MD  tra¬ 
jectories.  The  interplay  between  potential  energy  and  shape  changes  is  discussed. 
As  well,  we  analyze  the  variation  of  trajectory  averages  of  the  shape  descriptors  as 
a  function  of  the  backbone  length.  Finally,  the  information  that  can  be  retrieved 
from  fluctuations  in  molecular  size  and  in  molecular  shape  are  compared. 

Global  Shapes  and  Shape  Flexibility  of  Molecular  Backbones 

The  molecular  shape  features  we  deal  with  are  those  conveyed  by  the  backbone. 
In  this  work,  we  restrict  ourselves  to  the  study  of  simple  saturated  chain  and  cyclic 
molecules,  where  the  backbones  are  defined  by  a  sequence  of  straight  line  segments 
(bonds)  between  sp^  carbon  atoms.  These  backbones  exhibit  no  branching. 

The  shape  of  a  backbone  can  be  described  locally,  by  studying  the  distribution 
of  interatomic  distances  and  bond  angles  (see,  e.g..  Refs.  [31,32]).  If,  in  contrast, 
one  is  interested  in  folding  features,  a  global  analysis  will  be  preferable.  Global 
characterizations  of  backbones’  shapes  have  been  proposed  in  the  literature,  in¬ 
cluding  geometrical  measures  of  compactness  [  33,34  ] ,  as  well  as  several  topological 
methods  to  describe  folding  (e.g..  Refs.  [22,35-39]  and  others  quoted  therein).. 
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Recently,  an  approach  which  combines  both  geometrical  and  topological  data 
to  characterize  the  3D  global  shape  of  macromolecules  has  been  introduced  [28]. 
The  procedure  can  be  adapted  to  the  study  of  persistence  of  shape  features  over 
time  [26,30],  and  it  provides  oru^r  parameters  to  monitor  structural  organization 
in  polymers  [27],  Th'  method  has  been  described  in  detail  elsewhere.  For  com¬ 
pleteness,  we  discuss  here  its  basic  ideas.  The  methodology  is  used  in  later  sections 
to  study  the  shape  of  molecular  cycles  and  chains,  and  to  analyze  the  effect  of 
different  variables,  such  as  chain  length,  temperature,  functional  groups,  and  con¬ 
nectivity  type  of  the  backbone. 

To  illustrate  the  essential  ideas,  we  proceed  by  example.  Figures  1  and  2  show 
two  of  the  molecules  dealt  with  in  this  work,  dodecanoic  acid  and  cyclododecane. 
The  two  molecules  appear  in  arbitrarily  folded  configurations;  their  full  specifications 
appear  at  the  top  of  Figures  1  and  2  (the  so-called  “all-atom  structures”).  The 
corresponding  backbones  (defined  by  the  connectivity  of  carbon  atoms)  appear 
immediately  below,  on  the  left-hand  side. 

Let  us  consider  first  Figure  1 .  The  molecule  is  shown  in  a  given  configuration, 
say  K.  This  configuration  corresponds  to  a  snapshot  of  a  possible  structure  found 
along  a  md  trajectory,  and  it  can  be  identified  parametrically  as  a  function  of  time 
{t).  We  shall  therefore  label  the  configuration  as  K{t* ),  corresponding  to  a  snapshot 
at  t  =  t*.  Let  us  think  of  the  snapshot  K(t* )  of  the  backbone  as  a  rigid  entity.  Our 


Figure  1.  Four  rigid  3D  placements  of  the  carbon  backbone  of  dodecanoic  acid  in  a 
generic,  folded  conformational  snapshot  K{t*).  IThe  top  diagram  corresponds  to  the  full 
structure  in  a  given  placement;  the  backbone  with  the  same  placement  is  shown  immediately 
below,  on  the  left-hand  side.  The  other  placements  are  denved  from  this  latter  by  90° 
rotations.  Next  to  the  diagrams,  the  number  of  overcrossings  in  the  projection  is  given. 
To  calculate  the  overcrossing  probabilities,  all  possible  rigid  placemenis  must  be  studied  ] 
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Figure  2,  Four  rigid  3D  placements  of  the  carbon  backbone  of  cycledodecane  in  a  generic, 
twisted  conformational  snapshot  K(t*).  (The  top  diagram  corresponds  to  the  full  struaure 
in  a  given  placement;  the  backbone  with  the  same  placement  is  shown  immediately  below, 
on  the  left-hand  »ide.  The  other  placements  are  derived  from  this  latter  by  90°  rotations. 
Next  to  the  diagrams,  the  number  of  overcrossings  in  the  projection  is  given.) 


molecular  shape  description  uses  all  possible  3D  placements  ofK{t*),  derived  from 
one  another  by  rigid  rotations  [  28  ] .  Each  of  these  rigid  placements  can  be  regarded 
as  a  distinct  view  of  the  backbone  with  same  the  configuration  K{t*).  A  photograph 
of  a  possible  placement  (“view”)  will  appear  as  planar  2D  curve.  There  are  infinitely 
many  of  these  possible  projected  curves.  Each  of  them  can  be  characterized  by  the 
number  of  overcrossings  exhibited,  N.  (Overcrossings  appear  when  one  bond  crosses 
over  another  in  front  of  a  viewer,  thus  producing  an  actual  crossing  in  the  2D 
projection.  The  number  N  can  be  computed  algorithmically  as  discussed  in 
Ref.  [28]). 

Figure  1  shows  four  rigid  placements  of  K(t* );  each  one  is  derived  from  the  top 
placement  by  a  number  of  90°  rotations.  Next  to  each  view,  the  corresponding 
number  of  overcrossings  N  is  given.  The  top-left  view  and  the  bottom-right  view 
both  have  N=0  (i.e.,  they  exhibit  no  overcrossings).  From  this  point  of  view,  they 
are  considered  equivalent.  When  this  procedure  is  repeated  for  alt  possible  views, 
configurations  with  the  same  N  will  be  found  with  various  frequencies.  Therefore, 
one  can  introduce  the  probability  A^(n)  of  observing  a  3D  rigid  placement  with  N 
overcrossings  in  an  n-atom  backbone  [26],  or  simply  “overcrossing  probability 
.4A(n)”.  It  is  clear  that  the  set  {^A'(n)}  satisfies: 
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2 /^^(N)=  J.Vn^3.  (1) 

A’=0 

The  distribution  {/!>(«)}  is  a  global  shape  descriptor  [26],  A  configuration  K 
which  is  “open”  (not  compact),  and  exhibits  little  degree  of  entanglement,  will 
have  high  At^{n)  values  for  low  N.  That  is,  open  structures  will  mostly  appear  in 
placements  with  low  numbers  of  overcrossings.  In  the  limit  of  a  perfectly  linear 
chain  or  planar  structure,  one  has  Ao(n)=  1 ,  i.e.,  no  overcrossings  will  be  observed 
between  bonds.  In  contrast,  compact  and  entangled  configurations  will  exhibit  large 
number  of  overcrossings.  Objects  tike  random  coils  have  a  large  range  of  possible 
overcrossings,  each  one  with  a  small  probability  [27].  In  summary,  the  overcrossing 
probabilities  can  distinguish  between  configurations  with  different  folding  features. 
The  description  is  not  strongly  dependent  on  local  geometrical  features  since  it  is 
global.  A  brief  characterization  of  the  distribution  {.4,%  (at)}  can  be  given  in  terms 
of  the  most  probable  number  of  overcrossings  N*,  and  its  associated  probability 
A*: 


A*  =  A,\»(n)  =  max  .  (2) 

tA) 

For  simplicity,  the  index  n  is  dropped  from  the  notation  of  .4  *.  From  the  distribution 
f «) } ,  it  is  possible  to  compute  the  moments  of  the  number  of  overcrossings: 

W=  ^  N»AAn),p=  \,2 .  (3) 

N-’O 

A  magnitude  similar  to  the  mean  number  of  overcrossings  (p  =  1 )  has  recently 
been  suggested  to  characterize  complexity  in  self-avoiding  random  walks  in 
lattices  [40]. 

The  above  analysis  for  a  molecular  chain  (Fig.  1 )  can  be  applied  also  to  the  study 
of  macrocycles  without  branching,  as  the  one  in  Figure  2.  The  number  of  over¬ 
crossings  N  can  be  defined  for  various  types  of  connectivities.  Figure  3  shows  the 
results  of  the  computed  probabilities  of  overcrossings  for  the  chosen  configurations 
K{t*)  of  dodecanoic  acid  and  cyclododecane.  [The  configurational  dependence  is 
made  explicit  by  indicating  the  probabilities  as  {As{n,  t*)},  and  A*{K{t*))  for 
the  maximum,  whenever  necessary.]  The  histograms  of  overcrossing  probabilities 
(the  overcrossing  spectra  [27])  have  been  computed  with  several  random  distri¬ 
butions  of  viewing  directions  [41],  and  averaging  over  them.  To  achieve  an  accuracy 
of  ±0.005  in  the  maximum  probability  A*,  a  sampling  of  ca.  2,000  views  or  more 
is  needed.  As  shown  in  Figure  3,  the  most  probably  number  of  overcrossings  {N* ) 
for  the  two  configurations  is  zero.  The  maximum  probabilities  are:  A*(K{t*))  = 
Ao(  12,  t*)  =  0.799  ±  0.003,  for  dodecanoic  acid,  and  A*(K(t*))  =  0.371  ±  0.004, 
for  cyclododecane.  The  mean  number  of  overcrossings  on  N  =  0.414  (dodecanoic 
acid),  and  N  -  1.951  (cyclododecane).  Both  results  represent  well  the  fact  that  the 
backbone  of  cyclododecane  is  more  folded  than  the  backbone  of  dodecanoic  acid 
at  its  present  configuration. 
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cyclododecane 

N,  number  of  overcrossings 

Figure  3.  Overcrossing  spectra  (histogram  of  overcrossing  probabilities)  for  the  backbones 
structures  shown  in  Figures  1  and  2.  (The  probabilities  are  computed  with  an  accuracy  of 

ca.  0.004  [see  text].) 


The  analysis  above  can  be  applied  to  any  other  conformation.  If  the  procedure 
is  repeated  for  the  configurations  found  along  a  MD  trajectory,  it  is  possible  to  assess 
the  changes  in  molecular  shape  as  a  function  of  time  by  monitoring  the  values  of 
A  *,  N*,  or  N.  These  descriptors  characterize  folding  features  in  terms  of  compactness 
and  the  complexity  of  entanglements,  and  their  changes  along  a  trajectory  allow 
one  to  estimate  the  molecular  flexibility.  In  our  context,  flexibility  is  regarded  as 
the  capacity  to  move  away  from  a  given  molecular  shape.  A  rigid  molecule  can 
only  deform  to  configurations  which  have  essentially  the  same  (global)  shape.  The 
degree  of  flexibility  of  a  minimum  energy  configuration  can  be  assessed  by  analyzing 
the  fluctuations  in  its  shape  descriptors  ("shape  fluctuations” )  as  a  function  of  time, 
using  MD  trajectories  which  start  from  the  desired  minimum  [26]. 

The  trajectory  (or  ensemble)  average  of  a  shape  descriptor  can  be  used  to  establish 
the  persistence  of  a  fold.  Since  actual  md  simulations  provide  a  discrete  (numerable) 
sequence  of  configurations  {K,,  i  =  1,  2,  ...  A/},  the  trajectory  average  of  the 
probability  of  the  most  probable  number  of  overcrossings,  (,4  *  ),  is  given  by: 

{A*)  =  \\m  A*{K,)  .  (4) 

In  practice,  the  maximum  value  M  will  correspond  to  the  total  number  of  snapshots 
taken  along  the  trajectory.  In  addition,  the  shape  fluctuations  can  be  expressed 
quantitatively  by  the  standard  deviation  in  the  value  of  A  *,  : 

<T^  =  ((A*2)-(A*yy'^ . 


(5) 
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Some  of  these  shape  descriptors  have  been  used  already  to  characterize  confor¬ 
mational  flexibility  about  energy  minima  and  transition  structures  of  hexane  and 
cyclohexane  [26],  a-helices  of  glutaredoxin  [30],  some  linear  hydrocarbons  [27], 
and  to  study  the  effect  of  excluded  volume  on  the  configurations  of  soft-bead  polymer 
models  [27],  In  Ref.  [26],  the  differential  flexibility  conferred  by  the  length  of  a 
chain  was  illustrated  by  comparing  hexane  and  decane  at  a  given  temperature.  In 
this  work,  we  expand  further  these  analyses  by  comparing  a  series  of  chain  and 
cyclic  molecules  of  various  sizes.  The  role  of  temperature,  number  of  carbon  atoms, 
and  backbone  connectivity  on  the  molecular  flexibility  is  studied  in  the  next  section 
by  following  the  values  of  A*  and  N*  along  MD  trajectories. 

Shape  Fluctuations  in  Molecular  Chains:  Carboxylic  Acids 

The  dynamic  trajectories  are  generated  by  standard  techniques.  The  Dreiding 
force  field  (including  atomic  charges)  [42]  was  employed  in  the  simulations.  The 
trajectories  are  derived  by  standard  Verlet  integration  of  Newton’s  equations  of 
motion  [23].  The  coupling  to  an  external  bath  at  a  desired  temperature  T  is  sim¬ 
ulated  by  uniform  rescaling  in  the  velocities.  For  each  molecule,  30  ps  trajectories 
have  been  generated,  with  integration  step  of  1  fs.  The  trajectories,  though  not  long, 
provide  a  reasonable  sampling  of  the  accessible  configurations  at  the  simulation 
temperature.  The  results  should  be  comparable  to  other  simulations  of  methylenic 
chains  in  the  literature,  using  alternative  force  fields  [43,44]. 

Four  carboxylic  acids  have  been  studied:  hexanoic,  octanoic,  decanoic,  and  do- 
decanoic  acids  («  =  6,  8,  10,  and  12),  at  7  =  300  and  T  =  lOOOK.  To  compare 
the  ability  of  these  n -chains  to  fold,  the  simulations  have  been  started  with  the 
molecules  at  the  linear  all-trans  configurations.  These  configurations  are  potential 
energy  minima  whose  shapes  are  characterized  by  A*  =  1 .  The  MD  trajectories  are 
computed  at  constant  temperature;  the  total  energy  is  also  conserved  within  a  5% 
fluctuation,  after  an  equilibration  period  of  ca.  0.5  ps. 

Figure  4  shows  the  results  for  the  potential  energy  fluctuations  for  the  acids,  at 
300  and  lOOOK.  The  main  contribution  of  the  energy  fluctuations  comes  from 
torsional  terms.  The  results  are  rather  unchanged  if  the  atomic  charges  are  disre¬ 
garded.  Moreover,  the  molecular  chain  properties  seem  not  to  be  very  sensitive  to 
the  presence  of  a  carboxylic  terminal  group. 

Little  structural  information  can  be  extracted  from  an  analysis  as  in  Figure  4.  It 
is  clear  that  the  amplitude  of  the  potential  energy  iluctuations  becomes  larger  for 
high  T,  but  the  results  do  not  tell  how  these  fluctuations  may  affect  the  fold  of  the 
chain.  Moreover,  these  results  do  not  depend  very  strongly  on  the  chain  length. 
This  picture  can  be  compared  with  the  analysis  of  molecular  shape  descriptors. 
Figure  5  shows  the  changes  of  .4  *  along  the  same  trajectories.  The  contrast  among 
the  various  chains  is  remarkable.  Although  the  fluctuations  at  300K  are  compar¬ 
atively  small  for  all  acids,  the  results  are  very  different  at  high  temperature.  The 
amplitude  of  the  molecular  shape  fluctuations  is  larger  for  octanoic  acid  than  for 
hexanoic  acid.  The  amplitudes  for  decanoic  and  dodecanoic  are  comparable.  More- 


time  (pa) 


tima  (pa) 


Figure  4.  Fluctuations  in  the  potential  energy  along  the  dynamic  trajectories  for  carboxylic 
acids  at  3(X)K  and  lOOOK.  [The  trajectories  are  started  from  initially  linear,  ail-trans  con¬ 
formations  for  the  chains.  These  results  provide  little  information  on  how  the  energy  is 
used  to  move  the  shape  away  from  the  initial  fold.) 


over,  the  values  of  A*  decrease  along  the  trajectories  for  longer  chains,  thus  indicating 
that  the  chains  are  folding.  In  all  these  trajectories,  it  is  found  that  the  most  probable 
number  of  overcrossings  is  still  N*  =  0.  The  A*  values  indicate  the  occurrence  of 
more  entangled  objects  for  the  longer  chains  at  high  temperature.  Note  that  hexanoic, 
and  to  a  smaller  extent  octanoic  acid,  exhibits  several  configurations  along  the 
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Figure  5.  Fluctuatio.is  .n  the  shape  descriptor /J  *  ( probability  of  the  most  probable  number 
of  overcrossings)  along  the  md  trajectories  for  carboxylic  acids  at  300K  and  KKWK.  (The 
description  is  much  richer  than  the  one  in  Fig.  5.  Note  that,  the  smaller  A*,  the  more 
folded  the  structure.  Ther  vv,  at  300K  the  chains  remain  close  in  shape  to  the  linear 
backbone.  At  high  tempera  ••  ?,  the  chains  are  flexible,  but  the  amplitude  of  the  shape 
fluctuatior.'  depends  on  the  backbone  length.) 
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trajectory  with  values  of  A*  close  to  1 .  This  indicates  that  the  chain  can  stretch 
(unfold)  during  the  simulation,  and  spend  some  time  in  a  configuration  with  a 
shape  similar  to  that  of  the  original  all-trans  chain.  In  contrast,  this  situation  dis¬ 
appears  for  the  «  =  10  and  n  =  12  chains.  These  backbones  are  mostly  melted (  i.e., 
folded)  during  the  simulation. 

The  interplay  between  molecular  shape  and  potential  energy  can  be  displayed 
by  combining  the  information  of  Figures  4  and  5.  Figure  6  shows  the  sequence  of 
snapshots  along  the  MD  trajectory  represented  in  a  2D  shape  vs.  energy  diagram 
for  the  two  extreme  chains  («  =  6  and  n  =  12),  at  low  and  high  temperature.  The 
diagrams  represent  the  trajectories  as  a  sequence  of  straight  line  segments  joining 
the  pair  of  values  of  potential  energy  and  shape  descriptor  for  each  configurational 
snapshot.  The  arrow  indicates  the  orientation  of  the  trajectory,  starting  from  the 
first  configuration  after  equilibration.  These  2D  diagrams  provide  information  on 
the  type  of  folds  accessible  to  the  molecular  chains  within  a  given  range  of  potential 
energy.  At  low  temperature,  the  potential  energy  fluctuates  more  in  dodecanoic 
than  in  hexanoic  acid;  yet,  the  shape  of  the  chain  and  its  fluctuations  are  almost 
the  same  (i.e.,  the  chains  are  quite  rigid).  At  high  temperature,  the  larger  fluctuation 
in  potential  energy  for  the  n  =  12  chain  leads  not  only  to  larger  shape  fluctuations, 
but  also  to  quite  low  /I*  values.  Values  as  low  as  0.55  are  characteristic  of 
i3-turns  [30],  which  indicates  that  the  chain  is  folded  but  not  into  a  very  compact 
configuration. 

Figure  7  summarizes  the  results  for  the  shapes  of  carboxylic  acids.  The  trajectory 
average  of  the  shape  descriptor,  (A*},  and  its  standard  deviation  [Eqs.  (4)  and 
( 5 )]  are  given  as  a  function  of  the  chain  length.  At  7  =  300K,  {A*}  decays  linearly 
(and  slowly)  with  n,  whereas  the  shape  fluctuation  remains  almost  constant. 
f>lT  -  lOOOK,  the  averaged  shape  descriptor  decays  rapidly  with  the  chain  length 
lor  short  chains,  while  increases.  For  longer  chains,  the  effect  seems  to  be  less 
dramatic,  as  the  shape  fluctuations  depend  less  strongly  on  the  number  of 
carbon  atoms. 

The  results  for  the  global  folding  features  of  carboxylic  acids  along  the  dynamic 
trajectories  allows  us  to  draw  the  following  conclusions: 

{ 1 )  At  low  temperature,  the  initially  linear  chains  cannot  access  to  the  degrees 
of  freedom  that  would  rapidly  change  the  fold.  That  is,  regardless  of  the 
number  of  internal  torsions,  the  chains  cannot  fold.  Yet,  the  chains  can 
deform.  The  small  shape  fluctuations  found  indicate  the  presence  of  localized 
vibrations  about  the  original  fold,  a  feature  which  does  not  depend  strongly 
on  the  backbone’s  length.  In  these  conditions,  the  chains  are  rigid. 

(2)  At  high  temperature,  the  molecules  can  access  to  many  torsional  modes  in 
an  uncorrelated  manner,  thus  allowing  a  wide  range  of  folding  features.  The 
longer  chains  become  more  entangled  (i.e..  more  folded)  than  the  shorter 
ones.  However,  since  (A*)  moves  away  from  linearity  as  a  function  of  n 
and  decays  less  rapidly  for  longer  chains,  it  would  appear  that  the  complexities 
of  entanglements  in  sufficiently  long  backbones  are  somewhat  comparable. 
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'ngure  6.  Interrelation  between  changes  in  the  shape  descriptor  A*  and  the  potential 
energy  along  MD  trajectories  of  hexanoic  and  dodecanoic  aads.  (Each  point  on  the  diagram 
represent  one  in  a  senes  of  snapshots  (configurations)  taken  every  0.2  ps  along  the  trajectory. 
Each  configuration  A'(/)  is  characterized  by  A*,  the  shape  desenptor  of  the  chain,  and  its 
potential  energy  value.  The  arrow  indicates  the  walking  direction  along  the  trajectory, 
starting  from  the  first  structure  after  equilibration.) 


As  Stated  before,  these  observations  should  mostly  be  due  to  properties  of  the 
molecular  chain,  regardless  of  the  presence  of  a  carboxylic  terminal  group  [27], 
Most  the  above  results  are  intuitively  expected  for  linear  chains  with  identical 
monomers.  However,  the  present  approach  to  characterizing  the  molecular  shape 
provides  a  quantitative,  precise  expression  to  otherwis  qualitative  notions.  The 
previous  conclusions  could  not  have  been  drawn  from  an  analysis  based  only  on 
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n,  number  of  carbon  atoms 

Figure  7.  Changes  in  global  shape  for  carboxylic  acid  chains,  at  300K  and  lOOOK.  The 
molecular  shape  is  indica'ed  by  (  /I  *  ) ,  the  trajectory  average.  The  bars  around  the  average 
shape  are  the  standard  deviation  ei.  and  they  quantify  the  shape  fhwiuatums.  (At  300K. 
{A*)  varies  almost  linearly  with  n  (dotted  line).  At  lOOOK  the  decrease  slows  down  for 

longer  chains ) 


the  potential  energy  changes  or  the  oscillations  in  the  individual  nuclear  positions. 
In  the  next  section,  we  compare  these  results  with  those  for  a  different  type  of 
molecular  connectivity. 

Shape  Fluctuations  in  Cyclic  Molecules  and  Comparison  with  Molecular  Chains 

Dynamic  trajectories  of  cycloalkanes  have  been  computed  and  analyzed  as  done 
before  for  carboxylic  acids.  The  chosen  «-cycles  are  cyclohexane,  cyclooctane,  cy¬ 
clodecane,  and  cyclododecane,  containing  the  same  number  of  carbon  atoms  as 
the  chains  in  the  previous  section.  Any  difference  in  molecular  flexibility  for  the 
two  groups  of  molecules  is  expected  to  be  mostly  due  to  the  distinct  type  of  backbone 
connectivity. 

The  MD  simulations  of  linear  chains  were  performed  by  starting  from  a  similar 
type  of  configuration  for  all  molecules  (the  stretched,  all-trans  structures).  To  fa¬ 
cilitate  the  comparison,  we  have  started  the  simulations  of  cycloalkanes  from  con¬ 
figurations  which  also  possess  a  comparable  “shape”  for  all  cycles.  The  starting 
configuration  for  cyclohexane  is  the  familiar  “chair,”  which  is  the  lowest-energy 
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minimum.  The  other  cy-les  have  been  located  initially  at  conformational  minima 
chosen  for  their  similarity  in  shape  to  the  chair-cyclohexane  (“crown”  structures). 
These  correspond  to  structures  showing  bonds  which  “zig-zag”  uniformly  about 
the  average  plane  of  the  cycle.  Moreover,  the  initial  configurations  are  chosen  so 
that  they  also  grow  in  size  proportionally  to  the  number  of  carbon  atoms.  The 
larger  cycles  are  thus  elongated,  not  circular.  For  example,  the  backbone  for  the 
initial  configuration  “crown-cyclododecane”  resembles  two  parallel  strands  of  4 
bonds,  joined  at  their  extrema  by  one-carbon  bridges.  (The  structure  in  Fig.  2  is 
deformed  away  from  this  initial  configuration.) 

The  initial  configurations  of  the  four  macrocycles  can  be  regarded  as  formally 
derived  from  linear  chains  by  folding  them  at  their  middle,  r  nd  joining  their  extrema 
thereafter.  Consequently,  the  cycles  appear  as  structures  wh  ose  shape  is  more  folded 
than  chains.  This  is  reflected  in  the  shape  descriptors,  which  indicate  a  smaller 
probability  of  no-overcrossings  for  cyclic  molecules  (cf.  Fig.  3).  The  fluctuation  of 
their  molecular  shapes  with  the  temperature  has  been  studied  using  the  same  force 
field  as  for  the  carboxylic  acids  [42],  The  simulation  conditions  and  lengths  of  the 
MD  trajectories  are  also  the  same. 

Figure  8  shows  the  interrelation  between  potential  energy  and  molecular  shape 
in  the  two  extreme  cases,  cyclohexane  and  cyclododecane  at  300  and  lOOOK.  This 
figure  can  be  contrasted  with  Figure  6,  corresponding  to  the  molecular  chains. 
(Note  the  difference  in  scales.)  A  comparison  of  molecules  with  the  same  number 
of  carbon  atoms  reveals  that  the  fluctuations  in  the  potential  energy  are  similar  for 
the  n-cycle  and  the  n-chain  at  the  same  temperature,  but,  their  shape  fluctuations 
are  quite  different.  At  300K,  the  cycloalkanes  e'.hibit  shape  fluctuations  with  am¬ 
plitudes  twice  as  much  larger  than  those  for  linear  chains.  At  1 OOOK,  the  amplitudes 
for  chains  and  cycles  become  closer.  Note  that  the  fluctuations  for  cycles  are  quite 
symmetric  about  the  average  shape,  whereas  the  fluctuations  for  chains  at  lOOOK 
are  asymmetric,  since  the  trajectory  spends  some  time  at  low  A*  configurations 
(i.e.,  more  folded  or  entangled).  Moreover,  the  average  of  the  shape  descriptor  for 
a  given  cycle  does  not  change  much  with  the  temperature,  whereas  it  becomes  smaller 
for  chains. 

The  results  in  Figure  8  illustrate  the  evident  change  in  molecular  shape  caused 
by  the  different  connectivity;  cycles  exhibit  large  fluctuations  but  their  essential 
shape  is  conserved  (no  unfolding  is  allowed),  whereas  the  chains  can  change  their 
shape  due  to  folding.  Furthermore,  it  is  observed  that  the  cycles  are  more  flexible 
than  chains,  even  at  low  temperatures.  Note  that  the  values  of  the  shape  descriptor 
for  long  chains  at  high  temperatures  are  close  to  those  of  a  cycle  with  the  same 
number  of  atoms.  That  is,  “hot”  chains  can  fold  enough  to  resemble  cycles  (with 
one  “missing”  bond). 

Figure  9  collects  the  results  for  the  shape  descriptor  {A*')  and  its  fluctuation  a  a 
for  the  cycloalkanes.  (Comp-are  with  Fig.  7  for  the  chains.)  Except  for  a  discrepancy 
in  cyclodecane  at  high  and  low  temperatures,  the  results  are  in  line  with  the  dis¬ 
cussion  above.  The  molecular  shape  becomes  more  convoluted  for  the  larger  cycles 
(as  (A*)  decreases),  but  their  average  molecular  shape  does  not  depend  too  strongly 
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Figure  8.  Interrelation  between  changes  m  the  shape  descriptor  A*  and  the  potential 
energy  along  MD  trajectories  of  cyclohexane  and  cyclododecane.  (Each  point  on  the  diagram 
represent  one  in  a  series  of  snapshots  (configurations)  taken  every  0.2  ps  along  the  trajectory. 
Each  configuration  A((r)  is  characterized  by  ^4*,  the  shape  descnptor  of  the  chain,  and  its 
potential  energy  value.  The  arrow  indicates  the  walking  direction  along  the  trajectory, 
starting  from  the  initial  structure  [a  cycle  in  a  minimum  energy  “crown”  conformation, 

see  text).', 


on  the  temperature.  At  low  temperature,  the  s.iape  descriptor  decreases  linearly 
with  n,  whereas  at  high  temperature  the  behavior  is  more  scattered.  This  erraticity 
at  high  temperature  is  probably  a  consequence  of  the  larger  fluctuations  about  the 
average  shape,  and  may  indicate  that  longer  trajectories  are  advisable  to  compute 
more  accurate  averages.  In  contrast  with  the  behavior  of  cycles,  molecular  chains 
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Figure  9.  Changes  in  global  shape  for  cycloalkanes,  at  300K  and  lOOOK.  The  molecular 
shape  is  indicated  by  (A*),  the  trajectory  average.  The  bars  around  the  average  shape 
are  the  standard  deviation  a^,  and  they  qiantify  the  shape  fluciuaiwns.  (The  shape  de¬ 
scriptor  varies  almost  linearly  with  n  at  300K  (doited  line).  At  large  temperature  the 
behavior  is  similar,  although  the  results  are  scattered  due  to  the  larger  shape  fluctuations 

[see  text).) 


show  a  marked  dependence  on  T  in  both  the  shape  fluctuations  and  the 
average  shape. 

Further  insight  into  the  configurational  changes  along  the  dynamic  trajectories 
can  be  obtained  by  comparing  molecular  shapes  and  molecular  sizes.  Molecular 
size  can  roughly  be  described  by  R,  the  radius  of  the  smallest  sphere  enclosing 
completely  the  backbone.  The  average  molecular  size  of  a  molecule  at  a  given 
temperature,  (/?),  can  be  computed  as  before  as  a  trajectory  average  [cf.  Eq.  (4)]. 
The  parameter  </?)  is  proportional  to  a  hydrodynamic  radius  [20],  and  scales 
similarly  to  the  radius  of  gyration  [45].  Figure  10  shows  the  averaged  radii  for 
chains  and  cycles.  (In  the  case  of  chains,  the  values  for  both  carboxylic  acids  and 
hydrocarbons  [27]  are  included.) 

The  results  in  Figure  1 0  reveal  a  number  of  facts,  which  we  list  and  discuss  below; 

( 1 )  The  average  sizes  of  cycloalkanes  are  almost  independent  of  the  temperature.. 
This  correlates  well  with  our  finding  of  little  dependence  on  T  in  the  mo¬ 
lecular  shape  of  cycles.  Nevertheless,  a  difference  exists  for  larger  cycles, 
indicating  that  the  latter  can  maintain  their  size  and  yet  become  somewhat 
more  convoluted  or  entangled. 
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n,  number  of  carbon  atoms 

Figure  1 0  Average  molecular  size  <  )  ( radius  of  the  smallest  sphere  enclosing  the  back¬ 
bone,  averaged  over  the  dynamic  trajectory)  for  chains  and  cycles  at  different  temperatures. 
(The  results  labeled  "chains"  contain  the  data  for  both  carboxylic  acid  chains  and  hydro¬ 
carbon  chains  of  the  same  length.  The  change  in  radii  is  perfectly  linear;  note  that  the 
intercept  for  chains  at  large  temperature  and  cycles  are  the  same.  The  linear  behavior 
should  disappear  for  very  long,  random  chains  [see  text].) 


( 2 )  Linea’-  chain  molecules  become  more  compact  at  high  temperature,  although 
the  difference  in  size  disappears  for  short  chains.  This  agrees  with  the  results 
in  Figure  7  for  the  shape  descriptors.  For  chains,  there  is  a  close  correlation 
between  shape  (as  given  by  the  complexity  of  entanglements  in  the  fold) 
and  size  (as  expressed  by  Nevertheless,  the  amplitude  of  the  fluctu¬ 
ations  in  (a*)  indicates  that  chains  may  become  compact  and  entangled 
without  necessarily  becoming  smaller. 

( 3 )  The  radii  correlate  linearly  with  the  number  of  carbon  atoms,  hut  the  intercepts 

for  chains  at  low  T  and  high  T  are  different.  Moreover,  the  intercept  for  chains 
at  high  T  coincides  with  the  intercept  for  the  cycles.  This  result  is  interesting, 
since  it  suggests  some  similarity  between  configurations  of  cycles  and  those 
of  chains  at  high  T.  A  similar  relation  was  apparent  when  using  shape  de¬ 
scriptors,  since  the  values  of  A*  for  large  cycles  and  “hot”  chains  are  not  far 
from  each  other.  (As  a  side  comment,  it  must  be  noted  that  for  very  long, 
random  chains,  </?)  should  move  apart  from  the  linearity  and  scale  ap- 
proximatel'-  as  {/?)  fl"^***  [27,45,46].) 

The  results  above  show  some  connections  between  the  descriptions  in  terms  of 
size  and  shape,  as  well  as  some  differences.  The  differences  become  even  more 
transparent  when  looking  at  fluctuations.  The  shape  fluctuations  are  represented 
by  the  dimensionless  descriptor  o-^,  which  is  the  standard  deviation  about  the  average 
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maximum  probability  of  overcrossings  in  rigid  3D  placements,  .  For  the  sake 

of  a  proper  comparison,  we  use  as  a  dimensionless  descriptor  of  size  fluc¬ 

tuations.  Relative  r.m.s.  deviations  could  have  been  used  instead. 

The  two  approaches  are  compared  in  Figure  1 1 .  The  top  diagram  shows  the 
fluctuations  in  shape  for  cycles  and  chains  at  300K  and  lOOOK.  The  results  are 
extrapolated  to  =  0  for  n  =  3,  which  is  an  exact  limit  (the  3-atom  figure  is  always 
planar,  thence  its  shape  descriptor  is  constant).  The  lower  diagram  shows  the  results 
for  the  fluctuations  in  size.  Here,  we  choose  to  extrapolate  to  an/^R')  =  0  for 
«  =  2,  where  the  difference  between  chains  and  cycles  disappears.  This  extrapolation 
does  not  represent  an  exact  limit. 


n,  number  of  carbon  atoms 

Figure  1 1.  Comparison  between  fluctuations  in  molecular  shape  (<r^)  and  fluctuations 
in  molecular  size  (  itr/(R  ))  for  chains  and  cycles  at  300K  and  lOOOK.  ( Note  that,  whereas 
the  .si:v  fluctuations  in  "hot”  chains  differ  from  those  of  cycles,  the  shape  fluctuations  in 
long  chains  and  large  cycles  at  lOOOK  is  quite  similar.  This  is  in  line  with  the  fact  that  at 
high  temperature,  the  configurationally  averaged  properties  of  large  cycles  and  long  chains 
should  be  almost  the  same.) 
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The  fluctuations  in  size  and  shape  for  cycles  and  chains  at  300K  are  quite  similar. 
However,  it  is  clear  that,  in  terms  of  size fluctuations,  "hot "  chains  are  very  different 
from  cycles  (bottom  diagram ) .  In  contrast,  the fluctuations  in  shape  are  similar  for 
large  cycles  and  long  chains  at  high  temperature  (top  diagram).  This  outcome  is 
more  in  line  with  the  notion  that  when  averaging  global  properties  of  sufficiently 
large  macrocycles  over  configurational  space,  the  results  should  be  comparable  to 
those  derived  for  flexible  chains  with  the  same  number  of  monomers  [21,22,45], 
This  result  makes  clear  the  importance  of  monitoring  the  global  shape  features  of 
a  backbone,  and  not  only  its  size  or  ’ocal  geometrical  properties. 

Summary  and  Conclusions 

In  the  previous  sections,  a  method  to  monitor  the  persistence  over  time  of  mo¬ 
lecular  shape  features  has  been  discussed  and  applied  to  the  study  of  chain  and 
cyclic  molecules  of  various  lengths.  The  procedure  provides  a  quantitative  expression 
of  global  shape  for  each  of  the  configurational  snapshots  found  along  a  dynamic 
trajectory.  The  folding  features  of  the  backbone  at  a  given  configuration  are  described 
in  terms  of  the  number  of  overcrossings  in  every  possible  rigid  3D  placement.  The 
more  entangled,  convoluted,  and  compact  the  backbone,  the  more  probable  the 
larger  numbers  of  overcrossings.  Therefore,  the  use  of  the  overcrossing  probabilities 
as  shape  descriptors  characterizes  both  the  compactness  of  the  backbone,  as  well 
as  the  nature  of  the  entanglements.  Other  shape  descriptors  (the  moments  of  the 
number  of  overcrossings)  can  be  derived  from  the  probability  distribution.  The 
procedure  has  been  applied  here  to  chain  and  cyclic  backbones  without  branching. 
However,  the  basic  ideas  can  also  be  extern’ .id  to  deal  with  other  systems,  such  as 
clusters  and  hydrogen-bonded  networks  [27]. 

The  shape  is  characterized  in  an  absolute  manner,  not  relative  to  a  reference 
configuration.  Therefore,  the  values  of  ^  *  encountered  along  a  trajectory  tell  not 
only  how  the  shape  fluctuates,  but  also  they  identify  the  instantaneous  folded  struc¬ 
tures  that  the  backbone  explores.  The  recognition  of  the  similarity  in  shape  between 
some  folded  chains  and  molecular  cycles  (with  the  same  number  of  monomers)  is 
an  illustration  of  this. 

Summarizing  the  results,  our  analysis  reveals  a  difference  in  flexibility  at  low  and 
high  temperatures  for  all  molecules,  even  though  the  average  shape  is  conserved 
more  if  the  backbone  is  constrained  to  a  cycle.  At  low  temperatures,  the  chains  are 
rigid,  whereas  the  cycles  are  always  floppy.  In  contrast,  the  fluctuations  in  shape  at 
high  temperature  for  sufficiently  large  chains  and  cycles  are  comparable.  At  low 
temperature,  the  shape  descriptor  shows  an  approximately  linear  decrease  with  the 
number  of  monomers,  at  least  for  short  backbones.  The  dependence  is  much  softer 
at  high  temperature  for  long  backbones,  thus  indicating  that  all  folding  possibilities 
become  accessible.  Moreover,  the  present  results  appear  to  depend  mostly  on  the 
connectivity  of  the  backbone,  its  length,  and  the  temperature,  and  not  strongly  on 
the  presence  of  a  terminal  functional  group.  This  will  not  be  true  in  general  for  all 
types  of  backbones. 
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The  present  shape  characterization  describes  quantitatively  the  molecular  flexi¬ 
bility  and  the  results  agree  with  the  overall  expected  physical  picture  in  known 
cases.  The  method  should  therefore  be  a  valuable  tool  in  the  analysis  of  other 
phenomena,  such  as  various  one-molecule  “phase”  transitions,  where  a  correct 
assessment  of  global  folding  as  a  function  of  time  is  a  central  issue. 
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Abstract 

The  sensitivity  of  state-to-state  transition  probabilities  for  the  3-dimensional  H  +  H2  hydrogen-exchange 
reaction  ( at  zero  total  angular  momentum)  has  been  investigated  with  respect  to  variations  in  the  interaction 
pu..'ntial.  Several  regions  of  configuration  space  where  the  dynamics  is  highly  sensitive  to  inaccuracies 
in  the  potential  have  been  identified  These  regions  of  importance  vary  with  collision  energy,  but  do  not 
change  as  dramatically  as  the  previously  studied  tJ-  Chem.  Phys.  97, 6226  ( 1992)]  collinear  case  Near 
the  reaction  threshold,  the  dynamics  is  most  sensitive  to  the  saddle  point  region  as  expected  At  higher 
energies  ( about  1 .0  to  1 .5  eV ) .  howe  ver,  the  inner  corner  of  the  potential,  where  the  dynamics  "cuts  the 
corner"  in  going  from  reactant  to  product  arrangements,  is  most  important  for  collinear  geometnes,  and 
the  outer  corner,  where  the  H3  conformation  is  more  compact  than  the  transition  state  conformation, 
IS  most  important  for  bent  geometnes.  Surpnsingly,  the  region  of  the  potential  traversed  by  the  minimum 
energy  path  across  the  saddle  point  region  has  rather  insignificant  sensitivities  at  these  higher  energies, 
rc'  1993  John  Wilcv  &Sons.  Inc. 


Introduction 

One  of  the  goals  of  theoretical  chemistry  is  to  be  able  to  predict  experimental 
observables  to  within  chemical  accuracy  starting  completely  from  first  principles. 
In  the  field  of  chemical  dynamics,  this  goal  is  arguably  at  hand  for  the  complete 
description  of  the  simplest  chemical  reaction — that  of  the  H  +  Hy  hydrogen-ex- 
change  reaction  and  its  isotopic  variants  [!].  The  prediction  of  state-to-state  integral 
cross  sections  as  a  function  of  energy  [  2-4  ]  and  product  vibrational-rotational  state 
distributions  [5-12]  have  been  corroborated  to  a  large  extent  by  experiment  [11,13- 
19].  This  state  of  affairs  is  attributable  in  part  to  the  existence  of  very  accurate 
potential  energy  surfaces  [20-22]  for  this  3-electron  system.  Despite  these  successes, 
there  are  still  some  remaining  issues  where  theory  and  experiment  are  in  disaccord 
[8,1 1, 16(d),  18, 19].  For  example,  the  state-resolved  differential  cross  sections  for 
the  D  +  H2  DH  +  H  reaction  show  a  large  discrepancy  between  the  calculated 
and  the  experimentally  observed  ratios  of  forward-scattered  to  backward-scattered 
intensities  [18].  Since  the  various  scattering  theories  used  are  formally  exact,  and 
calculations  from  different  research  groups  employing  different  formalisms  have 
yielded  essentially  the  same  result,  the  theor,  versus  experiment  discrepancies  have 
naturally  been  ascribed,  on  the  theoretical  slue,  to  deficiencies  in  the  potential 
energy  surfaces.  Agreement  between  theory  and  experiment  is  even  more  difficult 
to  attain  for  more  complex  systems  because  the  calculation  of  accurate  potential 
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energy  surfaces  becomes  increasingly  more  elusive.  The  natural  question,  then,  is 
how  can  one  systematically  improve  a  potential  energy  surface  (I»ES)  without  re¬ 
sorting  to  calculating  the  whole  I’ES  anew  with  higher  levels  of  theory.  It  is  partly 
with  this  question  in  mind  that  we  arc  furthering  the  development  of  quantum 
functional  sensitivity  analysis  (OFSa)  [23,24] — to  provide  feedback  information 
to  quantum  chemists  in  their  pes  refinement  cflbrts.  An  equally  important  reason 
for  using  QP.SA  to  study  chemical  dynamics  is  to  gain  a  better  understanding  of  the 
potcntial-to-dynamics  relationship. 

QFSA  is  a  methodology  for  studying  the  response  of  calculated  observables  (or 
model  outputs)  to  small  variations  in  model  input(s).  When  the  PES  is  viewed  as 
the  model  input  to  a  quantum  .scattering  calculation,  QFSa  provides  information 
about  which  regions  of  the  PES  arc  most  important  for  determining  the  quantum 
dynamics.  The  particular  quantities  of  interest  are  the  functional  derivatives  (or 
sensitivities),  hOlhV(K),  where  the  observable,  O,  is  viewed  asa  functional  of  the 
PES,  0[K(R)].  This  information  is  expected  to  And  utility  among  quantum  chemists 
in  providing  a  guide  to  regions  of  the  PES  where  a  hi^cr  density  of  ah  initio  points 
should  be  calculated  ( i.c.,  regions  of  the  pes  where  the  sensitivities  are  largest),  and 
also  in  the  PES  refitting  efforts  (errors  between  ah  initio  points  and  the  fitted  surface 
should  be  minimized  in  regions  of  largest  sensitivities). 

QFSA  has  been  applied  to  I -D  [23(a)]  and  collinear  [23-25]  reactions,  and 
to  the  inverse  problem  of  obtaining  a  pes  from  scattering  [26]  or  spectral  [27] 
data.  The  latter  calculations  arc  based  on  an  iterative  procedure  where  the  po¬ 
tential  is  systematically  corrected  until  deviations  between  the  predicted  and 
the  "target”  observables  arc  below  some  tolerance.  These  studies  have  shown 
that  many  if  not  an  inhnilc  number  of  pess  can  accurately  reproduce  a  finite 
number  of  experimental  observables  [  27(b), 28],  Therefore,  there  is  no  guarantee 
that  the  iterative  procedure  of  refining  and  refitting  the  potential  using  QFSa  as 
proposed  here  will  eventually  converge  to  the  true  i  ES.  The  direction  of  our 
effort  is  not  simply  to  aid  in  obtaining  an  arbitrary  PES  which  can  reproduce  the 
experimental  observable  but  rather  to  obtain  a  pes  which  is  ab  m/r/o-based — 
that  is,  to  further  the  aforementioned  goal  of  predicting  experimental  observables 
completely  from  first  principles. 

This  article  describes  the  sensitivity  of  s]atc-to-statc  transition  probabilities 
for  the  3-dimcnsional  H  +  H2  hydrogen  '‘xchange  reaction  at  a  total  angular 
momentum  7  =  0  to  variations  in  the  underlying  PES.  fhis  is  the  first  report  of 
a  QFSA  application  to  a  chemical  reaction  in  3-D.  Sensitivities  of  the  state-to- 
state  transition  probabilities  are  shown  at  several  collision  energies  and  com¬ 
parisons  arc  made  to  the  sensitivities  for  the  collincar  H  f  H2  reaction.  In  this 
study,  we  investigate  the  similarities  and  differences  in  trends  between  the  sen¬ 
sitivities  for  the  collincar  and  3-D  reactions  to  determine  what  features  of  col¬ 
lincar  sensitivities  carry  over  to  3-0,  We  also  help  resolve  some  issues  concerning 
visualizing  data  in  many  dimensions. 

The  quantum  scattering  of  H  with  H2  is  described  by  the  formalism  of  the  log- 
derivative  Kohn  variational  method  [  29-3 1  ] .  The  salient  features  of  applying  QFSA 
within  this  scattering  framework  are: 
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( 1 )  All  the  necessary  quantities  for  calculating  sensitivities  are  readily  obtained 
from  the  scattering  calculation.  The  sensitivities  can  be  obtained  with  ap¬ 
proximately  20%  more  CPU  time. 

(2)  All  the  matrix  elements  of  the  Hamiltonian  are  energy  independent.  Hence, 
sensitivities  can  be  obtained  at  several  collision  energies  without  recomputing 
the  matrix  elements. 

( 3 )  The  functional  derivative  of  the  scattering  wave  functions  with  respect  to 
the  PES  need  not  be  calculated. 

The  potential  surface  used  in  our  studies  is  the  Liu-Siegbahn- Truhlar-Horowitz 
(LSTH)  [20]  surface. 

The  remainder  of  this  article  is  organized  as  follows.  The  second  section  presents 
a  summary  of  the  theoretical  and  computational  methodology.  Our  results  and 
discussion  are  presented  in  the  third  section,  and  the  fourth  section  concludes  the 
article. 


Theoretical  Method 


The  scattering  of  atom  A  with  diatom  BC  is  described  quantum  mechanically 
by  the  log-derivative  Kohn  variational  method  [  29  ]  as  employed  by  Manolopoulos 
et  al.  [  30,3 1  ] ,  and  the  requisite  equations  for  calculating  the  sensitivities  are  obtained 
by  taking  appropriate  functional  derivatives  of  the  scattering  (S)  and  log-derivative 
(Y)  matrices  with  respect  to  the  interaction  potential.  The  mathematical  devel¬ 
opment  of  the  sensitivity  equations  for  the  3-D  A  +  BC  reaction  at  zero  total 
angular  momentum  is  essentially  the  same  as  that  of  the  collinear  case  [24]  with 
the  sole  exception  being  that  the  internal  vibrational  functions  are  replaced  by 
rovibrational  functions.  Thus,  we  shall  provide  only  a  summary  here  and  direct 
the  interested  reader  elsewhere  [24]  for  additional  details. 

The  scattering  formalism  uses  the  well-known  mass-scaled  Jacobi  coordinates 
where  the  Hamiltonian  (//)  can  be  written  as 


1  a2  t2p 


(1) 


R.^,  r^,  and  6^  are,  respectively,  the  coordinate  of  the  lone  atom  with  respect  to  the 
center-of-mass  of  the  diatom,  the  vibrational  coordinate  of  the  diatom,  and  the 
angle  subtended  by  R.,  and  f.^,  all  in  arrangement  y.  The  Hamiltonian  of  the  isolated 
diatom  is  ,  and  f is  the  interaction  potential  with  respect  to  which  the  sensitivities 
are  computed.  Note  that  V,„t  is  not  the  total  potential,  but  we  shall  dispense  with 
the  int  subscript  in  the  functional  derivatives  for  notational  simplicity.  The  symbols 
ju,  and  are  defined  in  the  usual  manner. 

The  log-derivative  wave  function  is  expanded  in  a  direct  product  basis  of 

translational  Lobatto-shape  functions  [30]  {Uy,)  and  asymptotic  rovibrational 
functions  (<Pyn/ry)  in  the  three  possible  atom-diatom  arrangements. 
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=  2  UaMMra,  <?.)  +  2  ^  UkA  Ofc) 

,/  j„  J<hfh 

+  2  UcARc)<t>cn(rc,  eA 

Kn  Rcrc 

subject  to  the  boundary  conditions 

^PUR■r  -*►  0)  =  0 
and 


(2) 


(3a) 


=  (3b) 

Ky  fy 

R  determines  the  bounding  surface  [31]  separating  the  “interior”  and  “exterior” 
regions  of  the  potential.  It  is  the  surface  on  which  the  interaction  potential  has 
sufficiently  decayed  to  zero.  For  notational  simplicity,  we  use  the  index  n  to  represent 
the  compound  ( v,  j)  vibrational-rotational  indices  of  the  internal  rovibrational 
function. 

One  of  the  two  computationally  intensive  steps  in  the  calculation  is  the  construc¬ 
tion  of  the  “stiffness”  matrix,  K,  defined  by 


=  J^ntenor  R\"^‘i"  dR-i"  d(COS  By")  ^  ^  ^  ^ 

region  >  7  "r  7 


dRy- 


Ot\y" 


■f  ^  2  I  ^71  *^7"!  I  ^inr  hy"  +  2^^  2 


(4) 


Note  that  the  total  energy,  £,  appears  only  as  a  multiplicative  factor  in  front  of  an 
overlap  matrix.  Hence,  after  the  K-matrix  elements  have  been  computed  just  once, 
the  scattering  problem  can  be  solved  for  any  number  of  collision  energies  ( for  which 
the  chosen  basis  set  is  adequate)  without  recomputing  the  K-matrix  elements.  Fur¬ 
thermore,  the  stiffness  matrix  is  symmetric  if  the  basis  functions  are  ordered  likewise 
in  the  rows  and  columns.  In  ordering  the  basis  functions,  the  boundary  functions 
from  all  arrangements,  i.e.,  those  basis  elements  containing  the  boundary  Lobatto 
functions,  Uy^y  {My  is  the  number  of  translational  functions  in  arrangement  y), 
are  grouped  together  in  the  last  rows  and  columns.  With  this  grouping,  the  K-matrix 
can  be  partitioned  as 


Kii  K|o 

Koi  Koo. 


(5) 


where  the  subscript  0  refers  to  the  sub-block  of  matrix  elements  containing  Uyst^. 
Thus,  if  Ny  rovibrational  functions  and  My  translational  functions  are  used  to  span 
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each  arrangement  y,  then  Kio  is  a  large-by-small  rectangular  matrix  of  dimension 
[N^Ma  -  1 )  +  NhiMt.  -  1 )  +  NAM,  -  1  )]®[A^a  +  Nh  +  Kl  and  Koo  is  a  small- 
by-small  square  matrix  of  dimension  [A^,,  +  A^z,  -1-  iVJ®[iVa  -F  iV/,  +  A^r],  etc. 

The  log-derivative  matnx,  Y,  whose  elements  are  defined  by  the  extremum  of 

'/» '  1  boundan  ”  ^ntenor  ^  d{COS 

region 


X 


('l'yn)^(^y'n-) 


dRy. 


+ 


W4 


Km  +  Ity-  + 


a.-)*'-') 


(6) 


with  respect  to  the  log-derivative  wave  function,  can  be  written  in  terms  of  the  K- 
matrix  as 


Y  =  Koo-Ko,(K„)-'K,o  (7a) 

=  [K-'W.  (7b) 

The  variationally  determined  log-derivative  wave  function  expansion  coefficients 
are 


C  =  -Kri'K,o  =  (K-  ■' ]ioY  for  the  interior  basis  functions, 


and 

=  5yy'd„„'  for  the  boundary  functions.  (8) 

As  Eqs.  (7b)  and  (8)  suggest,  the  second  computationally  intensive  step  is  the 
solution  of  the  system  of  equations  KX,  =  e^,  where  is  a  column  vector  with  a 
one  at  position  j.  The  solution  is  the yth  column  of  K'' .  According  to  Eq.  (7b), 
one  obtains  Y  by  letting  j  run  over  the  last  Ng  +  Ni,  +  N,  positions  of  the  unit 
vector  e  (the  positions  corresponding  to  subscript  0  in  the  partitioning  of  K),  storing 
the  last  Na  +  Ni,  +  rows  of  X^,  and  then  inverting  the  resultant  small-by-small 
matrix  [K"'  ]oo.  To  calculate  sensitivities,  one  also  needs  to  store  the  other  rows  of 
the  already  computed  X^.  This  is  the  large-by-small  rectangular  matrix  lK“']io, 
from  which  one  obtains  the  log-derivative  wave  function  coefficients  by  multipli¬ 
cation  with  Y.  The  log-derivative  wave  functions  play  an  integral  part  in  the  cal¬ 
culation  of  sensitivities  [see  Eq.  ( 1 1 )]. 

The  remaining  steps  of  the  calculation  involve  only  small-by-small  matrices.  The 
scattering  matrix,  S,  is  related  to  Y  via 

S  =  a  +  2/b[Y-c]-'b  (9) 

where  a,  b,  and  c  are  diagonal  matrices  related  to  the  incoming  (/.,„)  and  outgoing 
{Oy„)  waves 

lyn  =  ky!,'^hl{k^„Ry), 

_ Oy„  =  K-y!.H^,^\k,„Ry), 

with  ky„  =  V2m(£'  -  Cy„)/h^,  by 
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^'yw.'yn 

^ytt.yn  ~  (^yn^ 

and  c^n.-,»  =  (10) 

all  evaluated  at  /ly'  -’  are  the  Riccati-Hankel  functions  as  defined  by 

Calogero  [  32  ] ,  and  Cy„  is  the  internal  rovibrational  energy  of  state  yn .  The  functional 
derivative  of  the  transition  probability  with  respect  to  the  interaction  potential  is 
[24] 


^Ry'n*-*~yn 

hV(Ry,ry,ey) 


=  2  Re 


5V(Ry.ry,ey) 


by 


'n'.y'n*' 


2 


[Y  - 


X 


[Y-c] 


-I 

yn.‘i'*n 


(11) 


Finally,  a  simple  interpretation  of  the  transition  probability  sensitivities  in  terms 
of  a  product  of  two  scattering  wave  functions. 


^^y'  *n*^yn 

5V(Ry,ry,dy) 


h- 


(12) 


can  be  obtained  by  relating  the  log-derivative  wave  function  {'Py„)  to  the  normal 
scattering  wave  functions  (^y„)  which  satisfy  the  boundary  conditions  [24] 


\p-l,(Ry-  ->■  X)  = 


IyniRy)<l>y 

RyKy 


Oy'n-{^y')<t>y'»' 

Ry'fy’- 


(13) 


In  our  calculations,  we  used  three-identical  particle  symmetry  [2(b),33]  to  reduce 
the  size  of  our  K-matrix  by  a  factor  of  three.  The  size  of  our  basis  ranged  from  M 
=  30  translational  Lobatto  functions  and  N  =  30  rovibrational  functions  (for  each 
arrangement)  in  the  lower  energy  regime  to  M  =  N  =  40  in  the  higher  energy 
regime.  The  values  of  all  the  other  parameters  are  the  same  as  those  employed  by 
Manolopoulo'  and  Wyatt  [  30  ] .  For  a  single  energy  slice  with  M  =  N  =  30  on  a 
Cray-2,  the  calculation  through  setting  up  the  K-matrix  took  1 0  s.  and  the  calculation 
of  reaction  probabilities  took  another  15  s.  The  sensitivity  calculation  on  a  55-by- 
45  grid  of  points  took  an  additional  9  s.  For  M  =  N  =  40,  these  three  times  are  30, 
74,  and  22  s,  respectively.  To  verify  the  accuracy  of  the  sensitivity  predictions  and 
ferret  out  any  programming  bugs,  we  did  the  usual  check  [24]  of  placing  small 
bumps  on  the  potential  and  comparing  the  predicted  with  the  actual  change  in 
transition  probabilities. 


Results  and  Discussion 

As  we  mentioned  in  the  Introduction,  this  is  the  first  article  on  a  qfsa  application 
to  a  chemical  reaction  in  3-D.  One  of  the  issues  that  concerned  us  was  how  to 
display  and  analyze  the  multidimensional  sensitivity  data  and  be  assured  that  we 
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have  not  missed  any  important  features.  hPIbVis  a  function  of  the  coordinates  R, 
r.  and  B,  as  well  as  the  total  energy,  E.  The  other  issue  was  how  the  sensitivities  for 
the  3-D  H  +  H2  reaction  compared  with  those  from  the  coMinear  calculation.  In 
order  to  acquire  a  sense  of  the  evolution  of  sensitivity  structuie  in  the  energy  range 
0.50  to  1.50  eV,  we  calculated  reactive  transition  probability  sensitivities  at  200+ 
energy  slices  for  transitions  from  (v  =  0,j  =  0)  to  (u',/)  =  (0,  0),  (0,  1 ),  (0,  2). 
and  also  calculated  the  cumulative  sensitivities  for  transitions  from  (0,  0)  to  all 
product  states  'open  ^Pvy^oo/BV).  These  sensitivities  were  displayed  as 
contour  maps  and  saved  onto  videotape.  In  order  to  facilitate  a  comparison  with 
t/ie  collinear  H  +  Hj  sensitivities  and  because  the  largest  sensitivities  occurred  when 
the  H3  system  passed  through  collinear  configurations,  most  of  the  analysis  was 
done  at  6  =  180°.  An  analysis  of  each  set  of  sensitivity  structure  is  presented  in  the 
four  subsections  below  followed  by  a  discussion  of  sensitivity  structure  at  other 
scattering  angles.  We  consider  only  reactive  transitions,  so  for  notational  conve¬ 
nience  we  omit  the  arrangement  indices.  Figure  1  shows  the  transition  probability 
from(i;  0  y  =  0)  to  (u',/)  =  (0. 0).  (0,  1 ),  (0,  2),  and  the  cumulative  transition 
probability  to  all  open  product  states.  It  provides  a  guide  to  the  discussion  of  sen¬ 
sitivity  structure  in  different  energy  regimes. 

Sensitivities  for  Transitions  from  v  =  0j  =  0tov'  =  0f-0(0=  180°) 

For  the  3-D  7  =  0  H  +  H2  reaction,  the  evolution  of  the  bPoo-oo/^^  sensitivity 
structure  in  the  threshold  region  (0.55  to  0.64  eV)  is  very  similar  to  the  collinear 


Figure  I .  Reactive  sta(c-io-state  transition  probabilities  from  r  =  0  7  =  0  to  o'  =  0. 7'. 
The  cumulative  transition  probabilit>  fiom  t  =  O7  =  0  to  all  open  product  states  has  been 
multiplied  b\  a  factor  of  0  5  before  plotting  as  a  dashed  line. 
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case  (see  Refs.  [23(a)],  [24]).  There  is  a  large  region  of  negative  sensitivity  near 
the  saddle  point  of  the  potential  energy  surface  (PES).  The  most  negative  regions 
start  out  on  the  two  shoulders  of  the  barrier  and  lie  slightly  toward  the  inner  comer* 
where  tunneling  is  important.  These  locations  correspond  roughly  to  the  classical 
turning  points  along  the  vibrationally  adiabatic  reaction  path.  As  the  energy  in¬ 
creases,  the  two  negative  sensitivity  regions  coalesce  to  a  single  extremum  near  the 
top  of  the  barrier.  Unlike  the  collinear  sensitivities,  the  magnitudes  of  these  sen¬ 
sitivities  are  relatively  small  compared  to  those  at  higher  energy  regimes. 

The  0.67  eV  energy  slice  of  Figure  2  shows  the  sensitivities  just  beyond  the 
threshold  region.  The  negative  sensitivity  lobe  in  the  saddle  point  region  has  de¬ 
creased  in  both  extent  and  magnitude,  and  the  magnitudes  of  the  positive  shoulders 
have  increased.  With  increasing  energy,  the  positive  shoulders  grow,  become  more 
prominent,  and  move  towards  the  barrier  top  where  they  coalesce  in  the  inner 
comer  of  the  PES  and  eventually  displace  the  negative  sensitivities  at  the  barrier 
top.  Figure  2(b)  shows  an  energy  slice  at  the  end  of  this  sequence. 

From  0.82  to  0.98  eV  (resonance  energy),  a  similar  pattern  is  repeated  twice. 
First,  the  negative  shoulders  grow,  move  towards  the  barrier  top  and  coalesce  in 
the  inner  comei .  The  positive  sensitivities  at  the  barrier  top  are  not  displaced  though, 
but  are  pushed  out  to  the  outer  comer.  Second,  the  more  distant  positive  shoulders 
begin  to  grow,  move  towards  the  barrier  top,  and  coalesce  in  the  inner  comer. 
Figure  2(c)  shows  an  energy  slice  in  the  resonance  region.  The  positive  shoulders 
have  coalesced  with  each  other,  and  together  with  the  positive  lobe  at  the  outer 
comer  they  form  a  large  region  of  positive  sensitivity  near  the  PES  saddle  point.  Of 
the  200+  energy  slices  investigated,  the  5foo^/5K  sensitivities  attain  their  maxi¬ 
mum  positive  values  at  this  resonance  energy.  Unlike  the  behavior  of  the  collinear 
sensitivities  [24],  the  3-D  sensitivities  do  not  change  much  over  the  resonance 
zone.  The  largest  sensitivity  regions  do  not  flip  in  sign  as  the  energy  increases 
beyond  the  resonance,  and  the  magnitudes,  though  largest  at  the  resonance  energy, 
are  not  an  order  of  magnitude  greater  than  those  at  neighboring  energy  riices  as 
was  the  case  for  the  collinear  sensitivities.  The  shape  of  the  sensitivity  stmcture  at 
this  resonance  energy  implies  that  a  narrowing  of  the  passage  across  the  barrier  will 
increase  the  transition  probability  and  reduce  the  resonance  feature  (see  Fig.  1 ). 
Conversely,  if  the  PES  is  decreased  at  the  inner  and  outer  comers,  the  symmetric 
stretch  frequency  will  decrease  and  the  resonan''p  feature  will  be  enhanced. 

At  higher  energies  [see  Figs.  2(d)-2( f )].  the  pattern  seen  at  lower  energies  repeats 
itself.  The  sensitivities  at  the  shoulder  grow  in  magnitude  and  move  towards  each 
other  eventually  coalescing  at  the  inner  comer  of  the  pes.  Each  time  the  shoulders 
coalesce  though,  they  cut  further  into  the  inner  comer.  In  the  meantime,  the  sen¬ 
sitivities  near  the  barrier  top  decrease  in  magnitude.  The  result  is  that  the  extrema 


*  For  semantic  purposes,  the  definitions  of  the  inner  (outer)  comer  and  shoulder  r^ons  of  the  potential 
are  as  follows.  The  inner  (outer)  comer  of  the  potential  refers  to  the  region  on  the  upper  wall  of  the 
saddle  point  where  the  Hj  conformation  is  less  (more)  compact  than  the  conformation  at  the  top  of  the 
barrier  The  shoulder  regions  refer  to  the  two  opposite  locations  about  halfway  down  the  banter  along 
the  minimum  energy  path. 
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Sensitivities  for  v=0,j=0  to  v’=0,j’=0 


Figure  2.  Sensitivity  maps  for  transitions  from  u  =  0  j  =  0  to  u'  =  0  /'  =  0  at  9  =  180°. 
The  contour  values  are  drawn  in  increments  of  zincr  shown  on  the  plots.  Positive  contours 
are  represented  by  solid  lines  and  negative  contours  by  dashed  lines.  The  zero  contour 
line  is  omitted.  The  numbers  in  parentheses  represent  the  minimum  and  maximum  sen¬ 
sitivity  values  on  an  80  X  65  grid.  All  sensitivity  contour  values  are  given  in  units  of  eV 
bohr'”’.  Also  shown  on  the  plots  (solid  heavy  line)  is  the  steepest-descent  path  (in  mass- 
scaled  Jacobi  coordinates)  from  the  top  of  the  barrier 
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of  the  sensitivity  peaks  trace  out  a  path  that  significantly  cuts  the  inner  corner  of 
the  PES  and  is  removed  from  the  reaction  path.  Ip  fact,  the  region  of  the  PES  followed 
by  the  reaction  path  across  the  barrier  is  important  only  for  a  small  energy  regime 
just  beyond  the  threshold  energies.  Only  Figures  2(a)  and  2(b)  have  extrema  that 
directly  fall  on  the  reaction  path. 

Lastly,  we  note  that  at  higher  energies,  th**  sensitivities  in  the  tail  regions,  though 
oscillatory  in  nature,  are  predominantly  more  positive  than  negative.  The  sensitivity 
structures  in  these  tail  regions  also  do  not  change  appreciably  with  energy. 

Sensitivities  for  Transitions  from  v  =  0j-0tov'  =  0f=  1  (6  =  180° ) 

Figure  3  shows  the  5Foi^oo/ 51' sensitivities  at  the  same  energy  slices  as  shown 
in  Figure  2.  These  sensitivities  are  nonsymmetric  about  the  symmetric  stretch  line 
because  they  are  composed  from  the  product  of  two  different  scattering  wave  func¬ 
tions  [see  Eq.  ( 12)] — one  with  an  incoming  wave  in  channel  u  =  0,  j  =  0  on  the 
reactant  side  (lower  right-hand  side  in  the  figures),  and  the  other  with  an  incoming 
wave  in  channel  u'  =  0.  y'  =  1  on  the  product  side.  Below  the  resonance  energy 
(0.98  eV).  the  sensitivities  bear  an  uncanny  resemblance  to  those  of  SFoo-oo/Sl’, 
with  the  exception  that  the  magnitudes  are  almost  exactly  a  factor  of  2  greater,  and 
the  sensitivities  are  slightly  larger  on  the  reactant  side.  Incidentally,  the  transition 
probabilities  themselves  are  almost  exactly  a  factor  of  2  greater  than  Poo-oo  for 
energies  below  the  first  resonance  (see  Fig.  1 ). 

Above  the  resonance  energy,  the  lack  of  symmetry  in  the  sensitivities  becomes 
more  distinct.  The  positive  sensitivity  in  the  transition  state  region  becomes  more 
heavily  weighted  on  the  product  side,  eventually  joining  with  the  asymptotic  positive 
lobes  on  the  product  valley.  Thus,  from  1.10  to  1.50  eV.  the  sensitivities  on  the 
product  side  are  almost  completely  positive.  On  the  reactant  side,  the  sensitivities 
above  the  resonance  energy  continue  to  be  oscillatory  with  very  little  change  in 
structure.  The  extrema  of  the  sensitivity  peaks  and  valleys  follow  a  path  that  closely 
resembles  the  v  =  OJ  =  0  -►  u'  =  0,f  =  0  case.  The  sensitivities  in  the  two  shoulder 
regions  grow  and  coalesce  in  the  inner  comer  and  away  from  the  reaction  path. 
On  the  outer  corner  of  the  PES,  the  large  positive  lobe  at  the  resonance  energy 
decays  with  increasing  energy  and  a  region  of  negative  sensitivity  grows  in  its  stead. 
The  magnitudes  of  these  negative  sensitivities  are  small  compared  to  the  extrema 
that  cut  into  the  inner  corner,  so  again  the  region  of  the  PES  followed  by  the  reaction 
path  across  the  transition  state  is  of  secondary  importance  in  affecting  the  dynamics 
at  these  higher  energies. 

Sensitivities  for  Transitions  from  v  =  0  j  =  0  to  v'  =  0  f  =  2  (6  -  180°) 

Figure  4  shows  the  sensitivities  for  transitions  from  u  =  0. 7  =  0  to  v'  =  0,  /  = 
2.  Like  the  two  previous  sets,  the  5Po2-oo/ 5  F sensitivities  are  negative  and  relatively 
small  in  the  threshold  region.  Of  the  two  negative  peaks  at  the  classical  turning 
points,  the  product  side  has  a  larger  negative  sensitivity  until  about  0.63  eV.  Then, 
the  larger  sensitivity  region  shifts  over  to  the  reactant  side.  The  positive  shoulder 
on  the  reactant  side  grows,  becomes  more  prominent,  and  moves  towards  the  tran- 
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Sensitivities  for  v=0,j=0  to  v -0,j’=l 


R  (bohr)  R  (bohr) 

Figure  3.  Sensitivity  maps  for  transitions  from  u  =  Oy  =  0  to  t'  =  O7'  =  1  at  9  =  180°. 
The  details  of  the  contour  lines  are  the  same  as  in  Figure  2.  The  reactant  arrangement  is 
on  the  lower  right  branch. 


r  (bohr)  r  (bohr)  r  (bohr 

2.5  3.5  4.5  0.5  >.5  2.5  3  5  4.5  0.5  1.5  2.5 


578 


CHANG  AND  BROWN 


Sensitivities  for  v=0,j=0  to  v’=0,j’=2 


R  (bohr)  R  (bohr) 


R  (bohr) 


R  (bohr) 


Figure  4  Sensitivity  maps  for  transitions  from  n  =  0;  =  0toii’  =  0;'  =  2at9  =  180°. 
The  details  of  the  contour  lines  are  the  same  as  in  Figure  2. 
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sition  state  alone  to  displace  the  decaying  negative  transition  state  lobe.  Unlike  the 
two  previous  sets,  the  evolution  of  sensitivity  structure  does  not  involve  growth 
and  coalescence  of  sensitivity  shoulders  and  there  is  no  substantial  growth  of  sen¬ 
sitivity  in  the  inner  comer.  The  region  of  largest  sensitivity  from  about  0.70  to  1 .04 
eV  (a  large  energy  range)  is  a  positive-negative  dyad  which  moves  from  the  3.2  < 
/?  <  4. 1  bohr  region  to  the  2.2  <R  <  3.4  bohr  region  of  the  reactant  arrangement. 
Figures  4(b)  and  4(c)  show  two  energy  slices  in  this  sequence.  The  magnitude  of 
the  positive  sensitivities  peaks  at  about  0.985  eV  in  the  resonance  zone. 

Beyond  the  resonance  energy,  there  is  a  growth  of  positive  sensitivity  on  the 
product  side  and  negative  sensitivity  on  the  reactant  side,  and  this  persists  until 
about  1.42  eV  after  which  the  sensitivity  magnitudes  decay  through  the  end  of  our 
investigated  energy  range.  This  decay  is  presumably  due  tc  the  decaying  S-matrix 
element.  Lastly,  we  note  in  agreement  with  previous  cases,  the  transition  state 
region  has  very  small  sensitivities  at  the  higher  energies. 


Cumulative  Sensitivities  for  Transitions  from  v  =  Oj  =  0  to  all  Open  Product 
Channels  (6  =  180°) 

The  cumulative  transition  probability  sensitivities  are  obtained  by  summing  the 
individual  '/-oo/  ^  ^  sensitivities  over  all  open  product  channels  in  one  arrange¬ 
ment.  The  number  of  open  product  channels  range  from  6  (u'  =  0.  /  =  0, . . . ,  5) 

at0.50eVto31  (u'  =  0,/  =  0, ...,  13;u'=  1,/ =  0 . 10;u'  =  2./  =  0 . 

5 )  at  1 .50  eV.  Figure  5  shows  the  cumulative  sensitivity  map  at  the  same  six  energy 
slices  shown  on  previous  figures. 

At  the  lower  energies  from  the  threshold  to  about  0.86  eV,  all  the  large  sensitivity 
regions  lie  exclusively  in  the  product  arrangement  with  negligible  sensitivity  in  the 
reactant  arrangement.  This  result  is  certainly  counter-intuitive  but  can  be  understood 
by  considering  the  overlap  of  scattering  wave  functions  as  given  by  Eq.  ( 12).  The 
sensitivities  for  a  transition  from  v  =  0,j  =  0  to  an  excited  {v’,f)  state  which  has 
just  become  energetically  accessible  is  composed  of  ( 1 )  a  wave  function  with 
incoming  wave  from  the  reactant  arrangement  and  which  has  sufficient  translational 
energy  to  cross  the  barrier,  and  (2)  a  wave  function  with  incoming  wave  from 
the  product  arrangement  and  which  does  not  have  sufficient  energy  to  cross  the 
barrier  into  the  reactant  side  (because  most  of  its  energy  is  tied  up  in  internal 
vibrational-rotational  motion).  Consequently,  these  two  wave  functions  have  sig¬ 
nificant  overlap  only  in  the  product  arrangement.  Figure  6  shows  an  example  of 
this  trait  for  the  (0,  0)  -►  (0,  4)  transition  at  0.67  eV.  The  sensitivities,  as  shown 
in  Figure  6(a),  lie  predominantly  in  the  product  reg'on.  The  norms  of  the  two 
scattering  wave  functions  are  shown  in  Figs.  6(b)  and  6(c). 

Above  0.86  eV,  a  region  of  positive  sensitivity  just  to  the  reactant  side  of  the 
barrier  begins  to  grow  in  magnitude  but  stays  almost  stationary  in  configuration 
space.  The  large  region  of  negative  sensitivity  around  the  barrier  top  reaches  its 
extremum  value  at  ca.  0.88  eV  and  then  decays  by  flowing  into  the  inner  comer 
and  asymptotic  product  regions.  The  positive  peak  at  about  {R,r)  =  (3.0,  1.4) 
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Sensitivities  for  v=0,j=0  to  all  open  states 


Figure  5.  Cumulative  sensitivity  maps  (at  6  =  180')  for  transitions  from  v  =  0j  =  0xo 
all  open  product  channels  The  details  of  the  contour  lines  are  the  same  as  in  Figure  2. 
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Figure  6.  (a)  Sensitivity  map  for  the  »  =  0;  =  0  -*■  e'  =  O7’  =  4  transition  at  e  =  180°. 
The  details  of  the  contour  lines  are  the  same  as  in  Figure  2;  (b)  norm  of  the  wave 
function  with  incoming  wave  from  the  reactant  side;  (c)  norm  of  the  wave  function 
with  incoming  wave  from  the  product  side.  The  contour  values  on  the  wave  functions  are 
given  in  units  of  bohr'^. 
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bohrs  grows  to  a  maximum  at  ca.  1.01  eV  and  then  decays  as  it  moves  to  the  outer 
corner  region.  It  is  interesting  to  note  that  the  product  arrangement,  which  starts 
out  with  predominantly  positive  sensitivities  at  lower  energies,  eventually  evolves 
to  predominantly  negative  sensitivities  at  the  higher  energies. 


Sensitivities  at  Other  Scattering  Ai.gles 

The  reaction  barrier  for  the  H  +  H2  reaction  is  smallest  at  collinear  geometries 
and  increases  rapidly  by  more  than  a  factor  of  six  as  the  scattering  angle  subtended 
by  R  and  f  sweeps  from  180°  to  90°.  It  is.  therefore,  not  surprising  to  find  that 
both  the  magnitudes  of  the  wave  functions  and  of  the  sensitivities  decrease  dra¬ 
matically  as  the  Hj  geometry  moves  away  from  collinear.  Because  of  space  con¬ 
straints.  we  do  not  show  any  sensitivity  maps  at  other  scattering  angles  but  provide 
a  discussion  of  some  general  trends.  We  note  that  sensitivities  in  different  regions 
of  the  (R.r)  configuration  space  decay  at  different  rates  as  6  moves  away  from 
180°.  The  sensitivities  in  the  outer  comer  of  the  PES  tend  to  persist  longer  than 
those  at  either  the  inner  corner  or  at  the  asymptotic  regions.  It  is  not  uncommon 
to  find  the  sensitivity  magnitudes  at  these  outer  corners  increase  slightly  before  they 


Figure  7.  Sensitivities  of  the  i'  =  0  7  =  0  -►  r'  =  0  7’  =  0  transition  at  fixed  (R.r) 
configurations  as  a  function  of  scattenng  angle  and  total  energy. 
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decay.  Figure  7  «h.jws  some  of  these  trends  for  the  SPoi.-oo/ 5  F  sensitivities  at  select 
(R,r)  points  in  configuration  space  and  at  different  energies. 

At  0.98  eV,  two  points  were  chosen — one  on  the  outer  corner  at  (2.60,  1.50) 
bohrs,  and  the  other  on  the  inner  corner  at  (3.30,  1.60)  bohrs.  Both  points  corre¬ 
spond  to  extrema  of  their  respective  sensitivity  peaks  [  see  Fig.  2(c)].  The  sensitivity 
at  the  outer  corner  decays  more  slowly  and  goes  through  a  negative  extremum  at 
about  142°  whereas  the  inner  corner  dies  out  at  about  150°.  Two  other  points 
chosen  at  the  outer  comer  and  at  higher  energies  actually  have  sensitivity  magnitudes 
increase  as  0  moves  away  from  180°  before  they  eventually  decay.  This  is  not 
surprising  in  view  of  Figures  2  to  5  where  regions  of  large  sensitivities  at  tht  'gher 
collision  energies  were  seen  to  move  away  from  the  minimum  energy  path.  The 
cases  considered  here  are  somewhat  different  though  as  the  (R,r)  configurations 
which  do  have  large  sensitivities  at  bent  geometries  are  confined  to  the  outer  corner 
region  of  the  potential. 


Concluding  Remarks 

We  have  investigated  the  sensitivities  of  state-to-state  transition  probabilities  for 
the  3-D  y  =  0  H  +  Ht  reaction  at  several  energies  between  0.50  and  1.50  eV.  The 
particular  transitions  investigated  are  the  reactive  ones  starting  from  (u  =  0J  =  0) 
to  (u',/)  =  (0,  0),  (0,  1 ),  (0,  2),  and  the  cumulative  transition  from  (0,  0)  to  all 
open  channels.  Each  one  has  their  own  interesting  sensitivity  structure.  For  the  (0, 
0 )  -*•  ( 0. 0 )  and  ( 0, 0 )  -*■  ( 0, 1 )  transitions,  we  find  great  similarity  in  the  evolution 
of  their  sensitivity  structure  ( for  =  1 80° )  over  the  whole  energy  range.  It  was 
surprising  to  find  large  sensitivity  regions  localized  in  the  inner  comer  of  the  potential 
at  the  higher  collision  energies.  These  regions  of  large  sensitivities  should  be  inter¬ 
preted  carefully.  It  is  well  known  that  at  higher  energies,  the  dynamics  can  explore 
regions  of  configuration  space  far  removed  from  the  minimum  energy  path.  Inter¬ 
pretation  of  the  sensitivities  reveals  that  not  only  does  the  dynamics  explore  the 
inner  corner  region,  but  that  the  “fraction”  of  the  dynamics  which  does  the  ex¬ 
ploring — no  matter  how  large  or  small — will  be  sufficiently  distorted  by  inaccuracies 
in  the  PES  at  the  inner  corner,  that  the  “complete”  dynamics  wilt  be  distorted  the 
most  by  inaccuracies  from  this  region.  The  sensitivities  say  nothing  about  whether 
the  dynamics  “prefers”  to  go  over  the  lop  of  the  barrier  or  cut  the  comer. 

Another  caution  with  regard  to  interpreting  the  sensitivities  concerns  the  asymp¬ 
totic  regions.  Although  the  sensitivities  appear  to  be  smaller  in  the  asymptotic 
regions,  a  given  potential  perturbation  placed  in  the  asymptote  could  have  the  same 
perturbative  effect  on  the  transition  probabilities  as  if  the  same  perturbation  were 
placed  closer  to  the  shoulder  region.  This  is  because  the  Jacobian  factor  R^r^  in 
the  integral 

JbP 

— — - -  bV(  R,  r,  e)R^r^dR  dr  d{cos  6) 

bV(R,  r,  B) 

could  compensate  for  the  smaller  sensitivity  magnitudes.  However,  we  expect  that 
most  potential  variations  contemplated  for  the  asymptotic  region  would  be  rather 
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broad  (with  respect  to  /?)  so  that  the  oscillatory  nature  of  the  sensitivities  in  the 
asymptotic  region  would  tend  to  cancel  out  contributions  from  neighboring  regions 
and  diminish  the  perturbative  effect  on  the  observable. 

Among  the  other  regions  of  high  sensitivity  found  in  our  study,  the  most  notable 
is  the  outer  comer  of  the  potential  for  bent  geometries.  This  is  the  region  where 
the  H3  molecular  conformation  is  more  compact  than  the  conformation  at  the 
barrier  top.  The  sensitivity  magnitudes  tend  to  decay  as  the  geometry  becomes 
more  bent,  but  the  sensitivities  at  the  outer  comer  region  tend  to  decay  slower  with 
scattering  angle  and  may,  in  fact,  increase  ( for  some  noncollinear  geometries )  before 
they  decay. 

Lastly,  we  note  that  the  3-D  sensitivities  do  not  change  m  structure  as  rapidly 
with  increasing  energy  as  did  the  collinear  sensitivities  studied  earlier  [24].  This  is 
particularly  encouraging  as  the  simpler/slower  evolution  in  sensitivity  stmcture 
means  a  greater  promise  for  QFSA-aided  PES  refinement. 
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Abstract 

The  reaction  pathways  of  the  interaction  of  the  methane  molecule  with  the  cadmium  atom  in  the 
ground  state  (’S)  and  the  two  lowest  excited  states (’P,  'P)  were  determined  using  Hartree-Fock  followed 
by  multireference  configuration  interaction  plus  second-order  multireference  Mbller-Plesset  calculations. 
No  spontaneous  activation  of  the  methane  molecule  was  observed  by  these  electronic  states  of  the  cadmium 
atom  due  to  the  existence  of  activation  barriers  of  107.2,  27,  and  17  3  kcal/mol,  respectively.  The 
geometry  and  electronic  structure  of  the  HCcCHj  intermediate  molecule  and  the  CdH  and  CdCHj 
products  involved  in  the  reaciion  were  also  determined,  as  well  as  the  dissociation  pathway  of  HCdCH3 
towards  the  CdH  -I-  CH3  and  H  +  CdCH3  products.  We  found  that  the  HCdCH3  intermediate  reaches 
the  final  products  without  any  barriers  and  also  that  the  initial  reactants  in  the  ground  state  need  to  be 
provided  with  88.5  kcal/mol  to  obtain  the  CdH  +  CH3  products  and  with  99.8  kcal/mol  to  get  the  H 
CdCHi  products.  ®  1993  John  Wiley  &  Sons,  Inc 


Introduction 

During  the  last  decade  a  great  variety  of  experimental  techniques  have  been  used 
for  the  study  of  metal  atom  plus  methane  ( M  +  CH^ )  reactions.  These  experiences 
range  from  photoexcitation  of  metallic  atoms  trapped  in  cryogenic  alkane  matrices 
[1-8],  photoexcitation  of  metallic  vapors  in  the  presence  of  hydrocarbons  at  high 
temperatures  [9,10],  and  guided  ion  beam  tandem  mass-spectrometer  reactions  of 
metallic  atoms  with  hydrogen  and  alkanes  [1 1]  to  the  laser  pump-and-probe  re¬ 
actions  of  atoms  with  a  variety  of  polyatomic  molecules  [12,13],  For  the  Cd 
+  CH4  reaction,  an  important  experimental  work  by  Wallace  and  Breckenridge 
was  reported  [14]  regarding  the  study  of“action”  spectra  which  provide  information 
about  the  excited  Cd-CH^  potential  energy  surfaces  from  which  electronic  energy 
transfer  processes  occur.  In  that  work,  the  authors  used  the  technique  of  action 
spectroscopy  to  obtain  information  about  the  ground  and  excited  states  of  the  Cd- 
CH4  van  der  Waals  complex  and  to  study  “half-collision”  electronic  energy  transfer 
processes  within  the  complex.  We  were  particularly  interested  in  that  paper  because 
these  authors  have  found  that  the  interaction  of  Cd('P;5s'5p')  with  CH4  is  not 
efficient  for  the  formation  of  CdH  -F  CH3,  a  fact  which  is  attributed  to  the  possible 
existence  of  a  small  potential  barrier  for  the  chemical  interaction  of  Cd('P)  with 
the  C — H  bonds  in  CH4.  We  undertook  the  task  of  determining  the  symmetries, 
the  positions,  dissociation  energies,  and  vibrational  frequencies  of  the  ground  and 
the  two  lowest  excited  states  of  the  Cd-CH4  van  der  Waals  complexes  observed  in 
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Ref.  [14]  and  confirmed  the  facial  nature  of  the  Cd{  'S)-CH4  complex  and  the 
Cd(^P)-CH4  exciplex  [15],  For  the  Cd(  ‘P)-CH4  exciplex  it  has  just  been  possible 
to  exclude  the  edge-on  coordination;  this  still  leaves  the  Ca^  facial  or  vertex-on 
nature  of  this  exciplex  as  an  open  question. 

Ab  initio  studies  of  the  activation  of  C — H  bonds  by  metallic  atoms  start  with 
the  work  of  Blomberg  et  al.  [16],  who  investigated  the  interaction  of  a  nickel  atom 
with  methane.  From  this  work  the  importance  of  the  role  played  by  the  excited 
states  of  the  metallic  atom  was  clear.  This  relevant  role  of  excited  states  has  been 
confirmed  in  the  experimental  work  of  Georgiadis  and  Armentrout  on  the  activation 
of  methane  by  some  ions  of  transition  metals  [11].  More  recently,  theoretical  results 
for  rhodium  [17]  and  copper  [18-22]  in  the  activation  of  methane  also  stress  the 
importance  of  the  excited  states.  Similar  experimental  [  23-26  ]  and  theoretical  [  27- 
36]  studies  on  transition  metal  atom  activation  of  H2  molecules  exist  as  well.  Par¬ 
ticularly,  we  have  been  interested  on  the  interaction  of  ( 'p:ns  'np ' )  metal  atoms  with 
the  C — H  bonds  in  methane.  In  a  very  recent  work  [36]  we  predicted  the  'P  atomic 
state  of  Zn  to  be  responsible  for  the  formation  of  the  HZnCH3  intermediate.  We 
decided  to  test  if  the  same  nonadiabatic  pathway  could  lead  to  the  formation  of 
the  corresponding  intermediate  if  one  uses  cadmium  (which  is  isoelectronic  in  the 
valence  shell  with  zinc)  instead  of  zinc. 

The  aim  of  the  present  work  is  to  study  theoretically  the  Cd  +  CH4  reaction  to 
obtain  the  reaction  pathways  as  well  as  the  governing  mechanisms.  For  this  purpose 
we  have  used  the  methodology  followed  in  previous  studies  [21,22].  As  our  main 
interest  is  the  study  of  the  interaction  of  the  cadmium  atom  in  its  ^  'P  excited  states 
with  methane,  we  have  used  a  strategy  that  allows  us  to  obtain  a  pathway  which  is 
very  close  to  the  true  pathway  of  interaction.  We  keep  the  CH4  geometry  fixed 
during  the  insertion  process  except  the  C-H„  distance,  which  is  relaxed  in  the  C, 
step  of  the  insertion  (see  Fig.  1 ).  First  of  all,  we  have  optimized  the  geometry  of 
the  HCdCHs  molecule  which  has  the  Cj^,  symmetry  as  expected;  this  means  that 
we  know  the  final  exact  position  of  the  cadmium  atom  between  the  C  and  H„ 
atoms.  The  insertion  mechanism  can  essentially  be  divided  in  two;  (a)  The  Cd 
atom  approaches  the  C — H  bond  in  the  reflection  plane  (x-v  €21  plane)  keeping 
the  CH4  geometry  fixed  (from  ,4  to  B  in  Fig.  1 ).  (b)  As  the  metal  atoms  leaves  the 
Civ  plane  (from  R  to  C),  for  each  C — H„  relaxed  distance  considered,  we  have 
optimized  the  position  of  the  cadmium  atom  in  the  C,  symmetry  (this  implies  two 
orthogonal  directions)  until  the  metal  is  between  the  C  and  H„  atoms  to  finally 
obtain  the  optimized  HCdCH3  molecule  in  C31.  symmetry. 

All  optimizations  were  performed  at  the  configuration  interaction  -I-  second- 
order  multireference  Moller-Plesset  perturbation  (mrci-mp2)  level.  The  paths 
leading  to  the  HCd  +  CH3  and  H  +  CdCHs  products  that  can  be  attained  from 
HCdCH3  were  separately  studied,  and  their  geometries  were  also  optimized  in  order 
to  estimate  relative  energy  differences.  The  next  section  is  devoted  to  the  compu¬ 
tational  details  of  the  calculations.  The  third  section  presents  the  molecular  and 
electronic  structure  of  the  optimized  reactants,  products,  and  intermediate  complex. 
A  discussion  of  the  reaction  mechanism  and  its  features  is  followed  by  a  comparison 
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Figur*"  I .  Reaction  path  followed  by  the  cadmium  atom  ( 'P:3d  '®4s'4p ' ),  leading  to  the 
formation  of  the  HCdCHj  molecule. 


of  the  present  reaction  with  the  Zn  +  CH4  reaction  stressing  the  differences  between 
them.  Finally,  in  the  fourth  section  we  present  our  conclusions. 

Method 

The  self-consistent  field  (SCF)  calculations  were  performed  with  the  pshf  code 
[37],  which  includes  the  nonempirical  relativistic  effective  core  potential  (recp) 
method  of  Durand  et  al.  [  38-42  ] .  The  calculations  have  been  made  with  a  crypton- 
type  RECP  for  cadmium  and  a  helium-type  recp  for  carbon,  leaving  us  with  only 
20  electrons  to  deal  with  explicitly.  The  recp  of  carbon  and  cadmium  were  taken 
from  Refs.  [42]  and  [43],  respectively.  For  carbon  a  double-f  gaussian  basis  set 
was  taken  from  Ref.  [42],  while  for  cadmium  we  have  taken  a  basis  set  [43]  which 
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Table  I.  Geometry  of  the  HCdCHj  molecule. 


Distances  (A) 

Cd-C 

2.2 

Cd-H 

1.7 

C-H 

1.09 

Angle  (deg) 

^HCCd 

III.O 

is  of  triple-f  quality  for  the  5s  and  4d  shells  with  two  polarization  p  orbitals;  the 
contraction  for  this  Cd  basis  is  (111/21/211).  The  basis  sets  for  carbon  and  hydrogen 
have  been  successfully  tested  before  and  are  the  same  as  those  reported  in  Ref. 
[21].  In  order  to  obtain  the  two  open-shell  scf  orbitals  of  the  1  and  2  'A'  mo¬ 
lecular  states  arising  from  the  Cd(^’'P)  +  CHi  asymptotes  we  used  the  SCF  molecular 
orbitals  (MO)  of  the  1  ‘A'  {‘S)  state  obtained  with  the  PSHF  program  as  input  for 
the  MCSCF  process  [44].  The  electronic  correlation  effects  have  been  taken  into 
account  through  the  CIPSI  algorithm  in  its  two-class  version  [45].  First,  a  zeroth- 
order  multireference  space  §  (containing  approximately  100  determinants  in  this 
work)  is  diagonalized  and  is  used  to  generate  the  perturbational  space  P  through 
the  mp2  scheme.  P  contains  from  20  to  30  million  determinants  depending  on  the 
geometry  in  question.  All  geometries  and  energies  have  been  optimized  using  this 
MRCI-MP2  scheme. 


Results  and  Discussion 

Molecular  Structure  of  the  HCdCH 3  Intermediate  Molecule 

The  most  stable  geometry  of  the  HCdCHs  intermediate  has  C3„  symmetry 
and  corresponds  to  a  '  Ai  electronic  state.  The  optimized  geometrical  parameters 
appear  in  Table  I.  The  'Ai  valence  electronic  configuration  is  given  by 
10a?6e^7e^l  la?8e''l2a?13a?,  where  the  lOai  orbital  is  mainly  the  binding  of  the 
carbon  and  the  hydrogens  of  methyl  radical,  6e  =  (djtr,  d,.;),  7e  =  (d;^,.,  d;c2_y3), 
1  lai  =  (d^j).  The  8e  orbitals  bind  the  hydrogen  atoms  to  carbon.  The  t2ai  is  a  ff 
orbital  that  links  C(2p;)  with  Cd(4s)  (with  traces  of  d;j)  and  H(  Is).  Finally 
the  1 3ai  is  a  <t  orbital  that  binds  the  three  centers  on  the  C3  axis  with  the  C(2p.), 
Cd(5p2),  and  the  H(  Is)  orbitals.  The  scf  Mulliken  electronic  population  reveals 
a  4d'°5s°  ’^5p®  *^  charge  distribution  on  Cd  which  corresponds  approximately 
toCd(II). 

Molecular  Structure  of  CdCHs 

The  most  stable  geometry  of  CdCHy  corresponds  also  to  a  C3„  conformation 
with  a  ^Ai  electronic  state.  The  optimized  geometrical  parameters  are  presented  in 

Table  IL  Geometry  of  the  CdCHj  molecule 


Distances  (A)  Cd-C  2.2 

C-H  1.1 1 

Angle  (deg)  zHCCd  108.5 
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Table  II.  The  ^A|  valence  electronic  configuration  is  dominated  by 
10ai6e‘’7e‘‘l  Ia?8e^l2ail3ai,  where  the  lOai  orbital  is  mainly  the  binding  of  the 
carbon  and  the  hydrogens  of  the  methyl  radical,  6e  =  (d;^;,  d^j),  7e  =  (d^j, 
d;t2->=)and  llai  =  (d^j).  The  8e  orbitals  bind  carbon  (2p;t,  2p^.)  with  the  hydrogen 
atoms  (Is  orbitals).  The  12ai  orbital  links  the  carbon  (2p-)  with  cadmium  (5s) 
and  finally  the  13ai  orbital  corresponds  to  a  bond  between  carbon  sp(2p-  +  2s) 
and  cadmium  sp(-5s  -  5p;). 

The  CdH  Molecule 

The  most  stable  geometry  of  CdH  corresponds  to  a  ^2  state,  which  has  a  valence 
electronic  configuration  given  by  9<r^25^2ir^lO(T^l ler*,  where  9a  =  (d^Oi  26 
=  (d;f2_^2,  djc,.),  2ir  =  (djfj,  d,-).  The  10<r  orbital  binds  the  cadmium  sd(s  -  d^:) 
with  the  hydrogen  ( Is)  and  finally  the  1  l<r  orbital  mixes  the  cadmium  sp(-s  +  p.) 
with  the  H(  Is).  The  equilibrium  distance  of  CdH  was  found  to  be  1.76  A  and  is 
in  excellent  agreement  with  the  experimental  Re  value  of  1.761  A  [46]. 

Energetics  of  the  Cd  +  CH4  -*■  HCdCHj  Reaction 

Figure  2  shows  the  relative  energies  of  the  reactants,  the  HCdCH3  complex  and 
the  HCd  +  CH3  or  CdCH3  +  H  products.  We  present  there  our  calculated  values 
(full  lines)  as  well  as  available  experimental  data  (dotted  lines). 

The  experimental  bond  disociation  energies  (bde),  on  the  one  hand,  allow  us 
to  perform  an  analysis  of  the  energetics  of  the  reaction  and,  on  the  other  hand, 
they  will  be  used  to  generate  reliable  energy  differences  against  which  our  calculated 
values  can  be  compared.  The  experimental  bde  of  the  C — H  bond  in  methane  is 
105  kcal/mo!  whereas  that  of  the  Cd — H  bond  is  15.6  kcal/mol  [46].  These  values 
allow  us  to  calculate  that  the  “experimental”  energy  difference  between  the  CdH 
+  CH3  products  and  the  Cd(  'S)  +  CH4  reactants  is  89.4  kcal/mol;  our  MRCI-MP2 
value  for  this  difference  is  in  very  good  agreement  and  was  found  to  be  88.5  kcal/ 
mol.  The  bde  for  the  Cd — C  bond  of  CdCH3  has  not  been  experimentally  deter¬ 
mined  but  our  calculated  value  is  99.8  kcal/mol.  As  far  as  the  spectroscopy  of  the 
cadmium  atom  concerns,  the  experimental  transition  energies  ( 7,)  between  the 
^P(4d'°5s‘5p')  and  'P(4d'®5s'5p')  excited  states  and  the  'S(4d''’5s^)  ground  state 
are  89.3  kcal/mol  and  124.9  kcal/mol  [47].  The  ^P  energy  is  the  7-averaged  value 
of  the  three  spin-orbit  components  of  Ref.  [47].  Our  calculated  7.  are  84.9  and 
125.6  kcal/mol,  respective! 

The  optimized  HCdCHs  complex  was  calculated  to  lie  31.9  kcal/mol  above  the 
Cd(  ’S)  +  CH4  reactants;  the  endothermic  formation  of  an  HMCH3  molecule  (M 
=  metal)  has  also  been  found  in  Ni,  Cu,  and  Zn  [16,22,36].  Figure  3  shows  the 
reaction  paths  followed  by  the  reactants  to  form  the  HCdCH3  complex.  The  curve 
2 'a'  initially  presents  an  attractive  section  between  the  Cd(  'P)  excited  atom  and 
methane;  next,  this  curve  shows  an  activation  barrier  of  1 7.3  kcal/mol.  This  means 
that  the  Cd('P)  excited  atom  does  not  insert  spontaneously  into  the  C — H  bond 
of  CH4.  Also  this  figure  presents  two  avoided  crossings  between  the  2  'A'  and  1  'A' 
curves.  This  means  that  initially  the  Cd(  ’P)  +  CH4  fragments  follow  the  2  'A'  curve 
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Figure  2.  Energy  diagram  o‘‘the  reactants,  the  intermediate  HCdCH}  molecule,  and  the 
products:  ( - )  experimental  figures;  ( — )  calculated  results. 


and  then  by  means  of  a  nonadiabatic  transition  in  any  of  the  avoided  crossing 
regions  they  may  fall  to  the  1  'A'  curve  that  correlates  with  the  Cd('S)  +  CH4 
fragments.  Note  that  the  lower-lying  curve  is  initially  repulsive  and  that  it  becomes 
attractive,  as  expected,  only  after  the  first  avoided  crossing  has  taken  place.  This 
1  'A'  curve  has  an  activation  barrier  of  107.2  kcal/mol. 

The  experimental  picture  presented  in  Ref.  [14]  shows  that  the  cadmium  atom 
in  the  ‘P  excited  state  does  not  break  efficiently  the  C — H  bond  in  CH4.  We  have 
seen  that  this  can  be  explained  by  the  existence  of  a  barrier  of  more  than  17  kcal/ 
mol  in  the  2  'A'  curve  in  Figure  3. 

As  far  as  the  Cd(^P)  +  CH4  reaction  is  concerned,  we  have  previously  reported 
a  C3„  face-on  Cd(^P)-H3CH  van  der  Waals  exciplex  [15],  and  its  corresponding 
minimum  can  be  seen  (here  exaggerated )  at  the  left  of  the  1  ^A'  curve  in  Figure 
4.  Note  that  another  activation  barrier  calculated  to  be  27  kcal/mol  high  separates 
the  reactants  from  a  stable  bent  intermediate.  This  value  is  in  excellent  agreement 
with  the  experimentally  measured  value  of  9000-10,000  cm"'  for  the  barrier 
height  [14]. 


»9 


Potential  energy  curves  of  the  interaction  of  Cd(  'S)  and  Cd(  'P)  with  methane  vs.  insertion  angle  towards  the  HCdCHj  intermediate. 


594 


CASTILLO,  RAMI'REZ-SOLI'S,  AND  POULAIN 


Figure  4.  Potential  energy  curve  of  the  interaction  of  Cd(’P:5s5p)  state  with  methane. 


Energetics  of  the  HCdCHj  Dissociation  towards  the  Products 

In  Figure  5  the  reaction  pathways  corresponding  to  the  dissociation  of  HCdCHs 
are  depicted.  The  HCdCHj  complex  is  shown  to  dissociate,  without  any  activation 
barrier,  into  the  HCd  +  CHy  products  when  furnished  with  56.6  kcal/mol  or  to 
the  H  +  CdCHa  products  when  given  67.9  kcal/mol.  The  2 ‘A'  curve  becomes 
monotonically  increasing  after  the  avoided  crossings  with  the  i '  A'  curve  and  leads 
eventually  to  the  excited  CdH(^2^)  +  CHy  fragments. 

Comparison  of  the  Interaction  Cd — C//<  with  Zn — CH^ 

The  cadmium  and  zinc  atoms  are  isoelectronic  in  their  valence  shells,  so  it  is 
expected  that  their  respective  interactions  with  methane  would  present  some  com¬ 
mon  features.  We  can  note,  for  example,  in  both  cases  that; 

a.  The  presence  of  high  energy  barriers  between  the  ground  state  M('S:d’“s^) 
+  CH4  and  the  HMCH3  intermediate. 

b.  The  formation  of  the  HMCH3  molecule  is  only  reached  by  means  of  an  en¬ 
dothermic  process. 

c.  The  relative  energetic  position  of  the  products  and  the  intermediate  is  the 
same,  namely,  HMCH3  <  HM  +  CH3  <  H  +  MCH3. 


Dissociation  curves  of  HCdCHj  towards  the  CdH  +  CHj  and  H  +  CdCHs  products. 
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d.  The  formation  of  the  HM  +  CH3  and  H  +  MCH3  systems  can  only  be  reached 
by  means  of  endothermic  processes  and  without  any  barrier  between  the  in¬ 
termediate  HMCH3  and  the  products. 

We  would  like  to  note  that  the  ‘P(d*°s‘p')  excited  state  of  Cd  and  Zn  has  two 
special  features  in  the  M  +  CH4  interaction:  It  is  the  lowest  excited  state  that  has 
the  same  spin  multiplicity  as  the  stable  HMCH3  complex,  and  its  electronic  con¬ 
figuration  allows  it  to  interact  favorably  with  the  tr  and  a*  molecular  orbitals  of 
methane. 

However,  unlike  with  the  Zn(‘P:3d'®4s'4p')  state,  the  Cd  atom  in  its  (‘P: 
4d  “’5s '5p')  shite  cannot  spontaneously  break  the  C — H  bonds  in  CH4;  this  result 
is  very  similai  to  that  found  for  the  interaction  of  Mg('P:2s'2p‘)  with  methane 
[48].  This  seems  to  indicate  that  since  the  formation  of  the  Cd(  'P)-CH4  van  der 
Waals  exciplex  has  already  been  observed  [14]  and  theoretically  confirmed  by  our 
recent  calculations  [15],  the  attractive  interaction  of  the  5p  orbital  of  the  Cd  with 
the  a*  orbital  of  methane  that  could  lead  to  a  stable  HCdCH3  intermediate  is 
overcome  at  short  distance  by  the  existing  repulsion  between  the  singly  occupied 
5s  orbital  of  the  metal  and  the  doubly  occupied  a  orbital  of  the  methane. 


Conclusions 

The  ‘S(4d'°5s^)  ground  state  of  the  cadmium  atom  is  not  capable  of  breaking 
the  C — H  bond  in  CH4  due  to  the  presence  of  a  very  high  barrier  ( 107  kcal/mol) 
between  Cd('S)  +  CH4  reactants  and  HCdCH3  molecule. 

The  reaction  pathways  of  Cd(  ‘S)-CH4  and  Cd(  'P)-CH4  show  two  regions  where 
these  curves  present  avoided  crossings.  The  'P( 5s '5p')  cadmium  state,  unlike  the 
isoelectronic  Zn(  ‘P:4s‘4p ' )  state,  does  not  break  spontaneously  the  C — H  bonds 
in  CH4,  due  to  the  presence  of  a  17.3  kcal/mol  barrier  in  the  reaction  pathway  of 
this  state  with  methane.  This  excited  cadmium  atom  can  only  form  exothermically 
a  C3„  van  der  Waals  complex  with  methane. 

The  HCdCH3  intermediate  is  formed  endothermically  if  one  starts  with  the  ground 
states  of  Cd  and  methane,  and  it  can  also  be  reached  from  the  interaction  between 
Cd('P)  and  CH4;  it  was  found  that  it  is  necessary  to  provide  approximately  17 
kcal/mol  to  the  Cd(  ‘P)  -f  CH4  reactants  to  obtain  the  HCdCH3  intermediate  via 
a  nonadiabatic  transition  with  the  repulsive  lower-lying  curve,  which  is  asymptot¬ 
ically  related  to  the  ground  state  fragments. 

The  cadmium  ^P  state  has  been  shown  not  to  activate  spontaneously  the  methane 
molecule  because  there  is  a  27  kcal/mol  calculated  energy  barrier  (in  excellent 
accordance  with  recent  experiments)  between  the  Cd(^P)  +  CH4  reactants  and  a 
stable  bent  minimum. 

The  HCdCH3  complex  is  shown  to  dissociate  into  the  HCd  +  CH3  or  to  the  H 
+  CdCH3  products  without  any  activation  barrier. 

Finally,  the  results  presented  in  this  work  are  in  accordance  with  those  reported 
in  the  experimental  studies  already  mentioned. 


Cd('S.  ’P.  ’P)  REACTION  WITH  METHANE 


597 


Bibliography 

[  1  ]  W.  E.  Billups,  M.  M  Konarski,  R,  H.  Hauge.  and  J.  L.  Margrave.  J.  Am.  Chem.  Soc.  102,  7394 
(1980). 

[2]  G.  A.  Ozin,  D.  F.  McIntosh,  S.  A.  Mitchell,  and  J.  Garcia-Prieto,  J.  Am.  Chem.  Soc.  103,  1574 
(1981). 

[3]  G.  A.  Ozin,  J.  G.  McCaffrey,  and  D.  F.  McIntosh,  Pure  Appl.  Chem.  56,  1 1 1  ( 1984). 

[4]  J.  M.  Pamis,  S.  A.  Mitchell.J.Garcia-Prieto,  andG.  A.  Ozin,  J.  Am.  Chem.  Soc.  107, 8169  ( 1985). 

[5]  G.  A.  Ozin  and  J.  G.  McCaffrey,  Angew.  Chem.  Int.  Ed.  Engl.  25,  1072  ( 1986). 

[6]  J.  M.  Pamis  and  G.  A.  Ozin,  J.  Phys.  Chem.  93.  1204  ( 1989). 

[7]  J.  M.  Pamis  and  G.  A.  Ozin,  J.  Phys.  Chem.  93,  1220  ( 1989). 

[8]  J.  M.  Pamis  and  G.  A.  Ozin,  J.  Phys.  Chem.  93, 4023  ( 1989). 

[9]  S.  Yamamoto,  Bull.  Chem.  Soc.  Jpn.  61, 769  (1988). 

1 10]  S.  Yamamoto  and  H.  Hokamura,  J.  Phys.  Chem.  95,  2138  ( 1991 ). 

[11]  R.  Georgiadis  and  P.  B.  Armentrout,  J.  Phys.  Chem.  92,  7067  (1988). 

[  12]  W.  H.  Breckentidge  and  H.  Umemoto,  J.  Chem.  Phys.  75, 698  ( 1981 ). 

1 13]  W.  H.  Breckenridge  and  H.  Umemoto,  J.  Chem.  Phys.  81 , 3852  ( 1984). 

[  14]  1.  Wallace  and  W.  H.  Breckenridge,  J.  Chem.  Phys.  97,  2318  ( 1992). 

[15]  A.  Ramirez-Solis  and  S.  Castillo.  J.  Chem.  Phys  98,  8065  ( 1993). 

[16]  M.  R.  A.  Blomberg,  U.  Brandemark,  and  P.  E  M.  Siegbahn.  J.  Am  Chem.  Soc  105,5557(1983). 

[17]  M.  R.  A.  Blomberg,  P.  E.  M.  Siegbahn,  and  M.  Svensson,  J.  Phys.  Chem.  95, 4313  ( 1991 ). 

[18]  R.  A.  Poirier,  G.  A.  Ozin,  D.  F.  McIntosh,  I.  G.  Csizmadia,  and  R.  Daudel,  Chem.  Phys.  Lett. 

101,221  (1983). 

[  19]  G.  E.  Quelch  and  I.  H.  Hillier,  J.  Chem.  Soc.  Faraday  Trans.  2  83,  2287  (1987). 

[20]  G.  E.  Quelch  and  I.  H.  Hillier,  Chem.  Phys.  121,  183  ( 1988). 

[21  ]  S.  Castillo,  E.  Poulain,  and  0.  Novaro,  Int.  J.  Quant.  Chem.  23, 509  ( 1989) 

[22]  S.  Castillo,  E.  Poulain.  and  O.  Novaro.  Int.  J.  Quant.  Chem.  25,  577  ( 1991 ). 

[23]  W.  H.  Breckenridge  and  H.  Umemoto,  J.  Chem.  Phys.  80,  4168  ( 1984). 

[24]  P.  D.  Kleiber,  A.  M.  Lyyra,  K.  M.  Sando.  Zafiropulos,  and  W.  C.  Stwalley,  J.  Chem.  Phys.  85, 
5493(1986). 

[25]  J.  L.  Elkind  and  P.  B.  Armentrout,  J.  Chem.  Phys.  86,  1868  ( 1987). 

[26]  J.  M.  Pamis  and  G.  A.  Ozin,  J.  Phys.  Chem.  93,  1215  ( 1989). 

[27]  M.  R.  A.  Blomberg,  U.  Brandemark,  L.  Petterson,  and  P.  E.  M.  Siegbahn,  Int.  J.  Quantum.  Chem. 
23,855(1983). 

[28]  A.  Goddard  III  and  J.  J.  Low,  J.  Am.  Chem.  Soc.  106,  8321  ( 1984). 

[29]  J.  Garci'a-Pneto,  M.  E.,  Ruiz  and  E.  Poulain,  J.  Chem  Phys.  81,  5920  (1984)  and  references 
therein. 

[30]  J  Garcia-Prieto,  M.  E.  Ruiz,  and  O.  Novaro,  J.  Am.  Chem.  Soc.  107,5635(1985). 

[31  ]  M.  E.  Ruiz,  J.  Garcia-Prieto,  E.  Poulain,  G.  A.  Ozin,  R.  A.  Poirier.  S.  M.,  Matta,  1.  G.  Czismadia. 
C.  Grade,  and  O.  Novaro,  J.  Phys.  Chem.  90,  279  ( 1986). 

[32]  E.  Poulain,  J. Garcia-Prieto,  M.  E.  Ruiz, and  O.  Novaro.  Int.  J  Quant  Chem. XXIX.  1 181  ( 1986) 

[33]  K.  Balasubramanian  and  J.  Z.  Wang.  J.  Chem.  Phys.  91,  7761  (1989). 

[34]  E.  Poulain,  F.Colmenares,S. Castillo, and O.  Novaro.  J.  Mol.  Struc  (Theochem)  210,337  ( 1990) 

[35]  J.  M.  Martinez-Magadan,  A.  Ramircz-SoIis,  and  O.  Novaro.  Chem.  Phys.  Lett.  186, 107  ( 1991 ). 

[36]  S.  Castillo,  A.  Ramirez-Solis,  D.  Diaz.  E.  Poulain.  and  O.  Novaro.  Mol.  Phys.  ( 1993).  to  appear. 

[37]  J.  P.  Daudey,  PSHF  program  and  pseudopotential  adaptation,  based  on  the  original  HONDO  76. 
QCPE  Program  338  ( 1977)  by  M.  Dupuis,  J.  Rys,  and  H.  F.  King. 

[38]  Ph.  Durand  and  J.  C.  Barthelat.  Theor.  Chim.  Acta  38,  283  ( 1975). 

[39]  J.  C.  Barthelat,  Ph.  Durand,  and  A.  Serahni,  Mol.  Phys.  33,  179  ( 1977). 

[40]  J.  C.  Barthelat  and  Ph.  Durand,  Gazz.  Chim  Ital.  108,  225  ( 1978). 

[41  ]  M.  Pelissier  and  Ph.  Durand,  Theor.  Chim.  Acta  55, 43  ( 1980). 

[42]  G.  F.  Pacchioni.  P.  Fantucd,  G.  Giunchi.  and  J.  C.  Barthelat.  Gazz.  Chim.  Ital  110, 183  ( 1980). 

[43]  L.  A.  LaJohn,  P.  A.  Christiansen,  R.  B.  Ross,  T  Atashroo,  and  W.  C.  Ermler.  J.  Chem.  Phys.  84, 
2812(1987). 


598 


CASTILLO.  RAMiREZ-SOLfS,  AND  POULAIN 


[44]  R.  Carbo,  J.  Rubio,  M  Pelissier,  and  J.  P  Daudey.  GMCP  program,  based  on  the  Elementary 
Jacobi  Rotation  algorithm. 

[45]  B.  Huron.  J.  P.  Malneu,  and  P.  Rancurel.  J.  Chem.  Phys.  58.  5745  (197.^);  M.  Pelissier.  J.  P. 
Daudey.  J.  P.  Malrieu,  F.  Speigelmann,  D.  Maynau,  and  S.  Evangelisti,  CIPSI  code. 

[46]  G.  Herzberg.  Molecular  Spectra  and  Molecular  Structure  I  Spectra  of  Diatomic  Molecules  (Van 
Nostrand,  Princeton,  NJ  1950). 

[47]  C  E.  Moore,  v4Mw/c£ne.-g>’LevWs(U.S.  National  Bureau  of  Standards.  Washington,  DC,  1971). 

[48]  P.  Chaquin,  A.  Papakondylis,  C.  Giessner-Prettre  and  A.  Sevin,  J.  Phys.  Chem.  94, 7352  ( 1990). 


Received  May  1 1,  1993 


Use  of  Energy  Partitioning  for  Predicting  Primary 
Mass  Spectrometric  Fragmentation  Steps: 

A  Preliminary  Account 

I.  MAYER  and  A.  GOMORY 

Central  Research  Institute  for  Chemistry  of  the  Hungartan  Academy  of  Sciences.  H-1525  Budapest. 

PO.  Box  17.  Hungary 


Abstract 

Energy  partitioning  (decomposition  of  the  total  energy  into  one-  and  two-center  contributions)  per¬ 
formed  at  the  MNDO  level  of  theory  has  been  applied  to  interpret  the  first  steps  of  mass  spectrometric 
fragmentation  processes.  In  all  cases  considered  until  now.  the  changes  of  bond  contributions  during  a 
vertical  ionization  process  could  be  used  to  predict  the  main  primary  bond  cleavages  observed  in  the 
mass  spectra  of  organic  molecules.  ©  W93  John  Wiley  &  Sons.  Inc 


Introduction 

In  the  course  of  mass  spectrometric  experiments,  molecules  undergo  ionization 
and  fragmentation  under  the  electron  impact.  Although  the  fragmentation  of  ions 
formed  by  electron  bombardment  ha  par  excellence  dynamic  process,  the  existence 
of  many  regularities  in  mass  spectra  of  related  organic  molecules  indicates  that 
there  are  some  simple  underlying  physical  factors  influencing  them  to  a  great  extent. 
Obviously,  the  electronic  structure  of  the  lowest  electronic  state(s)  of  the  ions  ob¬ 
tained  after  the  ionization  step  should  often  be  of  large  (or  even  decisive)  importance 
in  determining  the  primary  bond  cleavages.  This  assumption  is  supported  by  the 
recent  successes  [1-6]  in  interpreting  the  primary  mass  spectrometric  fragmentation 
steps  by  considering  the  changes  in  bond  order  (multiplicity)  indices  [7]  taking 
place  during  the  ionization.  In  the  case  of  the  NDO-type  semiempirical  theories, 
these  bond  orders  reduce  to  the  familiar  Wiberg's  bond  indices  [8].  It  has  recently 
been  demonstrated  [9]  that  the  Wiberg  indices  obtained  in  the  framework  of  the 
semiempirical  MNDO  theory  correlate  very  well  with  the  ab  initio  values,  as  well  as 
that  the  mndo  Wiberg  indices  represent  rather  useful  tools  in  interpreting  mass 
spectra  [1-6]. 

Bond  order  or  Wiberg  indices  reflect  the  actual  multiplicity  of  the  bonds  in  ques¬ 
tion;  they  are  in  a  correlation  with  the  bond  strengths  but  are  not  energetic  quantities 
which  could  be  directly  related  to  the  energy  consumption  of  the  bond  cleavage 
processes.  For  that  reason  we  have  investigated  the  applicability  of  energy  parti¬ 
tioning  performed  at  the  MNDO  level  of  theory  for  interpreting  and  predicting  the 
primary  mass  spectrometric  fragmentations.  We  do  not  assume  that  the  new  method 
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should  replace  the  use  of  bond  orders  in  discussing  fragmentations.  However,  it 
gives  some  additional  information  about  the  effects  taking  place  during  the  ion¬ 
ization,  which  is  worth  being  utilized.  In  fact,  we  suggest  using  these  two  sets  of 
parameters  simultaneously.  However,  an  obvious  advantage  of  energy  parti¬ 
tioning  is  in  putting  the  changes  in  bond  strengths  to  the  energetic  scale,  relating 
them  better  to  the  physical  conditions  of  a  mass  spectrometric  experiment  than 
the  bond  order  does. 


Energy  Partitioning 

As  is  well  known,  the  total  rhf  energy  of  a  closed  shell  molecule  or  that  of  an 
ion  treated  by  the  “half-electron”  scheme  is 

£  =  2  KJ),,  +  i  2  \  DM(tip  I  vr )  +  2  ^  ( 1 ) 

It.**  A<B 

with  the  usual  notations  for  the  elements  of  the  one-electron  matrix  h,  density 
matrix  D,  two-electron  integrals  in  the  ( 1 1 1 22)  convention  as  well  as  nuclear  charges 
Za  and  intemuclear  distances  R^b-  In  the  framework  of  the  MNDO  scheme,  there 
are  only  one-  and  two-center  integrals  contributing  to  the  elements  of  matrix  h  as 
well  as  only  one-  and  two-center  two-electron  integrals  (/iplvr)  are  considered. 
Therefore,  similarly  to  other  ndo  theories  (cf.  [  1 0, 1 1  ] ) ,  the  total  mndo  energy  can 
unambiguously  be  decomposed  into  one-  and  two-center  contributions.  In  practice, 
the  introduction  of  such  an  energy  partitioning  into  the  standard  mndo  program 
for  rhf  closed  shells  and  half-electron  open  shells  [12]  represented  a  quite  straight¬ 
forward  programming  task.  We  used  the  energy  { 1 )  written  down  in  the  trace  form 

£=  iTr[{h  +  F)D}+  2  (2) 

A<B 

with  F  being  the  usual  Fock  matrix,  following  closely  the  build  up  of  matrices  h 
and  F.  Off-diagonal  elements  h^,  and  with  orbital  indices  p  and  v  corresponding 
to  different  atoms  should  simply  be  multiplied  with  |  and  added  to  the  respective 
two-center  contribution.  One-center  elements  of  h  and  F,  however,  contain  not 
only  true  one-center  terms  but  reflect  also  the  Coulombic  interaction  with  the 
nuclei  and  electrons  of  other  atoms.  The  latter,  however  can  trivially  be  separated 
out  and  assigned  to  the  corresponding  two-center  energy  components. 

In  our  opinion,  a  theoretical  tool  of  such  type  can  be  considered  truly  predictive 
if  one  can  determine  what  bonds  will  become  weaker  by  investigating  vertical  ion¬ 
ization  processes,  i.e.,  by  comparing  the  bond  order  indices  or  the  two-center  energy 
components  of  the  neutral  molecule  and  of  the  ion  at  the  equilibrium  (optimized) 
geometry  of  the  former.  In  other  words,  the  longer  and  weaker  bonds  obtained  for 
an  adiabatic  ionization  obviously  should  exhibit  lower  bond  orders  and  lower  (in 
absolute  value)  bonding  energy  contributions.  These  lower  values  may  be  considered 
as  a  simple  consequence  of  the  bonds  in  question  being  longer.  However,  if  one 
obtains  reduced  bond  contributions  after  ionization  at  the  geometry  of  the  neutral 
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molecule  (i.e.,  if  a  significant  bond  weakening  takes  place  even  for  the  vertical 
ionization),  then  this  is  an  indication  or  prediction  that  the  bond  considered  will 
be  looser  if  the  geometry  of  the  ion  is  relaxed.  One  can  expect  that  bonds  of  this 
type  will  be  those  that  undergo  primary  bond  cleavages. 

One  should  keep  in  mind  that  the  two-center  enei;gy  contribution  corresponding 
to  a  given  bond  is  not  equal  to  the  dissociation  energy  of  that  bond.  It  is  merely  a 
static  parameter  corresponding  to  a  fixed  geometry  and  given  wave  function,  so  it 
has  only  indirect  relationships  with  the  rather  complicated  dynamic  processes  oc¬ 
curring  duiing  the  ion  fragmentation. 

Applications 

In  all  cases  studied  until  now,  the  comparison  of  the  two-center  contributions 
for  the  neutral  molecule  and  the  positive  ion  could  be  used  to  identify  the  primary 
bond  cleavages;  The  energy  contributions  for  the  bonds  undergoing  actual  rupture 
decrease  significantly  (in  absolute  value)  after  the  ionization.  (Changes  by  a  value 
about  1  eV  or  more  are  typical.)  However,  even  large  changes  in  energy  contributions 
corresponding  to  double  and  aromatic  bonds  do  not  indicate  bond  cleavage;  in 
these  cases,  one  is  merely  faced  with  some  decrease  of  the  actual  bond  multiplicity. 

In  the  Figures  1-4,  the  two-center  energy  components  obtained  by  energy  de¬ 
composition  are  displayed  in  electron  volts;  in  each  case  the  upper  numbers  cor¬ 
respond  to  the  neutral  molecules,  and  the  lower  ones  to  the  ions  at  the  optimized 
geometry  of  the  neutral  molecule  (vertical  ionization). 


Rgure  1 .  Twcxenter  mndo  energy  components  ( in  eV )  obtained  for  methyl-ethyl-ketone 
and  diethyl-ketone.  Upper  numbers  relate  to  the  neutral  molecule,  lower  ones  to  the  positive 
ion  (vertical  ionization). 
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Figure  2.  Two-center  mndo  energy  components  (in  eV)  obtained  for  buthyl-amine. 
Upper  numbers  relate  to  the  neutral  molecule,  lower  ones  to  the  positive  ion  (vertical 

ionization). 


Figure  1(a)  shows  the  energy  decomposition  results  for  methyl-ethyl-ketone. 
The  bond  with  the  largest  change  in  the  two-center  contribution  indicates  that  the 
main  process  is  the  ethyl  loss,  in  agreement  with  experiment  [13].  Another  C — C 
bond  at  the  carbonyl  group,  which  corresponds  to  methyl  loss,  is  changed  to  a 
somewhat  less  extent,  and  its  energy  in  the  ion  remains  slightly  larger  (in  absolute 
value).  This  is  in  agreement  with  the  fact  that  only  a  low  intensity  M-15  peak  is 


O 
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H 

Figure  3.  Two-center  mndo  energy  components  (in  eV)  obtained  for  a  laaone.  Upper 
numbers  relate  to  the  neutral  molecule,  lower  ones  to  the  positive  ion  (vertical  ionization). 
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Figure  4.  Two-center  mndo  energy  components  ( in  eV )  obtained  for  a  norbomane  de¬ 
rivative.  Upper  numbers  relate  to  the  neutral  molecule,  lower  ones  to  the  positive  ion 

(vertical  ionization). 


observed  in  the  experimental  mass  spectrum.  These  energy  decomposition  results 
also  indicate  that  this  methyl  loss  does  not  originate  from  the  ethyl  group.  This 
conclusion  is  in  agreement  with  the  comparison  with  the  mass  spectrum  of  diethyl- 
ketone,  in  which  no  methyl  loss  is  observed  experimentally  or  predicted  theoretically 
[Fig.  1(b)]. 

Figure  2  shows  the  energy  contributions  for  buthyl-amine.  One  can  see  that  the 
C — C  bond  in  /3-position  to  N-atom  is  expected  to  be  broken.  This  is  in  full  agree¬ 
ment  with  the  fact  that  the  only  dominating  product  in  the  mass-spectrum  [13]  is 
[CH2NH2]^.  It  is  interesting  to  note  the  orientation  sensitivity  of  this  step:  The 
C — C  bond  cleavage  takes  place  only  if  the  bond  is  parallel  to  the  nitrogen  lone 
pair,  but  not  in  the  other  rotation  isomers.  Further  studies  are  required  to  clarify 
whether  this  result  (and  some  similtir  ones)  can  be  generalized  as  to  develop  a  rule 
relating  mass  spectra  to  the  rotational  isomers  of  the  parent  molecule. 

Looking  at  the  energy  partitioning  of  the  lactone  molecule  depicted  on  Figure 
3,  it  is  not  surprising  that  one  can  observe  in  its  mass  spectrum  CO  loss  and  all  the 
fragments  of  the  remaining  furane  ring  [13]. 

Figure  4  displays  a  norbomane  derivative  from  the  recent  MS  study  of  one  of 
us  [3].  The  calculations  agree  nicely  with  our  experimental  analysis  in  which  two 
competing  primary  ring  opening  processes  were  postulated.  These  processes  start 
with  the  cleavages  of  either  of  the  two  bonds  for  which  energy  partitioning  also 
indicates  significant  bond  weakening. 
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Discussion 

Energy  partitioning  performed  at  the  mndo  level  of  theory  seems  to  be  a  promising 
tool  to  interpret  and  predict  the  primary  bond  cleavages  influencing  the  mass  spectra 
of  organic  molecules.  It  appears  to  us  that  it  is  meaningful  to  apply  it  together  with 
analyzing  the  changes  taking  place  in  the  bond  order  ( Wibeig)  indices  during  the 
ionization.  As  expected,  there  is  a  great  overall  correlation  between  these — con¬ 
ceptually  different — quantities.  This  does  not  mean,  however,  that  they  change  in 
accordance  with  each  other  in  all  cases  without  exceptions.  It  is  our  first  impression 
that  in  the  doubtful  cases  the  energy  partitioning  gives  superior  results.  For  instance, 
for  the  lactone  molecule  shown  in  Figure  3,  only  the  cleavage  of  the  C — O  bond 
would  be  predicted  by  considering  the  bond  order  changes  (Fig.  5). 

It  seems  too  important  that — according  to  our  experience  up  till  now — the  qual¬ 
itative  conclusions  which  can  be  drawn  from  the  energy  partitioning  results  are 
rather  insensitive  to  accuracy  of  the  geometry  applied;  we  should  recall,  however, 
the  orientation  sensitivity  mentioned  above.  It  may  also  be  noted  that,  in  some 
cases  considered,  the  weakening  of  the  bonds  formed  by  hydrogen  atoms  were  not 
manifested  by  the  appearance  of  M-1  peaks  in  the  mass  spectra;  such  a  weakening 
can  probably  be  considered  as  indications  of  more  complicated  rearrangement 
processes. 

In  some  molecules,  due,  e.g.,  to  the  presence  of  a  phenyl  group,  the  homo  may 
correspond  to  a  jr-orbital.  In  such  cases  ionization  from  HOMO  (or  even  from  the 
next-to-HOMO  orbital )  can  lead  to  quite  stable  molecular  ions,  and  the  explanation 
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Figure  5.  Bond  order  (Wiberg)  indices  obtained  for  the  lactone  molecule  shown 
on  Figure  3.  Upper  numbers  relate  to  the  neutral  molecule,  lower  ones  to  the  positive  ion 

(vertical  ionization). 
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of  the  observed  fragmentation  may  require  consideration  of  ionizations  from  lower 
lying  orbitals.  Development  of  a  technique  permitting  use  of  energy  partitioning 
for  discussing  mass  spectra  in  such  cases  is  also  in  progress. 
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Abstract 

Slow  electron  scattering  from  CH<,  and  thermal  positrons  annihilating  with  a  variety  of  organic  mol¬ 
ecules.  will  first  be  discussed,  the  major  role  of  molecular  polarizability  being  the  focal  point.  In  the  same 
context,  initially  neutral  carbon  clusters  with  an  added,  but  now  bound,  electron  will  be  considered,  and 
in  particular  for  Cm  the  role  of  polarizability  will  again  be  stressed.  Some  attention  will  also  be  paid  to 
electron-electron  interaction  m  Cm  -  Photoelectron  spectroscopic  data  on  ionization  potentials  for  both 
linear  and  terminated  polyacenes  will  then  be  bnefly  considered,  both  the  role  of  hole  localization  and 
hole-hole  interaction  being  referred  to  in  these  quasi-two-dimensional  systems.  Finally  some  directions 
for  further  work  are  proposed  ©  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

The  interaction  of  light  particles  with  organic  molecules  is  a  relatively  old  subject. 
To  cite  two  examples,  Buckingham  et  al.  [1]  carried  out  a  phase  shift  analysis  to 
explain  experimental  scattering  cross-section  data  for  slow  electrons  by  methane, 
while  Paul  and  Saint-Pierre  [2]  reported  experiments  on  the  rapid  annihilation  of 
positrons  in  some  gases  of  organic  molecules. 

The  motivation  for  reopening  theoretical  work  in  this  latter  area  was  provided 
by  very  recent  positron  annihilation  experiments  of  Murphy  and  Surko  [3]  on  a 
variety  of  organic  molecules,  and  in  particular  on  alkanes  C„H2„+2  and  substituted 
alkanes  (see  Fig.  2  below).  Because  of  considerable  current  interest  in  electron 
density  theory,  the  next  section  below  (see  also  Appendix  2)  on  positrons  interacting 
with  organic  molecules  will  be  briefly  introduced  by  comparing  and  contrasting 
the  experimental  results  of  Murphy  and  Surko  [3]  with  positron  annihilation  in 
an  electron  gas  of  varying  mean  density,  such  as  occurs  in  simple  metals  [4,5].  As 
to  the  slow  electron  scattering  from  CH4,  a  very  recent  phenomenological  reanalysis 
of  the  experimental  data  by  Freeman  and  March  [6]  has  resulted  in  a  quite  clearcut 
demonstration  that  what  was  missing  from  the  earlier  theoretical  work  of  Buck¬ 
ingham  et  al.  [  1  ]  ( see,  however,  ( 7  ]  and  below )  was  a  recognition  of  the  importance 
of  molecular  polarizability.  This  same  quantity  also  will  be  seen  to  play  a  major 
role  in  determining  positron  lifetimes  in  organic  molecules. 

In  the  third  section  the  topic  of  additional  electrons  bound  to  small  C  clusters 
and  also  to  €50,  will  be  considered,  the  latter  study  being  again  intimately  linked 
with  molecular  polarizability.  A  subsection  deals  then  with  “holes”  in  quasi-two- 
dimensional  (2d)  organic  molecules  while  later  hole-hole  and  electron-electron 
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interactions  in  such  2d  situations,  a  matter  motivated  by  current  interests  in  high 
r,  cuprates,  are  referred  to  briefly.  The  fourth  section  constitutes  a  summary,  and 
also  points  some  directions  for  possible  future  studies. 

Slow  Electrons  and  Thermal  Positrons  Polarizing  Organics 

The  purpose  of  this  section  is  to  present  and  analyze  available  experimental  data 
for: 

(i)  slow  electron  scattering  from  CH4, 

(ii)  thermal  positrons  annihilating  in  organics:  especially  alkanes, 

and  to  demonstrate  in  both  areas  the  initlmate  correlation  with  molecular  polar¬ 
izability.  Some  links  with  electron  density  theory  (see  also  Appendix  2)  will  be 
used  to  motivate  the  analysis  and  interpretation  of  the  data. 

Slow  Electron  Scattering  from  Methane 

Recent  work  by  Freeman  and  March  [6]  has  reanalyzed  available  experimental 
data  on  slow  electron  scattering  from  CH4  [  8,9 ) .  What  is  demonstrated  first  is  that 
slow  electron  scattering  data  for  this  molecule,  together  with  that  for  the  noble 
gases  Ar,  Kr,  and  Xe  which  are  useful  for  comparison,  can  be  fruitfully  analyzed 
in  a  phenomenological  way.  This  is  achieved  by  plotting  the  log  of  the  scattering 
cross  section  <r(£)  against  the  variable  {£<t(£)}’'*.  Thereby  the  behavior  is 
clearly  separated  into  two  separate  regimes  common  to  all  the  systems,  as  shown  in 
Figure  1. 

The  initial  motivation  for  such  a  plot  was  that  the  asymptotic  form  of  the  free 
molecule  electron  density  p(r)  far  from  all  nuclei  is  governed  by  the  ionization 
potential  through  the  exponential  decay  as 

p(r)  ~exp(-2{2/}'«r).  (1) 

Here  I  is  in  units  of  while  r  is  measured  in  units  of  the  Bohr  radius  Oo-  If  we 
argue  dimensionally  that  a  “characteristic”  distance  r  can  be  used  in  Eq.  ( 1 ),  related 
to  the  cross  section  as  { <r} '  then  the  exponent  in  that  equation  strongly  suggests 
the  use  of  the  quantity  {Ea(E)Y'^  as  an  “independent”  variable  in  plots  of  the 
scattering  data. 

The  Ramsauer-Townsend  minimum  then  affords  a  basic  separation  between 
the  two  regimes  referred  to  above.  Below  the  energy,  Ek  say,  characterizing  this 
minimum,  it  turns  out  for  all  four  systems  CH4,  Ar,  Kr,  and  Xe  that  the  law 

{Ea{E)Y'‘  =  const,  E  ^  Er  ,  (2) 

is  accurately  obeyed  (see  Fig.  1 ).  The  behavior  for  £  s  £r  is  more  complex  and 
will  not  be  considered  further  here. 

Theory  of  Elastic  Scattering  Cross  Section  for  Almost  Spherical  Systems.  For 
energies  below  that  of  the  Ramsauer-Townsend  minimum  £«,  the  measured  cross 
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(c  (E)E)*[10-’®kg'^m2/s] 

Figure  1.  Scattering  cross  section  a(E)  for  CH4  compared  with  noble  gases  Ar.  Kr.  and 
Xe.  “Independent"  variable  is  taken  as  {EaiE)]  as  motivated  below  Eq.  { I ).  (Data 
from  Gee  and  Freeman  [8]  and  Fioriano  et  al  (9J).  Note  that  this  plot  phenoinenologically 
divides  data  into  two  regions  with  very  different  ciiaractenstics.  separated  by  the  Ramsauer- 

Townsend  minima. 


section  (r(E)  is  dominated  by  elastic  scaxtenng.  For  the  closed  shell  spherical  atoms 
Ar,  Kr,  and  Xe,  this  elastic  scattering  cross  section  can  be  written  [1] 

ffciamcvif)  =  ^  2  (2/  +  1 )  sin^  ri!{k) ,  (3) 

where  represents  the  phase  shift  of  the  Ixh  partial  wave  (/  =  0.  s  wave:  1,  p 
wave  etc.).  Equation  (3)  is  readily  rewritten  as 


la!-  =  f2:(A:)]'''(A:|a!)-', 


where  2(^')  denotes  the  sum  over  all  /  values  in  Eq.  (3). 

In  Eq.  (4),  a  is  the  scattering  length  defined  through  the  zero-energy  limit  of  the 
total  scattering  cross  section: 


lim  a(E)  =  4ira^ . 
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The  empirical  relation  ( 2 )  now  require?  that,  over  a  range  of  oc  the  quantity 
[2(A')]''-  is  essentially  independent  of  k.  Using  the  reduced  variable  k^/k,  one 
finds  the  approximate  result  representing  the  experimental  data: 

=  const  =  0.18  .  (6) 

Equation  ( 3 )  is  immediately  applicable  to  the  closed  shell  noble  gases  Ar.  Kr,  and 
Xe.  With  Buckingham  et  al.  [  1  ] ,  we  shall  regard  CH4  as  an  almost  spherical  molecule 
for  our  present  purposes.  One  of  the  conclusions  then  drawn  by  Freeman  and 
March  [6]  is  that  the  (constant)  values  of  2(A:)  extracted  from  the  experiments  are 
very  small  (e.g.,  for  Ar,  2(k)  =  1.4  X  10“^;  see  Appendix  1  also).  If  we  adopt  for 
methane  the  static  potential  set  up  in  Ref.  {1],  the  values  of  2(A:),  as  nott^  in  that 
leference,  are  much  too  large  to  agree  with  experimeni.  From  the  work  of  Gianturco 
and  Thompson  [7],  it  seems  that  careful  inclusion  of  the  polarization  potential  is 
essential  to  obtain  realistic,  and  very  low,  values  of  2(^). 

What  also  emerges  from  the  above  study  [6]  is  the  important  role  in  scattering 
of  slow  electrons  played  by  the  dimensionless  product  A:/?  |  a  | .  Earlier  work  [  1 0, 11  ] 
has  demonstrated  a  close  correlation  of  the  magnitude  1  a  1  of  the  scattering  length 
with  polarizability  «:  Freeman  and  the  writer  [6]  have  exposed  subsequently  a 
linear  relation  between  /crIaI  and  log  a. 

Having  discussed  the  interaction  of  slow  electrons  from  CH4 ,  and  in  particular 
having  established  the  major  role  polarizability  is  playing,  we  turn  next  to  the 
positron  annihilation  experiments  of  Murphy  and  Surko  [3]. 

Positron  Annihilation  with  Organic  Molecules 

Briefly,  the  experiments  of  Murphy  and  Surko  [3]  were  performed  with  room 
temperature  positrons  confined  in  a  Penning  trap.  For  the  molecules  studied,  such 
thermal  energy  is  fiom  0.7  to  18  eV  below  the  positron  energy  required  for  the 
formation  of  positronium,  which  therefore  eliminates  this  process  as  a  channel  for 
positron  annihilation. 

It  is  customary  for  experimentalists  to  express  positron  annihilation  rates  relative 
to  the  Dirac  annihilation  rate  of  positrons  in  a  gas  of  uncorrelated  electrons  (see 
Appendix  2).  by  an  effective  nuclear  charge  Zefr[12].  Roughly  speaking,  Zeff/Z  is 
the  average  duration  of  a  positron-molecule  resonance  relative  to  the  duration  of 
a  simple  elastic  collision.  As  atomic  examples,  the  values  of  Ze(r  for  the  noble  gases 
Xe.  Kr,  and  Ar  are  ~300,  70,  and  30  respectively. 

Murphy  and  Surko  [3]  tabulate  Zjfrfor  substituted  alkanes,  hexenes,  aromatics, 
and  aromatic  perfluorocarbons.  One  point  to  be  noted  is  that  increasing  molecular 
size  has  a  major  effect  on  the  annihilation  rate  (see  also  the  section  after  next). 
What  we  stress  here  in  the  present  context  is  the  analysis  made  of  the  annihilation 
rate  per  molecule  (measured  by  Zefr)  as  a  function  of  polarizability  a.  While  there 
is  marked  correlation,  it  turns  out  that  for  fixed  molecular  polarizability  the  an¬ 
nihilation  rate  per  molecule  as  measured  by  Z^jr  is  very  different  for  different  series 
(e.g.,  alkanes  C„H2n+:,  C„F2„,2.  etc.),  as  demonstrated  in  Figure  2. 

Although,  at  the  time  of  writing,  no  detailed  theory  exists  of  the  correlation 
exhibited  in  Figure  2  between  annihilation  rate  and  polarizability,  there  can  be  no 
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l  iguff  2.  I’lol  Ilf  annihiblion  rale.  measurctJ  by  vciau*  polanrahihly.  While  there 
IS  a  gixKl  correlation,  ilifTcrcnl  senes  of  organic  molecules  lie  on  (liffeienl  curves.  (  Kih  a 
povible  plot  csliihiting  more  "universar’  behavior,  see  discussion  in  the  summary  section). 


dmibt  that,  paralleling  the  discussion  of  the  previous  subsection  for  slow  electrons, 
one  is  involved  here  in  treating  the  way  the  positron  polarizes  the  electronic  cloud 
of  these  organic  molecules. 

It  is  also  noteworthy  that  the  experimental  data  conte  closer  to  a  "universal” 
curve  when  ln(/r(r)  is  plotted  against  (  /  -  Ail  where  /  is  the  ionization  potential 
of  the  molecule  while  /o  is  the  binding  energy  of  the  ground  state  of  the  positronium 
atom  (6.8  eV).  Returning  to  their  early  study,  Paul  and  Sainl-Pierrc  (21,  in  order 
to  explain  the  very  large  annihilation  cross  sections  of  the  heavier  molecules,  pro¬ 
posed  the  physical  picture  of  a  slow  positron  making  several  orbits  close  to  the 
“surface  of  (he  molecule.”  Murphy  and  Surko  { 3 )  refer  in  their  worV  to  the  possibility 
that  a  highly  correlated  positron  and  electron  move  in  the  field  of  a  positive  molecular 
ion.  In  this  context  positrons  in  liquids  arc  relevant  also  (sec  end  of  Appendix  2). 
Further  thcorctit  d  work  is  in  progress  113)  currently,  but.  as  in  the  electron  gas 
problem,  there  can  be  no  doubt  that  the  positron  polarizes  the  ( now  exponentially 
decaying)  electron  density  in  Fq.  ( I )  in  a  major  fashion. 

Photoeicctron  Spectroscopy:  Ionization  Potentials  and  Klectron  Affinities 

We  turn  from  scattering  states  in  the  previous  section  to  the  study  of  added  light 
particles  which  arc  bound  to  organic  molecules.  More  specifically,  we  shall  be  con¬ 
cerned  with  ionization  potentials  and  electron  affinities  as  measured  by  photoclcc- 
tron  spcctro.scopy. 

Additional  lilcctrons  Hound  to  C  Clusters 

We  shall  consider  below  results,  from  both  experiment  and  theory,  on  small  C 
clusters,  namely,  linear  Cs-Q  (14)  and  C«)  with  an  added  electron.  Because  of  the 
close  link  of  the  Cw)  work  with  molecular  polarizability,  stressed  in  the  discussion 
of  light  panicle  scattering  above,  we  lake  this  first.  The  electron  affinity  of  Cjib  has 


612 


MARCH 


been  measured  at  2.65  ±  0.02  eV.  We  shall  be  concerned  below  with  a  brief  dis¬ 
cussion  of  the  way  such  a  result  can  be  represented  by  a  pseudopotential  constructed 
by  Hettich  et  al.  [15]. 

Electron  Density  Calculation  for  Added  Electron  in  Cm  Pseudopotential.  The 
binding  of  an  additional  electron  to  C6o  has  been  explored  numerically  by  Hettich 
et  al.  [15]  using  the  central  field  pseudopotential  description 


V{r)  = 


-e^a 


[c+2'/^(r-r,)2]^ 


(7) 


Here  a  is  the  polarizability,  r^  is  approximately  the  radius  of  the  “European  football” 
representing  Cm,  and  c  is  an  adjustable  constant.  Hettich  et  al.  [  1 5 ]  fit  c  to  reproduce 
the  known  binding  energy  of  2.65  eV  [16]. 

Here  an  approximate  calculation  of  some  properties  associated  with  the  pseu¬ 
dopotential  (7)  will  be  obtained  analytically  from  electron  density  theory.  The 
starting  point  is  to  model  the  electron  density  of  the  ground-state  generated  by 
F(r)  in  Eq.  (7)  as  the  Gaussian 

p(r)  =  Pm«exp(-^{r- (8) 


Next  we  shall  invoke  the  approximate  result  of  March  and  Wind  [17],  which  can 
be  described  as  a  spatial  generalization  of  Kato’s  theorem: 


dp{r)  _  -2r^  dV 
dr  e^oo  dr 


pAr), 


(9) 


where  ps('‘)  is  the  s-state  contribution  to  the  total  density  p(r).  Equation  (9)  is 
exact  for  closed  shells  in  a  bare  Coulomb  field  [18]  but  otherwise  approximate. 

For  the  ground  state  generated  by  V{r)  in  Eq.  (7),  p  =  p,.  We  now  differentiate 
Eqs.  ( 7 )  and  ( 8 )  and  substitute  the  results  into  Eq.  ( 9 ) .  Working  near  the  “surface” 
of  Cm,  i.e.,  r  =  r^,  we  find  the  approximate  result 

A  =  l^'^arhc^Oo,  ( 10) 


which  demonstrates  the  parametric  dependence  of  the  model  electron  density  (8) 
on  the  pseudopotential  (7).  For  Cm,  Hettich  et  al.  [15]  take  the  polarizability 
«  80  A^.  The  depth  F(r<.)  of  the  pseudopotential,  namely, 

F(r,)  =  -eVc'  (11) 


is  plainly  an  upper  bound  to  the  binding  energy.  It  needs  correcting  by  the  zero- 
point  energy  associated  with  the  curvature  of  the  potential  energy  (7)  around  r  = 
Tf.  Evidently,  the  expansion  of  Eq.  (7)  for  r  Tr  is 


V(r)  =  Vir,) 


1  - 


c 


(12) 


with  force  constant  k  given  by 


\k  =  l^'^e^ct/c^. 


(13) 
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Hence  the  zero-point  energy  {hui  referred  to  above  corresponds  to 

o)  =  (klnt)''~  (14) 

with  m  the  electron  mass.  Finally,  therefore,  the  binding  energy,  £/,  say,  of  the 
added  electron  to  the  neutral  Cjo  molecule  is  given  by 


e‘a 


h(t) 


Eh  - - r"  + 


c 

e“a 


2 


„3/2  • 


(15) 


In  summary,  Eq.  (10)  shows  how  the  (model)  ground-state  density  (8)  is  to  be 
expected  to  vary  with  the  pseudopotential  parameters  in  Eq.  (7).  Equation  ( 15)  is 
the  corresponding  result  for  the  binding  energy.  Work  is  in  progress  by  Alonso  et 
al.  [19]  to  refine  the  above  treatment  by  pseudopotential  density  functional  cal¬ 
culations  for  both  Cm  and  Cm  ■  For  a  brief  discussion  of  the  laber,  see  subsection 
on  hole-hole  and  electron-electron  interactions  below,  and  also  Appendix  3. 

"Extra  "  Electron  in  Linear  C  Clusters.  Many-electron  calculations  based  on  coupled 
cluster  theory  have  been  presented  by  Adamowicz  [14]  for  linear  C3-C6.  He  mini¬ 
mized  the  bond  lengths  in  each  case  (one  length  for  C3,  two  for  C4  and  C5.  and  three 
for  C6).  Adamowicz  was  concerned  with  vertical  electron  affinities;  That  is.  the  ge¬ 
ometries  of  the  anions  were  the  same  as  for  the  respective  neutrals.  His  results  are 
reproduced  in  Table  I,  where  the  coupled  cluster  values  of  electron  affinities  are 
compared  with  experiment  [16],  as  well  as  with  self-consistent  field  results. 

The  conclusions  [14]  are  seen  to  be  from  Table  I  that: 

( i )  Electron  affinities  are  significantly  greater  for  even  atom  than  for  odd  atom 
chains. 

(ii)  There  is  an  increase  with  the  length  of  chain  for  both  odd  and  even  chains. 

( iii )  Even  for  sophisticated  many-electron  coupled  cluster  calculations,  the  elec¬ 
tron  affinities  are  0.3-0.4  of  an  eV  smaller  than  the  experimental  values, 
though  the  trend  of  the  empirical  data  is  now  faithfully  reflected. 


Behavior  of ‘Holes"  in  Organic  Moleades 

There  are  two  different  quantities  one  can  use  to  characterize  a  “hole”  in  an 
organic  molecule: 


Table  1.  Electron  affinities  in  eV  for  small  linear  C  atom  clusters*. 


Cj 

C4 

c. 

C6 

Experiment 

1.95 

.3.70 

2.80 

4.10 

Coupled  cluster  results  {14] 

1.58 

3.41 

2.43 

3.69 

Self-consistent  field  values 

1.87 

2.28 

2.95 

2.62 

*  After  Adamowicz  [14];  expenmental  values  from  1 16). 
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( i )  llic  measured  ioni/ation  potential  /,  say  from  photoeicctron  spectroscopy. 

(ii)  the  electron  density  din'crcncc  Ap(r)  between  the  neutral  molecule,  with 
density  p(/  ),  and  the  positive  ion  density  /h(r)' 

„(r)-  pAr),  (16) 

with  the  nuclear  gcomctr>  frozen”  in  the  ion  as  in  the  equilibrium  neutral. 

It  would  be  valuable,  for  future  studies,  to  have  ploK  of  Ap(r)  for  some  typical 
organic  molecules,  derived  from  correlated  electron  wave  functions.  To  date,  we 
know  of  no  information  of  this  kind  that  is  directly  available  (see,  however,  the 
subsection  above,  for  electrons  rather  than  holes).  We  note  here  that  (i)  and  (ii) 
are  linked  because  asymptotically  Ap(r)  behaves  as  in  Eq,  ( 1 ), 

Therefore,  below  we  shall  content  ourselves  with  summarizing  some  regularities 
in  the  ionization  and  chemical  potentials  of  certain  series  of  quasi-two-dimcnsional 
organic  molecules. 

Rcf'iildritic.s  in  Linear  Polyacenes.  In  linear  polyaccnes,  Pucci  and  March  [20] 
demonstrated  that  if  one  takes  experimental  values  of  /  for  the  presently  available 
members  of  the  linear  polyacene  series,  one  then  finds  a  marked  correlation  with 
the  number  of  7r-clcctrons.  They  also  consider  in  some  detail  [20]  the  chemical 
potential  p  of  the  electron  distribution,  measured  in  units  of  the  energy  per  ir- 
electron.  and  connect  this  ratio  (near  to  2  for  HOckcI  and  extended  theories 
and  precisely  2  on  an  electron  gas  model)  with  the  dimensionality  (2d)  of  these 
molecules. 

Terminated  Polyaeenes:  Hole  Localization  in  Backbone.  In  relation  to  Eq,  (16) 
above.  Kemister  and  March  [21]  have  emphasized  from  experiment  that  when 
there  is  a  "backbone”  of  three  rings  or  more,  adding  off-line  rings  to  "terminate” 
the  linear  polyaeenes  changes  /  by  but  a  few  hundredths  of  an  cV,  They  interpret 
this  result  by  replacing  Ap(r)  in  Eq,  ( 16)  by  the  probability  density  of  the  homo 
orbital  in  Hiickcl  theory.  Though,  of  course,  this  is  quite  rough,  they  were  able  to 
conclude  that  this  "hole  density”  was  localized  in  the  backbone  of  these  molecules. 

Hole-Hole  and  Llectron- Electron  Interactions.  We  shall  conclude  this  section 
by  commenting  briefly  on  (i)  hole-hole  interaction  in  the  linear  polyaeenes  and 
(ii)  electron-electron  interaction  in  and  relate  i  systems. 

We  have  m-r  C-sti gated  situations  with  two  holes  in  me  linear  polyaeenes  from  the 
higher  ionization  potentials  as  measured  by  Schmidt  [22].  To  reflect  hole-hole 
separation,  F'igurc  3  shows  /  plotted  vs.  1  fn  with  n  the  number  of  rings,  and  / 
measured  relative  to  the  six-ring  member.  The  linearity  is  striking  and  should  mo¬ 
tivate  a  many-body  study  of  hole-hole  correlation  functions  in  these  quasi-2d  mol¬ 
ecules. 

In  high  y,  cuprates,  electron-electron  and  hole-hole  interactions  in  such  quasi- 
2d  structures  arc.  of  course,  of  considerable  interest,  and  indeed  this  was  the  initial 
motivation  for  this  phase  of  the  work  being  presented.  In  this  context,  it  seems 
natural  to  mention  finally  electron-electron  interactions  in  both  Cm  and  in  K- 
doped.  and  now  solid.  Cut.  The  former  ease  has  been  studied  numerically  by  Hettich 
ct  al.  [15]:  We  refer  the  reader  to  the  more  analytical  study  in  Appendix  3  below. 
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Figure  3.  Higher  ionization  potentials  for  linear  polyacenes  (corresponding  to  two  electrons 
stopped  off  initially  neutral  molecules)  versus  reciprocal  of  number  of  rings  n.  A/  is  ion¬ 
ization  potential  for  n  (<6)  rings  measured  relative  to  that  for  six  rings.  (Experimental 
data  taken  from  Schmidt  [22]). 


As  for  K-doped  Ceo-  The  K  atom  readily  donates  its  valence  electron  to  the  lumo 
orbital  in  Cso-  Then  independent  studies  of  Gedik  and  Ciraci  [23]  and  of  Holas 
and  March  [24]  indicate  that  the  added  electrons  are  well  localized  around  the 
surface  of  the  “European  football”  (see  also  Refs.  [25]  and  [26]).  It  is  well  known 
that  it  is  qualitatively  easier  to  form  a  bound  state  in  2d  than  in  3d  with  a  weak 
attractive  interaction,  a  fact  exploited  in  both  [23]  and  [24].  Unfortunately,  the 
mechanism  for  even  an  arbitrarily  weak  attractive  interaction  remains  unclear  at 
the  time  of  writing. 


Summary  and  Directions  for  Further  Work 

Slow  electrons,  and  also  thermal  positrons,  polarize  organic  molecules  in  a  way 
which  dominates  the  momentum  transfer  cross  section,  and  the  positron  annihi¬ 
lation  rate,  respectively.  Though  the  case  of  an  additional  electron  bound  to  initially 
neutral  Ceo  is  clearly  chemically  different;  nevertheless,  the  binding  is  again  dom¬ 
inantly  due  to  polarizability.  Electron-electron  interaction  in  Cm"  still  leaves  the 
added  electrons  quite  well  localized  around  the  surface  of  the  "European  football.” 
This  is  true  also  of  K-doped,  but  now  solid.  Cm-  Electron-electron  as  well  as  hole- 
hole  interactions  are  important  in  quasi-two-dimensional  systems,  the  latter  being 
briefly  discussed  in  the  linear  polyacenes  in  a  semiempirical  manner  using  data  on 
higher  ionization  potentials  from  photoelectron  spectroscopy. 

As  to  future  work,  it  will  clearly  be  of  interest  if  quantum  chemical  calculations 
can  be  made  of  electron  density  in  both  neutral  organics  and  in  singly  charged  ions 
with  frozen  geometry  as  in  the  neutrals.  The  “difference”  electron  density  in  Eq. 
(16)  will  be  one  focal  point  of  such  calculations.  For  the  terminated  polyacenes 
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discussed  on  the  basis  of  a  single  hole  difference  density  being  simply  the  probability 
density  of  the  homo  orbital  [21],  the  hole  remains  localized  in  the  backbone,  but 
it  would  be  of  interest  to  know  if  this  result  is  robust  against  inclusion  of  many- 
body  effects.  If  such  studies  can  be  made,  then  clearly  for  two  added  holes  the  pair 
correlation  function  of  these  (compare  the  studies  ofThorson  et  al.  [27])  is  especially 
interesting  in  quasi-two-dimensional  molecules  like  the  linear  polyacenes,  because 
of  interest  in  the  high  TV  cuprates. 

Finally,  we  return  to  the  positron  annihilation  studies  (see  Fig.  2  above).  As 
already  noted  a  more  “universal”  plot  of  annihilation  rate  is  given  by  taking  the 
independent  variable  as  (/  —  7o)“‘  instead  of  polarizability  a,  with  I  the  ionization 
potential  and  Iq  the  positronium  binding  energy.  While  Murphy  and  Surko  [  3  ] 
offer  no  physical  interpretation  of  such  a  plot,  we  conclude  by  noting  that,  in  metallic 
clusters,  increasing  cluster  radius  R  can  lead  to  a  linear  relation  between  R  and  (/ 
—  ,4)“',  with  A  the  electron  affinity  [28],  It  is  therefore  of  interest  for  further  work 
to  examine  whether  the  Murphy-Surko  plot  of  annihilation  rate  vs.  (7  -  7o)“'  is 
also  reflecting  a  direct  correlation  with  molecular  size.  If  so,  though  such  a  plot  is 
more  universal  than  Figure  2,  this  latter  plot  may  give  more  insight  into  mechanism 
and  further  work  is  clearly  called  for  here. 
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Appendix  1:  Bounds  on  Phase  Shifts  for  Scattering  from 
“Almost  Spherical”  Molecule  CH4 

Clearly,  from  Eq.  (6)  of  the  main  text  the  magnitude  of  the  phase  shifts  sum 
2(7:)  in  Eq.  (4)  is  determined  by  the  dimensionless  product  of  the  Ramsauer  wave- 
number  kn  and  the  modulus  of  the  scattering  length  |  a  | .  As  explained  in  the  text, 
Eq.  (4)  is  immediately  applicable  to  the  closed  shell  noble  gases:  It  is  approximately 
valid  also  for  the  “almost  spherical”  molecule  CH4.  The  feature  to  be  stressed  is 
that  the  constant  values  of  2(7: )  thereby  extracted  in  the  range  of  validity  of  Eq. 
(6)  are  extremely  small.  For  example,  for  Ar,  2(7:)  =  1.4  X  10“^. 

From  the  definition  of  2(7c)  in  Eq.  (4),  one  can  write  an  immediate  inequality 
for  Ar  that 

(2/+ l)sin^  77,(7:)  <;  1.4  X  10"\  (Al.l) 

The  equality  sign,  of  course,  applies  only  when  a  single  phase  shift  is  dominant.  If 
we  conjecture  that  for  E<Er  and  in  the  range  of  Eq.  (2)  this  condition  is  satisfied 
and  the  s  wave  alone  dominates  the  sum,  then  sin^  770  =  const  =  1.4  X  10"^.  This 
means  that  770  can  differ  from  a  multiple  of  tt  by  only  0.04  radians.  Plainly,  if  we 
study  the  phase  shifts  for  the  static  methane  potential  used  in  Ref.  [1],  the  values 
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of  2(/:),  as  these  workers  noted,  are  much  too  large  to  agree  with  experiment.  It 
seems,  from  the  work  of  Gianturco  and  Thompson  [7],  that  the  careful  inclusion 
of  the  polarization  potential  is  essential  to  obtain  realist  ":,  and  very  low  values  of 
S(/c).  Of  course,  we  recognize  that  constancy  of  S(k),  say  in  Ar,  in  the  range  kn! 
k  from  1 .5  to  5,  may  come  from  a  balance  between  s,  p,  and  d  wave  phase  shifts. 
In  Ref.  [  1  ] ,  it  is  noted  that  d  and  higher  partial  wave  phase  shifts  can  be  usefully 
calculated  semiclassically.  We  stress  again  that  careful  inclusion  of  the  polarization 
potential  is  essential  in  future  work  using  such  a  semiclassical  approach  for,  say, 
1^2. 


Appendix  2:  Electron  Gas  Model  of  Positron  Annihilation 


In  connection  with  the  recent  experiments  on  positron  annihilation  with  organic 
molecules  discussed  in  the  second  section,  let  us  briefly  summarize  the  essence  of 
the  explanation  of  positron  lifetimes  in  an  electron  gas  such  as  exists  in  simple 
metals.  Let  us  recall  that  the  mean  electron  density,  po  say,  in  such  metals,  is 
generally  represented  in  terms  of  a  mean  interelectronic  separation  g  by 

Po  =  3/47rr|.  (A2.1) 


Then  in  terms  of  the  Bohr  radius,  r,goes  from  2ao  to  S.Soo  over  the  range  of  metallic 
densities.  While  more  refined  calculations  have  been  made  more  recently  using 
density  functional  theory  [29],  let  us  briefly  summarize  the  self-consistent  field 
calculations  of  March  and  Murray  [5]  for  the  perturbed  density  p(0),  say,  at  the 
position  of  a  unit  positive  charge  at  the  origin,  embedded  in  an  originally  uniform 
electron  gas.  Table  II  shows  their  results  for  p(0)  as  a  function  of  and  also  the 
ratio  p(0)/po. 

A  useful  fit  of  the  results  in  Table  II  is  given  by  [5] 


'p(O)  -  Po 
Po 


2.23G-E0.766r?  +  0.593  r? 


(A2.2) 


The  standard  Dirac  result  for  the  positron  lifetime  t  [4]  may  be  written  as 


T 


Po 

P{0) 


X  lO-'^s. 


(A2.3) 


If  we  take  the  example  of  Cu  metal  with  r,  =  2.66  then  from  Table  II,  we  find 


Table  H. 


r,  (units  of  Oa) 

2 

2.66 

4 

5 

6 

P(0) 

0.396 

0.298 

0.188 

Po 

0.0298 

0.0127 

0.001 1 1 

p(0)/po 

13 

23 

60 

■■■■ 

169 

618 


MARCH 


T  =  0.6  X  10"'®s.  (A2.4) 

whereas  the  experimental  value  is  2.3  X  10“'®  s.  The  agreement  is  quite  reasonable, 
especially  as  the  positron  is  a  light  particle,  whereas  the  unit  positive  charge  was 
taken  as  fixed  at  the  origin  in  constructing  Table  11. 

But  the  most  interesting  conclusion  from  this  table  is  that  whereas  the  electron 
gas  density  po  varies  by  a  factor  of  30,  the  density  at  the  position  of  the  unit  positive 
charge  changes  by  little  more  than  a  factor  of  2  over  the  same  range  of  po.  Therein 
lies  the  gist  of  the  explanation  of  the  observed  modest  vanation  of  positron  lifetimes 
through  the  simple  metals.  Evidently,  from  Eq.  (A2.2).  it  is  very  hard  to  polanze 
the  electron  gas,  as  -►  0,  i.e..  in  the  high  density  limit  where  the  kinetic  energy 
per  electron  is  proportional  to  rj^. 

To  conclude  this  appendix,  it  is  of  interest  to  compare  and  contrast  results  on 
metals  with  those  on  condensed  forms  containing  organics.  First  though,  and  some¬ 
what  more  generally,  positron  annihilation  measurements  are  available  not  only 
for  metals  as  discussed  above  but  also  for  nonmetallic  crystals,  liquids,  and  amor¬ 
phous  solids.  Many  of  the  materials  in  the  latter  two  categories  exhibit  a  complex 
time  decay  curve  which  has  two  distinct  components  of  comparable  intensity  with 
different  mean  lives  [30].  Roughly  speaking,  in  such  complex  time  decay,  about 
%  of  the  positrons  annihilate  with  a  mean  life  (ti  )  of  a  few  times  10“'®  s  while  the 
remaining  '/}  exhibit  a  mean  life  (t2)  of  (0.5-3.5)  X  10“’  s.  depending  on  the 
substance.  The  longest  lifetime  is  found  to  decrease,  in  general,  as  the  sample  is 
cooled. 

Bell  and  Graham  [31]  propose  that  the  component  is  due  to  the  formation  of 
triplet  positronium  which  is  converted  to  the  singlet  state  by  collision.  Because  of 
the  recent  experiments  on  organics  in  the  gaseous  phase  discussed  in  the  second 
section  of  the  main  text,  further  theoretical  work  to  understand  the  difference  be¬ 
tween  positron  annihilation  in  the  liquid  and  gaseous  states  of  organics  is  clearly 
of  interest. 

Appendix  3:  Electron  Density  Theory  of  Additional  Electrons  in  Cm 

Here  we  set  out  *•  "iefly  the  extension  of  the  pseudopotential  theory  given  in  the 
main  text  for  Cm  to  apply  to  Cm~.  We  construct  a  new  effective  potential  energy, 
‘Vfr)  say,  as 

V(r)  =  V(r)  +  e'^  j  dr'  (A3.1) 

J  |r-r| 

Again  we  assume  the  electron  density  p{r)  in  Eq.  ( A3.1 )  is  of  Gaussian  form 

p(r)  =  jVexp(-fi(r- r,.)’].  (A3.2) 

where  5  <  ^  in  Eq.  (8).  Again  we  use  the  relation  (9)  of  March  and  Wind  [17]. 
Differentiating  Eq.  (A3.2)  with  respect  to  r,  we  then  find,  again  near  r  =  r,.. 


—  =  -2B(r  -  rc)p(r^). 
dr 


(A3.3) 
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Turning  to  construct  r^d^V tdr  from  Eq.  (A3.1 ),  the  first  term  from  the  pseu¬ 
dopotential  r(r)  is  already  dealt  with  in  the  third  section  of  the  main  text.  The 
second,  screening  contribution  to  V(r),  say  Vdr)  in  Eq.  (A3.1 )  can  be  handled 
using  Poisson’s  equation  of  electrostatics  in  the  form 


1  d  \  ,dVAry 
dr  ^  dr 


^irp{,r)e^. 


(A3.4) 


Hence  r^dl\.{r)ldr  can  be  written  in  terms  of  the  total  amount  of  charge  Q{r) 
inside  a  sphere  of  radius  r,  namely. 


as 


4irr^p(r)  dr 


(A3.5) 


r^dVAr) 

dr 


=  Q{r) 


(A3.6) 


From  Eq.  (9)  and  (A3. 3),  one  finds  then 


-2fi(r-r,)  = 


/  dK  ^  Q(r)~Q{r,) 
\e^ao  dr  ao 


(A3.7) 


Using  the  explicit  form  (A3.2)  in  Eq.  (A3.5),  one  can  obtain  Q{r)  in  terms  of  B 
and  r^.  Equation  (A3.7)  can  then  be  used  iteratively  to  determine  5,  for  given 
parameters  a.  r<,  and  c  in  the  Hettich  et  al.  pseudopotential  r(r)  starting  with  a 
zero-order  approximation  B  =  ^,4:  >  2:  1.  Though  this  numerical  iteration  has  not 
been  performed  at  the  time  of  writing,  we  have  estimated  B  ~  (0.043)  (a.u.)"‘ 
from  the  different  numerical  study  reported  by  Hettich  et  al.  [15].  Further  calcu¬ 
lations  are  now  in  progress  [19]  to  refine  the  above  electron  density  treatment  of 
Cm'  .  including  the  choice  of  pseudopotential.  However,  these  refinements  are  still 
within  a  spherically  averaged  pseudopotential  approximation  already  employed  for 
Na  clusters  [32]. 
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Abstract 

We  have  located  five  zero-angular-momentum  resonance  states  in  the  funnel  associated  with  the  lowest 
conical  intersection  of  the  Na(3p)-H2  exciplex,  and  we  have  characterized  the  four  narrowest  of  these 
in  terms  of  total  and  partial  widths.  The  resonant  contnbutions  of  the  metastable  states  to  state-to-state 
energy  transfer  greatly  exceed  the  background  contnbutions.  ©  1993  John  Wiley  &  Sons,  Inc. 

Introduction 

Conical  intersections  [1]  of  potential  energy  surfaces  lead  to  important  conse¬ 
quences  in  structure,  spectroscopy,  and  dynamics.  Structurally,  the  best  known 
consequence  of  conical  intersections  is  the  Jahn-Teller  effect  [2,3],  which  causes 
the  equilibrium  structure  of  molecules  in  degenerate  electronic  states  to  distort. 
Conical  intersections  also  have  important  spectroscopic  effects,  where  they  often 
lead  to  a  characteristic  bimodal  signature  in  the  electronic  spectrum  [4],  States  of 
the  upper  cone  are  typically  broadened  to  a  large  extent  by  nonadiabatic  mixing 
with  states  of  the  lower  potential  energy  surface,  even  when  the  latter  is  nondis- 
sociative  [4,5],  In  molecular  collisions  where  the  lower  surface  is  dissociative, 
broadening  might  often  be  even  more  severe  since  the  funnel  state  will  have  a  finite 
lifetime.  An  example  would  be  the  Na(3p)  +  Hz  system  [6-12],  where  the  ground 
state  is  repulsive,  dissociating  to  Na(3s)  +  Hz,  and  the  excited  state  is  an  exciplex, 
formally  Na(3p)’Hz,  whose  potential  energy  surface  intersects  the  ground  one 
conically.  We  have  recently  carried  out  converged  quantum  mechanical  dynamical 
calculations  on  this  system,  using  a  previously  parameterized  set  [10]  of  coupled 
potential  energy  surfaces,  and,  despite  this  broadening  mechanism,  we  found  five 
resonances,  four  of  which  are  broadened  by  only  2  to  6  meV,  corresponding  to 
lifetimes  of  0.2  to  0.7  ps  [12].  Furthermore  we  found  substantial  enhancement  of 
energy  transfer  probabilities  in  inelastic  collision  processes  that  proceed  through 
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the  metastable  state.  In  the  present  article,  we  characterize  the  resonances  by  their 
partial  widths  [13],  and  we  study  the  inelastic  enhancements  in  more  detail. 

The  next  three  sections  present  the  system,  the  methods  used  for  the  accurate 
quantum  dynamics  calculations,  and  the  methods  for  analyzing  the  resonances. 
The  final  section  presents  results  and  discussion. 

System 

The  collision  system  studied  is  Na(3s)  +  H2  using  a  two-state  electronically 
diabatic  [14]  representation  described  previously  [10].  This  representation  includes 
two  diabatic  potential  energy  surfaces  Un  and  U22  and  a  coupling  matrix  element 
C/12,  all  of  which  depend  on  all  three  internal  coordinates.  Electronic  angular  mo¬ 
mentum  and  spin-orbit  coupling  are  neglected.  The  upper  surface  dissociates  to 
Na(3p)  -I-  H2(with  excitation  energy  2.1037  eV)  but  has  an  accessible  well  at  small 
Na — H2  distances,  corresponding  to  the  Na  •  H2  exciplex. 

In  this  article  (and  in  Refs.  [12]  and  [18]),  we  set  the  zero  of  energy  at  Na(3s) 
infinitely  far  from  H2  at  its  classical  equilibrium  intemuclear  distance.  (Note  though 
that  in  Ref.  [10],  the  zero  of  energy  is  such  that  this  configuration  had  an  energy 
of  5.0  meV.)  Figure  1  illustrates  U\  i  and  U22  for  the  C2v  geometry.  In  this  geometry 


Figure  1 .  Energies  of  the  two  lowest  potential  energy  surfaces  for  the  C2,  approach  of  Na 
to  Hj  as  a  function  of  the  distance  5  from  the  Na  to  the  center  of  mass  of  H2.  For  each 
value  of  S,  the  value  of  the  H — H  distance  s  is  1.40  ao  when  the  Na — H2  distance  is 
greater  than  3.856  ao  and  equals  4. 1 235  +  17.375ao  when  S  is  less  than  this.  The  preceding 
linear  relation  is  determined  by  the  requirement  that  the  line  pass  through  both  the  lowest- 
energy  point  on  the  upper  surface  at  5  =  3.7 1 5ao,  J  =  1  9£4ao  and  the  lowert-energy  conical 
intersection  at  5  =  3.708ao,  s  =  2.0i3ao.  The  five  horizontal  lines  in  the  well  are  the 
resonance  energies  found  in  this  work.  The  horizontal  lines  at  the  nght  are  the  vibrational- 
rotational  thresholds  of  H2.  The  thin  horizontal  line  across  the  whole  figure  is  the  energy 
ofNa(3p)+H2  (0=7  =  0). 
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C23 


energy  (eV) 

Figure  2.  The  sum  of  the  eigenphases  as  a  function  of  energy  in  the  vicinity  of  the  first 
resonance.  The  circles  are  the  quanta!  calculations,  and  the  solid  curve  is  a  fit  consisting 
of  a  seventh-order  polynomial  for  the  background  (direct  scattering)  plus  a  Breit-Wigner 
resonance  term.  The  short-dashed  line  is  the  background,  and  the  long-dashed  line  is  the 
resonance  contribution. 


Un  =  0,  and  so  the  adiabatic  potentials  are  the  same  as  the  diabatic  ones.  As 
explained  in  the  figure  legend,  the  plot  corresponds  to  a  path  in  which  the  H  —  H 
distance  increases  significantly  as  the  Na — H2  distance  decreases,  because  the  well 
involves  a  considerably  stretched  H  —  H  distance.  More  accurate  calculations  [  8  ] , 
for  which,  however,  full  potential  energy  surfaces  were  not  presented,  indicate  that 
the  potential  energy  surfaces  used  here  may  quantitatively  overestimate  the  H — H 
extension  at  the  minimum  of  Un  and  at  the  lowest-energy  conical  intersection; 
nevertheless  the  qualitative  picture  of  an  exciplex  with  an  H  —  H  distance  consid¬ 
erably  larger  than  in  diatomic  hydrogen  is  valid. 

In  this  article  we  consider  total  angular  momentum  7=0  and  the  even  symmetry 
block  corresponding  to  para  Hj .  For  7  =  0,  the  orbital  quantum  number  /  associated 
with  motion  of  Na  relative  to  H2  equals  the  rotational  quantum  number  j  of  H2. 
Thus  a  channel  n  may  be  specified  by  two  quantum  numbers:  v,  w-'ich  is  the 
vibrational  quantum  number  of  H2,  and  j. 

Theory 

For  an  isolated  narrow  resonance  (INR)  [13]  the  resonance  contribution  to  the 
scattering  matrix  is  completely  specified  by  a  complex  resonance  energy  and  N 
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Figure  3.  State-to-state  transition  probabilities  from  the  full  calculation  (total)  and  the 
separate  background  (direct )  and  resonant  contributions.  The  first  column  is  for  resonance 
2.  and  the  second  is  for  resonance  S. 


additional  complex  numbers,  where  N  is  the  number  of  open  channels.  The  reso¬ 
nance  energy  corresponds  to  a  simple  pole  of  the  S  matrix  in  the  complex  plane 
and  will  be  denoted  by 


= 


(1) 


where  a  is  an  index  specifying  the  particular  resonance,  E„  is  the  location  of  the 
resonance  on  the  real  energy  axis,  and  r„  is  the  total  resonance  width.  Near  r„  the 
scattering  matrix  may  be  written  [13] 
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Table  1. 

Resonance  parameters. 

energy  (eV) 

Width  (meV) 

ft 

V} 

r(fs) 

1.890 

1.960 

0 

0 

0 

699 

1.977 

4.679 

0 

1 

0 

281 

2  053 

5.741 

0 

2 

0 

229 

2.08 

Wide 

0 

0 

2 

n/a 

2.105 

2.474 

1 

0 

0 

532 

o  ^  r»b  ..  Tan'Tan 

^n*n  •^n*n  *  r'  • 

E-  z„ 

where  n  and  n'  denote  the  initial  and  final  channels,  S'*  is  the  background  (non¬ 
resonant)  contribution  to  the  scattering,  and  the  7„„,  n  =  1,  2,  . . . ,  A^,  are  the 
complex  parameters.  The  partial  widths  arc  given  in  terms  of  these  quantities  by 

Table  iI.  Partial  widths  for  first  resonance  (eV). 

Different 

Different 

V 

J 

Standard* 

A/=  12 

column 

E  range 

0 

0 

2.435E-07 

2.441E-07 

2.431E-07 

2.435E-07 

0 

2 

5.553E-07 

5.559E-07 

5.551E-07 

5.565E-07 

0 

4 

4.765E-06 

4.774E-06 

4.757E-06 

4.768E-06 

0 

6 

1.529E-05 

1.532E-05 

1.528E-05 

1.530E-05 

0 

8 

1.213E-05 

1.2I5E-05 

1.211E-05 

1.214E-05 

0 

10 

7.285E-07 

7.290E-07 

7.297E-07 

7.302E-07 

0 

12 

4.339E-10 

4.357E-10 

4.341E-10 

4  331E-10 

0 

14 

4.588E-I5 

4.59  IE- 15 

4.58  IE- 15 

4.587E-15 

1 

0 

1.078E-04 

1.078E-04 

1.077E-04 

1.076E-04 

1 

2 

I.726E-04 

1.735E-04 

1.725E-04 

1.730E-04 

1 

4 

2  240E-04 

2.235E-04 

2.239E-04 

2.249E-04 

1 

6 

1  103E-04 

1.106E-04 

1.104E-04 

1.103E-04 

1 

8 

3.810E-06 

3.827n-06 

3.810E-06 

3.8I7E-06 

1 

10 

4.948E-09 

4.978E-09 

4.948E-09 

4.959E-09 

1 

12 

1.094E-16 

1.103E-16 

1.138E-16 

1.096E-16 

2 

0 

1.079E-03 

1.079E-03 

1.079E-03 

1.079E-03 

2 

2 

1.179E-04 

1.178E-04 

1.177E-04 

1.181E-04 

2 

4 

4.606E-05 

4.599E-05 

4.601  E-05 

4.603E-05 

2 

6 

2.792E-07 

2.788E-07 

2.790E-07 

2.789E-07 

2 

8 

3.530E-13 

3.54  IE- 13 

3.735E-13 

3.526E-13 

3 

0 

1.626E-12 

1.635E-12 

1.596E-12 

1.588E-12 

3 

2 

2.081E-11 

2.192E-11 

2.I05E-11 

2.103E-1I 

3 

4 

2.778E-15 

I.376E-15 

4.010E-15 

2.897E-16 

•.w  = 

10.  the  fit  IS  based  on  the  column  of  the  S  matnx  with  the  largest  partial  width,  and  the  energy 

region  used  for  the  fit  is  a  wide  region  between  two  vibrational-rotational  thresholds  of  Hj. 
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Table  III.  Partial  w  ^ths  and  jn:  o-oanial  widths  (eV)  and  ratio  of  sum  to  total  •^ldth. 


V 

J 

Resonance  1 

Resonance  2 

Resonance  3 

Resonance  5 

0 

0 

2.44E-07 

9.41E-07 

6.50E-07 

1.67E-07 

0 

2 

5.55E-07 

3.I6E-06 

4.22h-06 

3.86E-07 

0 

4 

4.77E-06 

1.03E-05 

3.84E-06 

1.7  IE-06 

0 

6 

l.,53E-05 

1.59E-05 

9.29E-06 

2.54E-06 

0 

8 

1.21E-05 

6.06E-06 

l.PIE-Oo 

8.98E-07 

0 

10 

7.28E-07 

I.6SE-06 

3  00E-06 

7.67E-07 

0 

12 

4.34E-10 

1  45E-09 

4.55E-10 

3  99E-07 

0 

14 

4.59E-15 

2.I7E-13 

2.48E-1 1 

1.54E-10 

0 

16 

closed 

6.38E-20 

1.39E-20 

3  45E-I8 

1 

0 

1.08E-04 

1  48E-04 

1  46E-04 

8  59E-06 

1 

2 

1.73E-04 

I.I9E-04 

I.54E-05 

4.85E-05 

1 

4 

2.24E-04 

I.64E-06 

4.30E-05 

2.99E-05 

1 

6 

l.lOE-04 

1.73E-05 

1.14E-05 

2.24L-05 

1 

8 

3  8IE-06 

3  68E-07 

1.29E-04 

l.lOE-05 

1 

10 

4.95  E-09 

8.84E-07 

1  34E-06 

5.53E-06 

1 

12 

1.09E-I6 

6.62E-I3 

6.66E-IO 

7.40E-09 

1 

14 

closed 

closed 

close 4 

4.39E-I8 

2 

0 

1.08E-03 

I..ME-03 

3.91E-04 

I.86E-04 

2 

2 

1.18E-04 

I.35E-03 

1.28E-03 

4  97E-04 

2 

4 

4.6  IE-05 

I.27E-03 

2.18E-03 

1  1  IE-03 

2 

6 

2.79E-07 

7.57E-05 

3.37E-05 

5.6-JE-05 

2 

8 

3.53E-I3 

3.70E-I0 

I.99E-06 

1.62E-06 

2 

10 

closed 

6.29E-lt> 

5.75E-I7 

l.llE-12 

3 

0 

1.63E-12 

I.52E-07 

8.50E-94 

1  52E-04 

3 

2 

2.08E-II 

301E-I0 

4.65E-04 

3.39E-04 

3 

4 

2.78E-15 

2.63E-10 

1  09E-07 

4  I2E-06 

3 

6 

closed 

closed 

I.21E-13 

8.03E-1I 

O'a 

I.90E-03 

4.35E-03 

5.57E-03 

2  48E-03 

o„/r. 

0.967 

0.931 

0.970 

1.002 

We  will  denote  the  sum  of  the  partial  widths  by  G„,  i.e.. 


For  an  inr  we  have  [13] 

I'a  =  G„ 


(3) 


(4) 


(5) 


but  in  general  this  does  not  hold,  and  it  has  been  observed  [15]  that  G„  is  frequently 
significantly  below  r„  for  wide  resonances.  In  general  though,  we  define  decay 
probabilities  by  [  1 5  ] 
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Figure  4.  Decay  probabilities  into  various  v\j'  states,  (a)  First  resonance;  (b)  second 
resonance;  (c)  third  resonance;  (d)  fifth  resonance 


Pan  =  Tan/ G^.  (6) 

Computational  Procedure 

The  S  matrices  were  generated  by  the  outgoing  wave  variational  principle  [16]. 
extended  to  electronically  inelastic  collisions  as  described  elsewhere  [11,17,18]., 
The  numerical  parameters  are  identical  to  those  describee  elsewhere  [11,18].  We 
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Figure  5.  Ctecay  probabilities  for  first,  second,  third,  and  fifth  resonances  into  the  various 
]'  states  of  the  v'  =  2  level. 


performed  scattering  calculations  at  energies  clustered  around  estimated  bound- 
state  energies  of  the  IJ22  diabatic  surface.  The  scattering  matrices  are  all  well  con¬ 
verged  with  respect  to  these  numerical  parameters. 

The  first  step  of  the  fit  is  to  determine  the  resonance  energy.  We  begin  with  the 
generalized  Breit-Wigner  formula  [19] 


A  =  A®  +  arctan 


(2(£„-£))’ 


(7) 


where  A  is  the  eigenphase  sum  defined  by 

exp(2/A)  =  det  S ,  (8) 

and  A®  denotes  the  background  (i.e.,  direct)  contribution.  Since  the  eigenphase 
sum  is  only  determined  in  Eq.  ( 8 )  modulo  tt  we  added  multiples  of  tt  to  the  calculated 
data  until  a  continuous  function  of  total  energy  E  was  obtained.  Numerical  values 
of  r„  and  £„  were  determined  by  a  nonlinear  least  squares  fit  to  a  smoothed  version 
of  Eq.  (7),  i.e., 

A  =  A® -f- arctan|^^^^^3^j  +  0(£- c„) ,  (9) 

where  0  is  a  Heaviside  step  function,  and  A®  is  represented  as 

A®  =  2  , 

x-o 


(10) 
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where  Xmax  is  chosen  high  enough  that  stable  results  are  obtained.  Actually,  of 
course,  the  “smoothing”  simply  corresponds  to  picking  the  correct  branch  of  the 
inverse  tangent. 

In  the  next  phase  of  the  fitting  procedure  we  expanded  5’’  in  Eq.  (2)  as 

M 

(11) 

x=o 

and  we  multiplied  this  by  E  -  to  obtain 

A/+1 

2  (12) 

x-o 


where 


^A/+l.n'n  “  ^Mn'n  (  i^) 

~  X  —  1,2,...,  A/  (  14) 

Bon^n  ,  (15) 

and 

^an*n  ~  yan^ycin  •  (  Ifi) 

The  5„„'„  were  obtained  by  a  linear  least  squares  fit,  and  from  these  the  y4„„'„and 
C„„'„  were  determined.  By  fitting  an  entire  column  of  the  S  matrix  we  obtained 
results  for  the  and  thus  the  partial  widths.  In  practice  we  fit  the  column  of  the 
S  matrix  that  has  the  largest  partial  width  (as  determined  by  a  preliminary  fit), 
although  the  results  were  stable  if  obtained  from  any  column  that  had  a  reasonably 
sized  partial  width.  More  powerful  methods  for  treating  difficult  cases  have  been 
given  elsewhere  [15]. 


Results  and  Discussion 

Figure  2  displays  the  eigenphase  sums  used  to  fit  the  first  resonance.  The  fit  is 
very  good.  The  region  shown  in  Figure  2  (as  well  as  each  energy  range  shown  in 
Fig.  3  below)  is  narrow  enough  that  it  contains  no  rotational  thresholds. 

In  Table  1  we  give  the  fitted  r„  and  E„  as  well  as  the  quantum  number  assignments 
obtained  by  examining  the  wave  function  of  th.e  bound-state  calculation  mentioned 
above  for  the  U22  diabatic  surface.  Vibrational  modes  I'l  and  V2  have  ai  symmetry, 
and  mode  Uj  has  b\  symmetry.  Note  that  states  with  odd  U3  do  not  occur  in  the 
even  symmetry  block  studied  in  this  article. 

Figure  3  gives  plots  of  selected  state-to-state  transition  probabilities.  In  each  case, 
the  figure  shows  the  background  (direct)  contnuution,  i.e..  1 l^  and  the  reso¬ 
nance  contribution,  given  by  the  absolute  square  of  the  second  term  of  Eq.  (2),  as 
well  as  the  total  transition  probability,  given  by  The  total  probability  is 
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not  the  sum  of  the  other  two  because  there  is  also  an  interference  term  ( not  shown ) , 
which  can  be  of  either  sign.  The  plot  in  the  upper  left  shows  the  depletion  of  an 
elastic  |  S„„  1  ~ ,  and  the  other  three  show  enhancements  of  inelastic  processes.  ( Note 
that  1 5„  I  *  is  a  transition  probability,  but  elastic  scattering  cross  sections  depend 
on  1 1  -  S„„|^  not  on  |5„„|’). 

Each  resonance  couples  with  a  different  distribution  of  strengths  to  the  possible 
state-to-state  processes.  For  an  inr,  however,  the  extent  of  the  coupling  to  each  of 
these  processes  is  summarized  succinctly  in  the  partial  widths,  which  are  dis¬ 
cussed  next. 

In  Table  II  we  illustrate  the  quality  of  the  convergence  for  the  partial  widths.  We 
performed  three  separate  tests  for  the  first  resonance.  The  first  test  shows  the  level 
of  convergence  when  the  polynomial  order  used  in  Eq.  ( 1 1 )  is  increased  from  10 
to  1 2.  The  second  test  uses  the  column  of  the  S  matrix  associated  with  the  second 
largest  partial  width  to  generate  the  results.  The  final  test  indicates  the  level  of 
convergence  when  the  range  of  energies  is  modified  by  removing  the  two  lowest 
and  two  highest  energy  points.  Table  III  lists  the  partial  widths  obtained  using  a 
10th  order  polynomial  for  the  background  at  each  of  the  four  resonances. 

Resonance  4  was  too  hard  to  fit  to  get  accurate  widths. 

Since  the  inr  approximation  holds  well  for  resonances  1,2,3,  and  5,  their  resonant 
transition  probabilities  are  given  to  a  good  approximation  by 


where 


F„(£)  = 


(E-Ey  +  Tl/4 


(17) 


(18) 


and  P„„  is  a  decay  probability  given  by  Eq.  (6).  Equation  (17)  has  a  semiclassical 
interpretation  that  is  useful  when  all  the  are  less  than  0.25;  in  that  case  we  can 
interpret  F„{E)P„„  as  the  probability  that  initial  channel  n  accesses  resonance  a  at 
energy  E  and  P„„'  as  the  probability  that  this  resonance  decays  to  state  Figures 
4  and  5  show  the  decay  probabilities.  One  feature  of  immediate  interest  is  the  high 
specificity  of  decay,  i.e.,  in  each  case  there  is  at  least  one  final  state  to  which  decay 
occurs  with  a  probability  greater  than  0.3,  and  in  one  case  the  largest  decay  prob¬ 
ability  exceeds  0.5.  We  see  that  the  largest  decay  probabilities  are  into  the  u  =  2 
states  for  all  four  resonances  for  which  partial  widths  were  obtained,  and  the  first 
resonance  also  shows  appreciable  coupling  to  v  =  0.  The  coupling  to  highly  excited 
vibrational  states  is  understandable  in  terms  of  the  large  H  —  H  distances  accessed 
by  the  exciplex.  It  would  be  interesting  to  study  the  effect  of  changing  the  potential 
energy  surfaces  on  the  decay  probabilities. 

Figure  5  compares  the  j  distributions  directly  for  n  =  2  for  all  four  resonances. 
The  figure  shows  that  the  first  resonance  couples  mainly  to  low  j  states,  the  second 
to  a  broad  range  of  j,  and  the  third  and  fifth  to  a  narrow  range  centered  at  j  =  4. 
Since  resonances  2  and  3  both  involve  excitation  of  but  show  quite  different 
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patterns,  but  resonances  3  and  5  involve  excitation  of  different  modes  and  show 
similar  patterns,  the  interpretation  of  these  trends  is  a  fascinating  topic  for 
further  work. 
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Abstract 

Proton  affinities  have  been  calculated  for  1 19  compounds  using  the  pm3  semiempincal  molecular 
orbital  model  pm3  seems  to  be  not  as  good  as  am!  for  the  calculation  of  proton  affinities  At  the  same 
time.  It  can  be  a  valuable  tool  for  investigation  of  proton  affinities,  when  am  1  is  not  usable  ( for  hypervalent 
compounds  of  second-row  elements).  ©  1993  John  Wiley  &  Sons.  Inc 

Introduction 

Molecular  orbital  calculations  are  carried  out  by  an  increasing  number  of  chemists 
on  a  widest  range  of  problems.  Both  ab  initio  and  semiempirical  self-consistent 
field  molecular  orbital  calculations  are  often  used.  The  widespread  use  of  the  semi- 
empirical  mindo/3  [1],  MNDO  [2],  AMl  [3],  and  pm3  [4]  methods  is  due  to  the 
fact  that  they  often  give  reliable  answers,  are  easy  to  use,  require  relatively  little 
computer  time,  and  can  handle  large  molecules. 

Proton  transfer  reactions  are  widespread  and  of  fundamental  importance  in 
chemistry  and  in  biochemistry.  Therefore,  the  knowledge  of  the  proton  affinities 
of  neutral  bases  is  essential.  If  these  quantities  could  be  calculated  theoretically  by 
some  quantum  chemical  procedure  with  sufficient  accuracy,  this  would  be  a  great 
value,  while  such  calculations  can  be  carried  out  much  faster  and  supposedly  at 
much  less  cost  than  experiments.  Such  calculations  are  also  not  limited  by  the 
physical  properties  of  the  samples  (stability,  vapor  pressure,  etc.).  As  a  result,  nu¬ 
merous  calculations  of  gas-phase  (intrinsic)  basicities  (or  proton  affinities)  of  neutral 
bases  have  appeared  [  5-7  and  references  therein  ] . 

However,  there  have  been  only  a  few  systematic  investigations  [5-10]  testing 
the  reliability  of  different  methods  for  prediction  of  proton  affinities  of  neutral  bases 
and  none  dealing  with  the  reliability  of  the  pm3  method  on  the  large  scale  of  com¬ 
pounds.  Therefore  the  current  investigation  was  undertaken  to  evaluate  the  reliability 
of  pm3  method  for  predicting  gas-phase  proton  affinities. 

A  useful  way  to  compare  the  experiment  and  theory  is  to  calculate  the  linear 
regression  between  the  theoretical  and  experimental  quantities  and  to  calculate  the 
mean  difference  (unsigned  average  error)  between  the  experimental  and  theoretical 
quantities.  The  degree  of  agreement  between  the  two  is  then  reflected  by  the  slope 
and  intercept  of  the  correlation  line,  the  unsigned  average  error,  the  standard  de¬ 
viation  from  correlation  line,  and  the  correlation  coefficient.  The  last  two  depend 
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on  the  scatter  of  the  points  about  the  correlation  line.  A  slope  different  from  unity 
and  a  nonzero  intercept  imply  systematic  deviation  between  experiment  and  theory 
and  means  that  the  theoretical  values  of  relative  proton  affinities  will  be  system¬ 
atically  either  too  high  or  too  low.  A  nonzero  mean  difference  between  theory  and 
experiment  reflects  an  overall  bias  in  the  absolute  values  of  the  calculated  proton 
affinity. 


Method 

The  proton  affinity  (pa)  of  compound  B  was  found  as  the  heat  of  reaction  for 
the  proton  addition  equilibrium  to  form  the  conjugate  acid  BH  ^ : 

B  +  PA(B)^  AHj(B)- AH,(BH^)  (1) 

where  AH/  ( H*),  AHj  ( B),  and  Alf  ( BH*)  are  the  standard  heats  of  formation 
for  proton,  neutral  base  and  its  conjugated  acid,  BH^.  In  the  case  of  proton  the 
experimental  heat  of  formation  (367.2  kcal/mol  [11])  was  used  instead  of  the 
calculated  value  (353.6),  because  it  improves  the  calculated  proton  affinities  con¬ 
siderably.  We  acknowledge  that  this  is  not  quite  a  correct  procedure,  despite  the 
fact  that  such  a  way  was  also  used  in  earlier  works  on  semiempirical  calculations 
of  acidities  and  basicities  [7,12], 

Tht  calculations  were  carried  out  on  MicroVA.x  II  and  vax  8650  computers 
using  the  standard  pm3  procedure,  as  implemented  in  the  mopac  6.0  program 
package  [13].  All  geometries  were  fully  optimized  by  minimizing  the  energy  with 
respect  to  all  geometrical  variables  without  using  the  aid  of  symmetry.  In  order  to 
avoid  premature  completion  of  calculations  by  reaching  some  local  minimum,  the 
calculations  were  carried  out  starting  off  from  several  different  initial  geometries. 


Results  and  Discussion 

Proton  affinities  for  1 19  compounds,  calculated  using  pm3  method,  are  given  in 
Table  1,  along  with  the  corresponding  "experimental”  values.  These  particular 
compounds  were  chosen  because  they  cover  a  wide  range  of  basicities  for  different 
classes  of  neutral  Bronsted  bases  (C-.  0-,  N-,  S-.  P-bases;  alcohols,  amines,  hydro¬ 
carbons.  etc.). 

The  results  of  statistical  (regression)  analysis  of  the  relationship  between  the 
calculated,  pa  (calc),  and  experimental  proton  affinities,  pa  (exp)  according  to  for¬ 
mula 


/’.•I(calc)  =  a  + />-P.'I(exp)  (2) 

are  presented  in  Table  II  (see  also  Fig.  1 ),  where  N  is  the  number  of  points,  pa""" 
is  minimal,  and  PA*""*  maximal  experimental  gas-phase  proton  affinity  in  a  con¬ 
sidered  group;  Apa  is  the  average  unsigned  error  in  proton  affinities,  R  is  the  cor¬ 
relation  coefficient,  a  is  the  standard  deviation,  and  7  is  the  average  unsigned  error, 
when  PAcaic  is  corrected  using  formula 
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Table  I.  Companson  with  expenment  of  pm3  heats  of  formation  and  proton  affinities  (kcal/mol, 
expenmental  values  from  Refs.  [I  I].  [IS],  and  [16]). 


Calc  AHf  PA  Error 


B 

BH* 

Calc 

Exp 

Error 

Exp 

AH,(B) 

in  calc 
AHKB) 

Carbon  bases 

Methane 

-13.0 

228.9 

125.2 

132.0 

-6  8 

-17.8 

4.8 

Ethane 

-18.1 

221.5 

127.6 

143.6 

-16.0 

-20  1 

2.0 

Ethene 

16.6 

222.5 

161.4 

162.6 

-1.2 

12.5 

4.1 

Propene 

6.4 

197.3 

176.3 

179.5 

-3.2 

4.8 

1.6 

I  -methylcylcohexene 

-14.4 

168.6 

184.2 

197.5 

-13.3 

-10.3 

-4.1 

Benzene 

23.5 

212.5 

178.2 

181  3 

-3.1 

19.8 

3.7 

Naphthalene 

40.7 

218.1 

189.8 

194.7 

-4  9 

35.9 

4.8 

Toluene 

14.1 

197.5 

183.8 

189.8 

-6.0 

12.0 

2.1 

Styrene 

39.2 

210.6 

195  8 

200.9 

-5.1 

35.3 

3.9 

Hexamethyibcnzene 

-20.9 

151.0 

195.3 

207.0 

-11.7 

-21.0 

0.1 

pMe-C8H«-C(Me)— CH2 

21.8 

183.9 

205.1 

211.2 

-6.1 

1 .3,5-tnmetho\ybenzene 

-90.8 

73.6 

202.8 

220.0 

-17.2 

-86.7 

-4.1 

Acetylene 

50.7 

263  9 

154.0 

153.3 

0.7 

54.5 

-3.8 

Propyne 

40.2 

238.2 

169  2 

182.0 

-128 

44  6 

-44 

CO 

-19.7 

176.9 

170.6 

168.4 

2.2 

-26.4 

6.7 

Nitrogen  bases 

Nj 

17.6 

237.2 

147.6 

118.2 

29.4 

0.0 

17.6 

Ammonia 

-3.1 

153.4 

210.7 

204.0 

6.7 

-11.0 

79 

HNF2 

-11.4 

193  4 

162  4 

160.0 

24 

-160 

46 

NF, 

-24.4 

201.7 

141.1 

136.9 

4.2 

-31.0 

6.6 

Methylamine 

-4.0 

153.3 

209.9 

214.1 

-4.2 

-5.5 

1.5 

Ethylamine 

-12.5 

144.3 

210.4 

217.0 

-6.6 

-11.3 

-1.2 

Propylamine 

-17.7 

138.5 

211.0 

217.9 

-6.9 

-17.3 

-0.4 

Iso-propylamine 

-17.3 

135.1 

214.8 

218.6 

-3.8 

-20.0 

2.7 

Butylamine 

-21.7 

132.9 

2126 

218.4 

-5.8 

-22.0 

0.3 

Iso-butylamine 

-23.0 

131  8 

212.4 

218.8 

-6.4 

-23.6 

0.6 

rcrt-butylamine 

-18.7 

125.7 

222.8 

220.8 

20 

-28.9 

10.2 

Neopentylamine 

-26.9 

125.4 

215.0 

219.3 

-4.3 

-31.0 

4.1 

Cyclohexylamine 

-26.0 

125.8 

215.4 

221.2 

-5.8 

-25.0 

-1.0 

2-aminonorbornane 

-9.5 

142.3 

215.4 

221.7 

-6.3 

-8.0 

-1.5 

Etyhdeneamine 

9.5 

168.5 

208.2 

213.9 

-5.7 

2.0 

7.5 

Dimethylamine 

-5.4 

152.6 

209  3 

220.6 

-II.3 

-4.4 

-1.0 

N-ethylmethy!amine 

-1.3.0 

143.7 

210.4 

222.8 

-12.4 

-11.0 

-2.0 

Dirthylamine 

-20.3 

135.0 

211  9 

225.9 

-14.0 

-17.4 

-2.9 

Hydrazine 

20.7 

191.8 

196.0 

204.7 

-8.7 

22.8 

-2.1 

1.2-diaminoethane 

-3  9 

146  4 

2168 

225.9 

-9.1 

-4.3 

0.4 

1.3-diaminopiopane 

-9.3 

138.9 

218.9 

234.1 

-152 

-4  2 

-5.1 

3-aminopropan-l-ol 

-56.4 

92.5 

218.3 

228.6 

-10.3 

-52.0 

-4.4 

1,4-diaminobutanc 

-17.4 

129.7 

220.1 

237.6 

-17.5 

-13.0 

-4.4 

(CH,)2N(CH2)4N(CH3)2 

-26.0 

134.4 

206.8 

240.6 

-33.8 

1 .8-diaminonaphialene 

43  3 

184.2 

226.3 

223.8 

2.5 

46.0 

-2.7 

Trimethylamine 

-109 

155.6 

200.7 

225.1 

-24.4 

-5.7 

-5.2 

N.N-dimethylcthylamine 

-15.9 

143.2 

208.1 

227.5 

-19.4 

-11.0 

-4.9 

N.N-dimethylmethylamme 

-20,5 

I4I.9 

204  8 

230.0 

-25.2 

-17.0 

-3  5 

{Omiinued) 
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Table  1.  {Continued) 


Calc  AHf 

PA 

Error 

Exp 

AH,(B) 

Error 
in  calc 
AHr(B) 

B 

BH* 

Calc 

Exp 

Nitrogen  bases 

Tributylamine 

-58.8 

92.3 

216.1 

236.0 

-19.9 

-53.0 

-5.8 

N-methylformamide 

-43.7 

133.5 

190.1 

204.6 

-14.5 

-45.0 

1.3 

Aniline 

21.3 

175.1 

213.4 

209.5 

3.9 

20.8 

0.5 

Pyrrolidine 

-12.0 

144.8 

210.5 

225.2 

-14.7 

-0.8 

-11.2 

Piperidine 

-16.5 

137.2 

213.5 

226  4 

-12.9 

-11.7 

-4.8 

2,2,6, 6-tetramethyi- 

piperidine 

-39.7 

105.4 

222.1 

231.7 

-9  6 

-38.0 

-1.7 

l-methylpyrrolidine 

-14.6 

143.7 

208.9 

228.7 

-19.8 

-0.5 

-14.1 

1-methylpipendine 

-19.3 

136.2 

211.7 

229.7 

-18.0 

-12.0 

-7.3 

Quinuclidine 

-13.1 

142.9 

211.2 

232.1 

-20  9 

-1  0 

-12.1 

l-asabicyclo[2,2,2] 

-oct-2-ene 

16.8 

171.1 

212.9 

228.5 

-15.6 

37.0 

-20.2 

Pyndine 

30.4 

187.3 

210.3 

220  8 

-10.5 

58  0 

-27.6 

2-methylpyridine 

21.1 

174.7 

213.6 

225.0 

-114 

23.7 

-2.6 

3-methylpyridine 

20.8 

175.7 

212.3 

224.1 

-11.8 

25.4 

-4.6 

4-methylpyridine 

20.8 

175.1 

212.9 

225.2 

-12.3 

24.8 

-4.0 

Tetrazole 

86.3 

261.1 

192.4 

197.8 

-5.4 

80.0 

6.3 

HCN 

33.0 

213.6 

186.6 

171.4 

15.2 

32.3 

0.7 

Acetonitrile 

23.3 

197.9 

192.6 

188.4 

4.2 

18.0 

5.3 

Butyronitnle 

14.7 

188.0 

193.9 

192.4 

1.5 

7.0 

7.7 

(CH)jC=C(CN)j 

169.2 

353.1 

183.3 

166.6 

16.7 

169.0 

0.2 

Trichloroacetonitrile 

18.5 

201.5 

184.2 

175.7 

8.5 

20.0 

-1.5 

FCN 

6.5 

196.0 

177.7 

163.0 

14.7 

Trifluoroacetonitnie 

-115.0 

78.6 

173.5 

164.1 

9.4 

-119.4 

4.4 

(CNh 

77.5 

266.6 

178.1 

163.5 

14.6 

73.8 

3.7 

Oxygen  bases 

O2 

-4.2 

257.8 

105.2 

105.5 

-0.3 

0.0 

-4.2 

Water 

-53.4 

159.1 

154.7 

166.5 

-11.8 

-57.8 

4.4 

Methanol 

-51.9 

156,6 

158.8 

181.9 

-23.1 

-48.2 

-3  / 

Ethanol 

-56.9 

143.9 

166.5 

188.3 

-21.8 

-56.1 

-0.8 

2-methyl-2-propanol 

-71.3 

120.0 

175.9 

193.7 

-178 

-74.7 

3.4 

2,2-dilluoroethanol 

-151.8 

64.7 

150.8 

175.2 

-24  4 

(CF3)3C0H 

-501.0 

-266.6 

132.7 

163.0 

-30.3 

-54.90 

48.0 

(CFjhCHOH 

-360.7 

-132.8 

139.4 

163.4 

-24.1 

-367.0 

6.4 

Methyl  ether 

-46.9 

157.0 

163.2 

192.1 

-28.9 

-44.0 

-2.9 

Ethyl  ether 

-59.6 

135.9 

171.7 

200  2 

-28  5 

-60.1 

0.5 

tert-butylmethyl  ether 

-64.4 

116.5 

186.3 

202.2 

-159 

-67  8 

3.4 

ter/-butyl  ether 

-75.0 

102.3 

189.8 

212.0 

-22  2 

-87.0 

12.0 

(CF3):0 

-381.4 

-134.9 

120.7 

158.0 

-37.3 

Carbondioxide 

-85.0 

139.5 

142.7 

130.9 

11.8 

-94.5 

9.5 

Formaldehyde 

-34.1 

166.3 

166.8 

171.7 

-4.9 

-26.0 

-8.1 

Acetaldehyde 

-44.2 

144.7 

178.3 

186.6 

-8.3 

-39.6 

-4.6 

F2CO 

-138.5 

75.4 

153.3 

158.2 

-4  9 

-153.0 

14.5 

Hexafluoroacetone 

-338.4 

-105.6 

134.5 

159.3 

-24.8 

-334.0 

-4.4 

PM3-CALCULATED  PROTON  AFFINITIES 


637 


Oxygen  basts 

CFjCOCl 

-194.3 

12.7 

160.3 

CF3COOH 

-244.0 

-39.8 

163  0 

Methylacetate 

-92  7 

89.3 

185.3 

Acetylacetone 

-86  9 

91.5 

188.8 

Formamide 

-41.8 

128.3 

197.1 

Sulphurdioxide 

-50.8 

135  3 

181.2 

Dimethylsulfoxide 

-38.6 

117.2 

211.4 

Methylsulfonylbenzene 

-40.6 

1 1 1.8 

214.8 

CF3SO2NH2 

-207.8 

-11.0 

170.4 

F2SO2 

-1843 

9.3 

173  6 

Tnmethylphosphineoxide 

-81.9 

51.3 

234.0 

(C3H,)3P0 

-101.4 

32.2 

233.6 

bis(dimethylamino)- 

mathylphophineoxide 

-85.1 

51.2 

230.9 

Diethylchloromethyl- 

phosphonate 

-191.9 

-46.6 

222.0 

Tnmethylphosphate 

-239  1 

-89.1 

2172 

Triethylphosphate 

-250.1 

-84.6 

201  6 

Hexamethylphosphoric 

-85  0 

53.7 

228.5 

Amide 

Phosphor  bases 

Phosphine 

0.2 

117.2 

250.3 

Methylphosphine 

-9.5 

114.8 

243.0 

Dimethylphosphine 

-18.5 

112  8 

235.9 

Tnmethylphosphine 

-28.6 

109.7 

228.8 

Triethylphosphme 

-35.2 

109.3 

222.6 

Phosphorustrifluonde 

-252.2 

-70.4 

185.5 

Sulphur  bases 

H2S 

-0.9 

175.6 

190.7 

Ethanethiol 

-8  7 

161.4 

197.1 

/ert-butanethiol 

-13.2 

153.7 

200.3 

Methylsulhde 

-10.4 

159.4 

197.4 

Ethylmethylsulfide 

-14.5 

155.7 

197.C 

Propylsulfide 

-30.1 

135.3 

201.9 

Other  bases 
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Tabi  II  Rc'ults  of  statistical  anlavsis  of  calculated  (pm3)  and  expcnmental  proton  atfinities  in  terms 

of  Eq.  (2). 


N 

pa""" 

PA'”” 

APA 

a 

h 

R 

a 

7 

All 

119 

63  6 

240  6 

12.9 

28.3 

0.8 

0.884 

149 

12.8 

AH' 

118 

63.6 

240.6 

12.5 

n.i 

08 

0.901 

13.6 

12.2 

AlF 

Carbon 

60 

130.9 

237.6 

11.2 

14.4 

0.9 

0.937 

8.7 

7.4 

bases 

Nitrogen 

IS 

1.32.0 

220.0 

7.8 

9.4 

1  0 

0.962 

6.8 

5.8 

bases 

Oxygen 

SI 

1 18.2 

240.6 

11.7 

73.5 

0.6 

0.931 

66 

7.8 

bases 

35 

loss 

227.4 

14.5 

-12.7 

1  0 

0  875 

16.0 

12.8 

Phosphoi 

bases 

Phosphor 

6 

164.4 

232.0 

22.6 

149.9 

03 

0.408 

21.3 

42.1 

bases' 

Sulphur 

5 

164.-1 

232.0 

16.0 

97.9 

06 

0.711 

15.9 

18.9 

bases 

6 

171.1 

206.3 

7.0 

144.1 

03 

0.889 

20 

5.2 

'  Without  PHj 

^  Compounds,  calculated  by  Dewar  [7]  with  AMI. 


Figure  I.  Calculated  and  experimental  proton  affinities  tkcal/ mol)  foi  all  ba,ses  surveyed. 
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PArJ.  =  (3) 

Table  III  gives  analogous  results  for  the  statistical  analysis  of  AM  1 -calculated  proton 
affinities  (from  Ref.  [7]). 

The  average  unsigned  error  in  the  heats  of  formation  for  all  neutrals  is  7.9  kcal/ 
mol.  This  error  is  fairly  typical  for  the  pm3  method  and  is  in  keeping  with  what 
has  previously  been  found  [14].  At  the  same  time,  the  average  unsigned  error  in 
proton  affinities  of  the  same  collection  of  bases  is  12.9  kcal/mol,  compared  with 
6.0  kcal/mol  for  the  amI  method  obtained  by  us  from  analysis  of  results  of  cal¬ 
culations  for  60  compounds  presented  in  Ref.  [  7  ] .  But  when  we  compare  the  results 
only  for  compounds  calculated  by  both  methods,  the  average  unsigned  error  for 
the  pm3  method  reduces  to  1 1.2  kcal/mol,  which  is,  however,  still  larger  than  the 
same  quantity  for  the  am  1  method.  It  should  be  mentioned  that  in  the  case  of  pm3, 
nearly  one  fifth  of  calculated  proton  affinities  have  errors  greater  than  20  kcal/mol 
and  more  than  half  deviate  more  than  10  kcal/mol  (see  also  Fig.  1 ). 

From  Table  1  it  can  be  seen  that  extremely  large  error  (62.3  kcal/ mol)  corresponds 
to  PHj.  When  the  phosphine  is  excluded  from  the  statistical  analysis,  the  average 
unsigned  error  reduces  to  12.5  kcal/mol.  It  should  be  mentioned  that  alongside 
with  a  big  error  in  the  calculated  heat  of  formation  the  pm3  method  also  gives  an 
unrealistic  charge  distribution  for  phosphonium  cation — the  calculated  Mulliken 
charge  on  phosphor  atom  is  2.42. 

On  the  basis  of  the  results  of  the  overall  correlation  analysis  one  can  conclude  that 
in  the  calculated  proton  affinities  there  is  also  present  a  systematic  error,  as  indicated 
by  the  nonunity  slope  of  the  correlation  line  h  and  by  the  nonzero  intercept  a. 


Carbon  Bases 

The  average  unsigned  error  in  calculated  proton  affinities  for  the  carbon  bases  is 
7.8  kcal/mol,  which  is  considerably  higher  than  that  found  by  us  for  am  1  method 
(2.9  kcal  /  mol ).  The  correlation  between  experimentally  found  and  calculated  pro¬ 
ton  affinities  seems  to  be  reasonably  good  as  ev  idenced  by  the  lack  of  the  systematic 


Tabi.1.  hi.  Results  of  statistical  analvsis  of  proton  affinities,  calculated  with  AMI  method  in  terms  of 

Eq.  (2). 


A’ 

pa""" 

pa""' 

Apa 

a 

b 

R 

<T 

7 

All 

60 

1.30.9 

2.37.6 

6.0 

2.3.8 

0.9 

0.979 

4.8 

4.5 

Carbon 

bases 

9 

1.32.0 

194.7 

2.9 

9.5 

1.0 

0.987 

.3.7 

2.7 

Nitrogen 

bases 

.3.1 

171.4 

2.32  1 

5.8 

IbA 

0.6 

0.9.34 

.3.0 

38 

Oxygen 

bases 

12 

1.30.9 

202.2 

69 

.38.9 

0.8 

0  948 

5.4 

5.4 
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error  (see  results  of  correlation  analysis  in  Table  11),  but  the  existence  of  big  random 
errors  makes  one  prefer  the  AMl  method  for  calculation  of  proton  affinities  of 
carbon  bases. 

Nitrogen  Bases 

The  average  unsigned  error  in  calculated  proton  affinities  for  the  nitrogen  bases 
is  1 1 .7  kcal/mol,  which  is  again  considerably  higher  than  that  found  by  us  for  am  1 
method  (5.8  kcal/mol).  However,  it  seems  that  these  errors  are  systematic  for  both 
methods  while  the  slopes  of  correlation  lines  are  0.62  ±  0.04  and  0.63  ±  0.04  for 
pm3  and  amI  methods,  respectively,  and  the  application  of  empirical  correction 
reduces  the  errors  in  calculated  proton  affinities  considerably  (to  7.8  and  3.8  kcal/ 
mol  for  pm3  and  amI  methods,  respectively).  Once  again,  the  AMl  method  seems 
to  be  superior  for  the  calculation  of  proton  affinities  of  nitrogen  bases. 

Oxvgen  Bases 

The  average  unsigned  error  in  calculated  proton  affinities  for  the  oxygen  bases 
is  14.5  kcal/mol,  which  similarly  to  that  of  the  carbon  and  nitrogen  bases  is  still 
significantly  larger  than  the  average  unsigned  error  that  is  found  in  the  present 
work  for  AMl  method  (6.9  kcal/mol).  As  a  rule,  in  case  of  alcohols,  ethers,  and 
carbonyl  c<  mpounds  the  predicted  proton  affinities  are  lower  than  the  experimental 
values. 

Phosphor  Bases 

The  average  unsigned  error  in  calculated  proton  affinities  for  the  phosphor  bases 
is  22.6  kcal/mol.  However,  if  we  exclude  from  the  comparison  the  extremely  strongly 
deviating  value  for  the  phosphine  (vide  supra),  the  unsigned  average  error  reduces 
to  1 6.0  kcal  /  mol.  Still,  the  statistical  analysis  (see  Table  II )  shows  that  there  is  prac¬ 
tically  no  correlation  between  calculated  and  experimental  proton  affinities,  so  one 
must  exercise  extreme  caution,  when  using  pm3  for  investigation  of  phosphor  bases. 

Sulphur  Bases 

The  average  unsigned  error  in  calculated  proton  affinities  for  the  sulphur  bases 
is  7.0  kcal/mol,  which  when  compared  with  the  previous  groups  of  bases  is  a 
surprisingly  good  result. 


Conclusions 

On  the  basis  of  the  above  results  one  can  conclude  that  the  pm3  method  is  not 
as  good  as  am  1  for  the  calculation  of  proton  affinities  of  neutral  Bronsted  bases 
(see  Table  II).  At  the  same  time,  it  can  be  a  useful  tool  for  investigation  of  proton 
affinities,  when  the  am  1  method  is  not  usable,  e.g.,  for  hypervalent  compounds  of 
second-row  elements. 
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Abstract 

AM  1  and  MNCXD  semiempincal  electronic  calculations  have  been  carried  out  for  ZnO  surface  (( ZnO)  n . 
«  =  1 1 ,  1 6,  22,  24  33. 42,  and  44 )  cluster  models.  The  theoretical  results  can  be  summarized  as  follows 
( I )  the  energy  gap  HOMO-LUMO  is  shown  to  be  dependent  on  the  cluster  size  and  geometry  optimization, 
however,  better  agreement  of  this  gap  with  expenmental  data  is  obtained  with  cluster  of  limited  size 
(n  =  16,  22,  and  24).  (n)  The  Mulliken  charge  of  the  cluster  zinc  ion  is  invanant  with  the  optimiza¬ 
tion  procedure,  showing  an  average  value  of  0.52  a.u.  (MNDO)  and  0.62  a.u.  (  amI  ).  £  1993  John  Wiley 
&  Sons,  Inc 


Introduction 

Research  on  electronic  materials  as  photo- voltaic  cells  [1],  varistor  [2,3],  and  a 
wide  diversity  of  investigations  and  applications  related  to  defects  [4,5],  catalysis 
[6-8],  gas  sensing  [9],  chemisorption  [10],  and  semiconductivity  [11]  have  been 
extensively  reported  [12],  Furthermore,  the  investigation  of  small-particle  (ranging 
in  size  from  <10  A  diameter  to  100  A)  and  clusters  quantum  mechanics  [13],  is 
of  special  interest  to  the  fields  of  gas  chemical  sensor,  catalysis,  and  nanocrystalline 
material  [14],  Among  these  electronic  materials  ZnO  is  an  excellent  prototype 
[2,3,12]  for  the  study  of  interfacial  electronic  structure  and  transport  properties  in 
semiconductor  oxides. 

The  surfaces  of  wurtzite-type  semiconductor  ZnO  are  thus  of  considerable  interest 
for  electronic  and  catalytic  technological  applications  and.  in  particular  their  low- 
index  faces  [15]  have  been  extensively  studied.  There  are  three  natural  surfaces: 
the  Zn  polar  (0001 )  plane  where  the  Zn  ions  are  more  outwardly  positioned  than 
the  O  ions,  the  O-polar  (OOOT )  plane  where  the  O  ions  are  more  outwardly  positioned 
than  the  Zn  ions,  and  the  nonpolar  prismatic  ( lOTO)  plane  where  both  Zn  and  O 
ions  are  in  the  same  plane. 

There  is  theoretical  and  experimental  interest  in  the  study  of  the  crystal  growth 
mechanism  and  the  correlation  to  the  changes  in  the  electronic  properties.  It  is 
known  that  the  photocatalytic,  optical,  electrochemical,  and  electronic  property 
of  semiconductors  depend  on  the  size  of  the  crystal  [14],  It  is  well  known  that 
as  the  cluster  size  is  successively  increased,  the  molecular  property  reaches  semi¬ 
conductor  behavior  [14].  The  pnncioal  result  of  these  researches  are:  the  gap 
HOMO-LUMO  increases  while  the  binding  energy  decrease  as  the  size  of  the  cluster 
decreases. 
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Prev  ious  theoretical  cluster  calculations  of  the  zinc  oxide  surfaces  and  adsorption 
process  include  the  discrete  variational  (Dv)  X-n  model  [16.17],  tight-binding 
method  [18],  semiempirical  (INDO/S)  and  extended  Hiickel  methods  [19]  as  well 
as  the  self-consistent  field  X-«  scattered  wave  [20]  model. 

In  the  present  work,  am  1  and  MNDO  semiempincal  calculations  have  been  carried 
out  to  study  the  ZnO  cluster.  The  electronic  properties  as  function  of  cluster  size 
are  obtained  and  the  reliability  of  these  two  methodologies  in  describing  semicon¬ 
ductor  oxides  properties  is  analyzed. 

Method  and  Models 

The  use  of  cluster  models  has  some  advantages  for  the  chemical  analysis  of  semi¬ 
conductor  oxide  clusters  and  adsorption  processes.  Due  to  the  large  dimensions  of 
such  clusters,  a  quantum  chemical  study  requires  the  use  of  semiempirical  methods. 
In  this  work,  MNDO  (Modified  Neglect  of  Diatomic  Overlaps)  and  [21]  the  AMl 
(Austin  Model  1 )  [22]  approaches  and  its  parameterizations  [23]  have  been  em¬ 
ployed  in  the  RHF-lcaO-MO  framework. 

We  have  considered  seven  cluster  models  for  the  ZnO  ( 1 OTO )  and  ( 000 1 )  surface 
((ZnO)n.H=  1 1. 16. 22,24,33. 42.  and  44). Zincoxide  crystallizes  in  the  hexagonal 
structure  of  the  wurtzite  type  in  which  each  zinc  atom  is  at  the  center  of  a  distorted 
tetrahedron  coordinated  by  four  oxygen  neighbors  [  24  ] .  The  cleavage  planes  of 
low  index  nonpolar  (lOTO)  have  been  studied  by  theoretical  and  expenmental 
procedures  and  unlike  other  low-index  leed  studies  of  the  cleavage  planes  they  do 
not  present  a  distortion  of  the  geometry  of  the  surface  with  relation  to  the  bulk 
[25].  Therefore  the  geometry  of  the  ZnO  surface  model  is  taken  from  the  bulk 
geometry.  The  clusters  are  constructed  to  reproduce  the  crystal  growing  along  the 
caxis  [Figs.  1  (a)..  1(b).- 1  (c).  and  1(d)]  and  growing  along  the  a  axis  [Figs.  I  (e), 
1(f),  and  1(g)].  The  clusters  («  =  16,  22,  24.  and  33)  are  geometry  optimized 
with  constraints.  These  constraints  are  used  to  maintain  the  periodicity  of  the 
wurtzite  structure  of  the  cluster. 

Results  and  Discussion 


Nonoptimized  Models 

Figure  2  shows  the  results  for  amI  and  MNDO  methods  of  the  binding  energy 
against  l/n  (the  inverse  of  the  number  of  ZnO  units).  For  the  crystal  system 
(n  -►  X  )  the  value  of  extrapolated  binding  energy  is:  -3.8  eV  (mndo)  and  -3.0 
eV  (AMl ).  Therefore,  the  mndo  method  gives  a  more  stronger  bonding  than  AMl 
procedure  for  ZnO  units. 

Figure  3  presents  the  energy  gap  HOMO-LUMO  versus  the  inverse  of  the  number 
of  ZnO  units.  The  increase  of  n  decreases  the  gap  value,  which  is  in  accordance 
with  other  theoretical  calculations  [  1 2. 1 3  ] .  It  is  also  known  by  experimental  studies 
of  semiconductor  clusters  [14]  that  the  small  clusters  have  greater  gap  value.  They 
also  show  that  the  electronic  property  differs  from  the  bulk  properties  and  as  the 
size  increases  there  is  a  transition  from  molecular  clusters  to  semiconductors.  The 
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Figure  I  ZnO  surface  cluster  models:  (a)  ( ZnO )u :  ( b )  ( ZnO )j2 :  ( c )  ( ZnO Ijj;  ( d )  ( ZnO )« ; 

(e)  (ZnO),*;  (f) (ZnO)j«;  and  (g)  (ZnO)«. 


646 


MARTINS,  LONGO,  AND  ANDRES 


Figure  2.  Binding  energy  against  inverse  of  the  number  of  ZnO  units. 


extrapolation  of  the  gap  value  gives  as  results:  3.22  eV  (  am  1 )  and  2.42  eV  (  mndo)  . 
The  AM  1  gap  values  are  in  accordance  with  the  experimental  value  of  the  crystal 
bulk  ( 3.3  eV )  [  1 1  b  ] .  The  better  results  of  the  gap  value  are  obtained  for  the  clusters 
of  (ZnO)„  with  n  =  16,  22,  and  24  [Fig.  3(a)  and  3(b)].  The  gap  for  ZnO  cluster 
from  the  literature  calculations  has  been  reported  in  Table  I.  A  comparison  and 
analysis  of  the  different  results  shows  that  our  clusters  are  capable  of  reproducing 
the  experimental  energy  gap. 

The  behavior  of  the  HOMO  [Fig.  4(a)  and  4(b)]  with  the  increase  of  the  cluster 
size  is  in  accordance  to  other  theoretical  calculations  [13],  which  shows  that  the 
energy  of  the  homo  decreases  with  the  increase  of  the  cluster  size.  The  extrapolations 
of  the  homo  value  results  are:  6.14  eV  (mndo)  and  6.61  eV  (amI  ). 

It  is  well  known  that  charge  values  obtained  at  hf  level  produce  unphysical 
results.  However  in  the  present  work,  it  is  used  in  order  to  compare  previous  theo¬ 
retical  data.  The  Mulliken  charge  of  the  zinc  ion  in  the  cluster  has  an  average  value 
of  0.52  a.u.  (mndo)  and  0.62  a.u.  (amI  ).  The  results  of  zinc  charge  were  extrap¬ 
olated  to  obtain  n  -*■  co  (Fig.  5)  and  the  calculated  values  are:  0.60  a.u.  (amI  ) 
and  0.50  a.u.  (MNDO).  These  results  are  in  agreement  with  the  ab  initio  scf-ci 
calculation  for  the  diatomic  ZnO  (0.68  a.u.)  [26].  However,  INDO/S  calculation 
for  cluster  of  (ZnO)i3  produces  a  smaller  value  (0.79  a.u.)  [19] . 

The  results  of  binding  energy  and  homo-lumo  energy  gap  values  show  that 
there  is  a  correlation  between  the  model  with  the  growing  along  the  a  axis  [Fig. 
6(a)]  and  between  the  models  with  growing  along  the  c  axis  [Fig.  6(b)].  These 
sets  of  models  show  a  different  behavior  of  the  binding  energy  and  gap  variation 
with  the  increase  of  the  model  size  for  both  methods  employed. 
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0.0  0.1  0.2  0.3  0.4 
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Figures,  homo-lumo  energy  gap  (eV)  vs.  l/n.(a)AMl  results,  (o)mndo  results.  The 
continuous  curve  is  obtained  by  extrapolation  to  an  exponential  function. 


Optimized  ZnO  Models 

In  Table  II  mndo  and  am  1  values  for  the  geometric  optimization  of  the  clusters, 
n  =  16,  22,  24,  and  33  are  listed.  The  search  of  the  optimum  geometry  leads  to  a 
minimum  and  a  maximum  deviation  of  3.3  to  3.5%  and  3.5  to  3.6%  for  mndo 
and  AM  1  methods,  respectively.  These  geometric  parameters  do  not  diverge  sig- 
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Table  I.  Theoretical  and  expenmental  results  from  the  literature  for  the  homo-lumo  gap  of 

different  ZnO  clusters. 
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1  /  n 

Figure  5.  The  Mulliken  charge  of  zinc  ion  in  the  cluster  in  me  ( mili-electron ). 


nificantly  from  the  experimental  bulk  values  [24].  This  deviation  is  explained  in 
terms  of  the  experimental  results  cf  small  clusters  [14].  From  these  studies  it  is 
known  that  the  small  clusters  do  not  resemble  completely  the  properties  of  the 
crystal  bulk. 

For  AM  1  and  MNDO  optimum  geometry  of  such  models  the  gap  shows  a  better 
value  when  the  optimization  is  done.  In  addition,  the  gap  decreases  smoothly  with 
the  increase  of  the  size  of  the  cluster,  which  is  a  direct  consequence  of  the  opti¬ 
mization  procedure.  The  average  of  the  gap  value  of  ZnO  optimum  geometry  gives 
the  results:  3.19  eV  (mndo)  and  3.68  eV  (amI  ).  These  gap  values  are  better  than 
the  values  obtained  without  optimization  (Fig.  3).  These  changes  of  the  gap  value 
with  the  size  of  the  cluster  is  in  accordance  with  the  theoretical  [1 3]  and  experimental 
studies  [14]. 

The  charge  of  zinc  ion  in  the  cluster  shows  to  be  invariant  with  the  optimization 
procedure,  which  is  an  expected  behavior  because  the  coordination  of  zinc  is  not 
modified  by  the  optimization  procedure.  The  extrapolated  values  for  the  charge 
(Fig.  7)  are:  0.51  a.u.  (mndo)  and  0.64  a.u.  (amI  ). 

In  spite  of  the  fact  that  the  experimental  value  of  the  heat  of  formation  for  the 
zinc  oxide  molecule  in  a  gas  phase  at  298.15  K  is  not  reported,  this  value  for  the 
ZnO  molecule  is  calculated,  and  the  extrapolated  value  for  the  nonoptimized  clusters 
[Fig.  8(a)]  is:  4.74  kcal/mol  (mndo)  and  37.02  kcal/mol  (amI  )<  The  results  for 
the  optimized  clusters  [Fig.  8(b)]  are:  3.3®  kcal/mol  (mndo)  and  28  85  kcal/mol 
( AM  1 ).  As  could  be  noted  the  mndo  value  is  less  than  the  am  1  value  for  optimized 
and  nonoptimized  clusters.  It  is  well  known  from  the  zinc  parameterization  that 
the  error  for  zinc  dication  is  large  [23]:  83  kcal/mol  (MNDO)  and  64  kcal/mol 
(AMl ).  The  Dewar  and  Merz  [23]  AMI  optimized  parameters  of  the  two-electron 
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Figure  6  Binding  energy  and  homo-LUMO  gap  value  for:  (a)  n  =  16,  24. 42;  and 
(b)n=  11,22.  33,  and  44. 


integrals  are  taken  from  the  mndo  and  maintained  frozen  which  suggests  the  reason 
for  the  results  of  amI  and  MNDO  for  zinc  dication  is  closer.  Although  the  AMl 
results  from  the  literature  give  quite  good  values  for  heat  of  formation,  the  MNDO 
is  known  to  perform  poorly  for  diatomic  molecules.  This  difference  from  AMl  and 
MNDO  appears  on  our  cluster  models  results. 
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Table  II.  Geometrical  parameters  obtained  for  the  optimized  clusters. 


(ZnO)>i 

n  = 

16 

n  = 

22 

n  = 

24 

n  = 

33 

Exp.* 

MNDO 
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AMI 
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AMl 
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2.006 

1.929 

2.007 

1.973 

r2(A) 

(1.992) 

2,300 

2.158 

2.274 

2.168 
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2.275 

2.23 

1.992 

a  (A) 

3  125 

3  272 

3,183 

3.296 

3.195 

3.317 

3.193 

3317 

3.250 

c(A) 

5.352 

5.783 

5,5.37 

5  752 

5.540 

5.580 

5  742 

5.687 

5.207 

alpha 

111.6 

111  1 

111.5 

111.5 

111.3 

II  1.2 

II  1.3 

II  1.3 

110.8 

beta 

122  7 

107.1 

108.0 

107  7 

107.5 

107.7 

106.3 

107.3 

108.1 

*  rl  and  r2  are  the  two  distances  Zn-O.  a  and  c  are  the  cell  parameters,  alpha  and  beta  are  the  angles 
(degrees)  of  ZnO  crystal.  Value  in  parenthesis  is  not  fully  optimized.  Experimental  values  are  from 
Ref.  [24). 


Conclusions 

The  theoretical  description  of  electronic  and  geometric  properties  of  semicon¬ 
ductor  oxides  by  using  different  cluster  models  with  AMl  and  .mndo  methods  is 
presented  as  an  auxiliary  tool  for  solid  state  studies.  The  geometry  optimization 
showed  how  to  improve  the  description  of  the  electronic  property  of  the  zinc  oxide. 

MNDO  method  presents  the  greater  binding  energy  between  ZnO  units.  However, 
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Figure  7.  Zn  ion  charge  in  ihe  cluster  for  the  optimized  models. 
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•  MNtXD 


•  AMI 

•  UHX) 


1 1n 

Figure  8.  Heat  of  formation  for  the  ZnO  molecule:  (a)  optimized  and  (b)  nonoptimized 

clusters. 


for  the  gap  value  of  nonoptimized  models  the  am  1  method  shows  to  be  in  accordance 
with  the  experimental  value  of  the  crystal  bulk.  The  amI  Mulliken  charge  of  zinc 
ion  in  the  cluster  shows  to  be  in  agreement  with  others  theoretical  calculations. 
The  optimization  process  for  cluster  models  is  necessary  in  order  to  obtain;  (i) 
reability  geometries,  (ii)  gap  values,  and  (iii)  Mulliken  charges. 
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Abstract 

Hopping  conductivity  becomes  activationless  ( metallic )  when  twice  the  intersite  coupling  supercedes 
the  reorganization  energy.  Consistent  with  this  idea,  activationless  electron  pair  transfer  may  be  considered 
as  a  condition  for  the  appearance  of  superconductivity  For  electron  pairs  the  condition  is  slightly  more 
complicated  than  for  single  electrons.  The  activation  energy  depends  on  the  relative  energies  between 
ionic  states  and  a  valence  bond  stai.’  The  latter  serves  as  an  intermediate  state,  contributing  to  the 
disanpearance  of  the  activation  barner.  In  AjCj,.,  that  pamng  occurs  because  the  '  Ag  state  of  the  negative 
ions  Cm  and  CJo  is  stabilized  b^  coi  relation  effects  and  is  the  ground  state.  The  correlation  effects  are 
primarily  due  to  the  closeness  in  energy  a  td  different  panty  of  the  and  orbitals,  which  are  partly 
occupied  in  the  negative  ions,  but  empty  in  neutral  Cm  The  small  interelectronic  lepulsion  and  the 
polarization  energy  fron.  ’  le  environ  nent  also  contnbute  to  the  stability  of  the  evenly  negative  ions 
©  1993  John  Wiley  &  Sons.  Inc 


Introduction 

Semiconductors  are  in  traditional  models  considered  to  have  a  finite  band  gap 
which  may  be  determined  by  observing  the  temperature  dependence  of  the  con¬ 
ductivity.  This  is  provided  that  other  activation  processes  can  be  ignored,  which  is 
unfortunately  seldom  the  case.  Trapping  of  the  carriers  occurs  and  contributes  to 
parts  or  all  of  the  activatioi.  ene.gy,  as  has  been  discussed  in  classical  papers  [1- 
4],  Alternatives  to  the  band  model  are  cluster  models,  large  enough  to  represent 
the  bulk  in  the  sense  that  the  activation  energy  for  conductivity  due  to  structure 
relaxation  can  be  calculated.  The  working  hypothesis  in  such  a  model  is  that  the 
activation  energy  has  the  same  origin  as  in  “chemical”  electron  transfer  [5,6].  In 
tiA'  Marcus  model  [7],  essentially  equivalent  to  the  models  of  Refs.  [1-4],  the 
condition  for  activationless  transfer  is  easily  derived  and  is  simply  that  twice  the 
intersite  coupling  matrix  element  supercedes  the  reorganization  energy  [  5  ] .  This 
is  then  a  condition  for  metallic  conducti  'ity  as  well.  As  we  will  see  later,  the  cluster 
model  has  the  advantage  that  a  valence  bond  state,  without  charge  fluctuations 
between  sites,  is  easily  distint  !■  hed  from  ionic  states.  The  former  leads  to  a  magnetic 
coupling  between  electron  sp  s  on  adjacent  sites  whereas  the  latter  correspond  to 
doubly  charged  polarons. 
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In  many  cases,  for  example  the  well-known  Mott  insulators  MnO,  FeO,  CoO, 
NiO,  and  CuO,  there  is  no  energy  gap  separating  the  valence  band  from  the  con¬ 
duction  band  [  8  ]  at  least  not  in  the  traditional  band  model.  Li20  doping  introduces 
Ni^^  sites  among  the  Ni‘^  sites.  The  band  model  cannot  be  used  any  longer  since 
the  translational  symmetry  is  lost.  The  conductivity  is  activated  and  very  likely  due 
to  reorganization  just  as  is  the  case  of  mixed  valance  systems,  for  example,  KMn04- 
K2Mn04.  In  the  latter  system,  the  same  activation  energy  is  obtained  for  electron 
transfer  in  solution  as  for  conductivity  in  the  solid  phase  [9,10]. 

Similarly,  one  may  regard  superconductivity  as  barrierless  electron  pair  transfer 
[11,12].  Electron  pair  transfer  has  been  extensively  studied  in  inorganic  chemistry 
and  follows  the  same  rules  as  transfer  of  single  electrons  [  1 3  ] .  An  important  problem 
to  solve  in  that  connection  is  why  electrons  sometimes  prefer  to  be  paired  on  a 
single  site,  "i  he  most  outstanding  examples  are  the  “negative  I' compounds.”  The 
latter  term  refers  to  the  energy  U  needed  for  the  disproportionation  reaction: 

Me"^'  +  Me"^'  Me”  +  Me"^- . 

In  chemistry  such  compounds  are  well  known  and  well  understood  [13-16].  Ex¬ 
amples  are  As^VAs5\  Bi5VBi^^  Sn*VSn''%  Pb-VPb''^  and  T1VT1^\  In  all 
these  cases  the  sites  are  different  in  the  occupation  of  a  6s  ^  electron  pair.  Since  the 
intermediate  valence  state  is  unstable,  electron  transfer  between  the  ions  must  be 
a  transfer  of  electron  pairs. 

In  the  present  paper,  the  conditions  for  pair  formation  in  the  negative  ions  of 
Cm  will  be  studied.  Several  alkali  compounds  of  Cbo  of  the  type  KaCso  have  been 
found  superconducting  below  18-30  K  [17-20].  More  recently  the  compound 
CasCeo  has  been  found  to  be  superconducting  below  8.4  K  [21]. 

In  the  case  of  A3C(K)  the  above-mentioned  disproportionation  (charge  fluctuation) 
nechanism  may  be  written  as 

2  Clo  “*■  Cm  +  Cm 

and  has  been  taken  as  the  basis  for  an  explanation  of  superconductivity  in  a  number 
of  papers  [11.  22-25  ] .  It  has  been  shown  that  disproportionation  and  formation 
of  singlet  ground  states  is  energetically  favorable.  The  pairing  mechanism  is  the 
same  in  Refs.  [11]  and  [25].  This  stabilization  will  be  discussed  here  for  0*0 . 

In  a  subsequent  step  we  make  plausible  that  the  activation  energy  for  transfer  of 
an  electron  pair  is  zero.  As  we  shall  see,  an  important  reason  is  that  the  valence 
bond  state,  corresponding  to  two  ions,  is  close  in  energy  and  interacts  with  the 
corresponding  ionic  states  [11], 

One-Electron  Transfer 

Two  adjacent  sites  A  and  B  with  ’dentica!  structure  are  assumed  to  have  orbitals 
<Pa  and  v’h-  respectively,  which  can  exchange  electrons.  The  eigenvalue  problem  for 
the  one  electron-two  site  problem  may  be  written  as 

■  //aa  -  E  Wab  WCOS  |7\  ^  /COS 

//ha  //hh  -  «/Uin  I?/  \sint?j' 


(1) 
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where 


tau  2»7  = 


2//ab 

//aa  ^bb 


(2) 


(i7  =  7r/4  for  //aa  =  //bb)-  If  ffab  is  large  compared  to  the  thermal  fluctuations  in 
l//aa  “  //bblj  is  close  to  IT /4,  which  means  that  the  coefficients  cos  d  and  sin  d 
have  about  the  same  magnitude  and  that  the  localization  at  either  site  is  weak.  If 
//ab  is  small,  localization  occurs  already  for  a  small  |//aa  -  //bbl  • 

In  the  case  of  the  Mott  insulators  the  coupling  //ab  is  expected  to  increase  in  the 
series  MnO  -*■  CuO,  the  reason  being  that  the  3d  orbital  energy  of  the  unfilled 
metal  shell  decreases  relative  the  O  2p  energy  [  26  ] . 

//aa  and  //bb  are  energies  in  the  absence  of  interactions  between  sites,  which  in 
the  harmonic  approximation  are  written  as  a  function  of  distortions  from  an  equi¬ 
librium  geometry  [5,15,16]: 

//aa  =  2  +  -V,o)%  //bb  =  2  -  .V,o)'  ■  (3) 


The  vertical  excitation  energy  X  is  the  reorganization  energy  needed  to  move  the 
electron  without  bond  or  solvent  readjustments,  i.e.,  keeping  x,  constant.  X  consists 
of  bond  length  changes  when  electrons  are  added  to  or  removed  from  a  site.  In 
general  for  large  n  systems  the  bond  length  change  is  small  since  the  molecular 
orbitals  are  distributed  on  many  atoms.  Each  bond  order  change  is  then  small  and 
hence  the  corresponding  bond  length  change.  In  naphthalene  the  reorganization 
energy  for  addition  of  an  electron  has  been  determined  to  0.2  eV,  but  for  Cbo  or 
any  of  its  ions  it  is  likely  to  be  smaller.  The  main  contribution  to  X  is  then  from 
the  change  of  equilibrium  geometry  of  the  alkali  ions. 

If  we  consider  only  a  single  coordinate,  Eq.  (3)  may  be  written 

//aa  =  -  QoV  +  +  <7o)^  and 

//bb  =  +  H'(92  -  ■ 

The  minimum  energy  path  satisfies  q  =  q\  =  -qi  to  a  good  approximation  and 
heiice 


U^  =  k{q-qo)‘  and  Hbb  =  k{q  +  qof. 
Energy  minima  are  obtained  for 


2  \l/2 


(5) 


(6) 


There  are  two  minima  for 


Akq'o  =  X  >  2//3b  (7) 

and  otherwise  just  one  (for  ^  =  0).  In  the  latter  case  ihe  electrons  are  transfeired 
without  activation  energy.  Equation  (7)  is  the  condition  for  activationless  transfer 
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in  the  cluster  model  and  hence  directly  related  to  the  metal-semiconductor  tran¬ 
sition. 

From  the  equations  above  it  follows  that  the  larger  the  coupling,  the  smaller  the 
activation  energy.  In  the  series  MnO  -►  CuO,  the  coupling  increases,  as  mentioned 
above,  and  hence  the  activation  energy  decreases  towards  the  end  of  the  series,  in 
agreement  with  the  experiments  [27  ] .  In  binuclear  complexes  there  is  a  well-known 
example  when  the  activation  energy  disappears  for  intermetal  et,  namely,  the 
Creutz-Taube  complex  for  which  all  experimental  information  supports  a  delo¬ 
calized  picture  [28-30]: 

For  A3C60  we  may  use  the  bandwidth  calculation  of  Ref.  [31]  where  a  bandgap 
of  about  0.5  eV  is  calculated  for  the  tu,  orbital.  The  bandwidth  for  a  linear  chain 
of  molecules  in  the  tight  binding  approximation  is  4//ab  and  thus  //ab  =  0.12  eV. 
We  have  assumed  that  each  single  /|„  interacts  only  along  one  direction.  The  re¬ 
organization  energy  due  to  bond  changes  in  each  Cbo  ion  is  small,  as  we  have  seen. 
There  may  be  a  reorganization  energy  connected  to  the  motion  of  the  alkali  atoms. 
This  energy  cannot  be  accurately  calculated  at  the  moment,  but  it  must  be  larger 
for  small  alkali  ions  than  for  larger  ones.  A  smaller  alkali  atom  has  a  larger  space 
to  move  in,  and  the  reorganization  energy  is  then  larger.  We  thus  predict  that  the 
tendency  for  metallic  conductivity  above  Tc  is  greater  the  larger  the  alkali  atom. 
This  is  consistent  with  experimental  results  for  doped  AjCw)  above  [32-34]. 

Pair  Formation  and  Disproportionation  in  A3C60 

The  diamagnetic  properties  of  AjC^  can  be  explained  if  there  is  a  dispropor¬ 
tionation  of  the  Ceo  ions  into  and  ions  and  the  latter  ions,  with  the  electronic 

configurations  and  respectively,  have  singlet  ground  states.  The  energies  of 
the  negative  ions  of  may  be  estimated  using  Koopmans’  theorem  in  a  Hartree- 
Fock  calculation,  using  the  semiempirical  parameters  of  the  cndo/s  method  [  35  ] . 
Such  a  calculation  gives  a  good  agreement  with  the  experimental  spectrum  of  Cbo 
[36  ] .  In  the  CNDO/S  approximation  the  value  adapted  for  the  one-center  repulsion 
integral  k  U  =  I  -  A,  which  is  consistent  with  its  definition  in  local  exchange 
models  [37].  Two-center  integrals  are  estimated  using  the  Mataga-Nishimoto  ap¬ 
proximation.  For  Cfto  we  obtained,  using  the  CNDO/S  approximation,  the  ionization 
energy  /(Cm)  =  7.55  eV  and  electron  affinity  /l(C6o)  =  2.36  eV  [35].  The  exper¬ 
imental  values  are  7.6  and  2.65  eV,  respectively  [38-40] .  We  are  thus  only  slightly 
overestimating  I  -  A  for  C(,o.  For  the  negative  ions  we  have 

/(Cj!^)  =  ^(C6o)=  2.36  eV, 

Hdo)  =  A{C}^)  =  ncls)  -  U  .  (8) 

A  CNDO/S  calculation,  using  the  same  orbitals  as  for  Cm,  gives  U  =  2.65  eV.  If  the 
experimental  value  is  used  for  the  electron  affinityof  Cm,  we  thus  obtain  /(Cm  )  — 
0  for  a  theoretical  multiplet  average. 

To  estimate  the  stability  of  Cm  in  gas  phase,  it  is  necessary  to  calculate  the 
multiplet  structure  forthe  configuration  and  the  correlation  energy.  The  multiplet 
structure  is  equivalent  to  the  case  of  the  *S,  'D,  and  multiplets  of  the  atomi" 
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carbon  2p-  ground  stole  configuration.  For  C6o  we  obtain  for  the  exchange  integral 
between  two  different  t|„  orbitals  as  K  =  0.078  eV.  which  is  very  small  compared 
to  the  corresponding  value  for  the  carbon  atom  {K  =  0.63  eV ),  indicating  a  much 
less  stabilization  of  the  triplet  state  in  than  in  the  C  atom. 

Electronic  correlation  is  more  important  in  the  singlet  states  than  in  the  triplet 
state.  The  most  obvious  correlation  is  due  to  the  closeness  in  energy  of  the  unoc¬ 
cupied  t[g  orbital  [35].  The  energies  after  accounting  for  the  t]u  l\g  correlation 
are 


E('A^)=  £av  +  4A:-6.0A:', 

E{'H^)  =  +  \.iK\  (9) 

Here  £a%  is  the  multiplet  average.  K  is  due  to  multiplel  splitting  and  AT  to 
correlation.  Thus  the  state  is  the  ground  state  in  this  simple  calculation.  Other 
correlation  effects  lead  to  a  further  stabilization  of  the  state.  As  a  comparison, 
A"  for  the  corresponding  correlation  for  the  C  atom  (2p^  -►  3d^)  is  relatively  small 
since  the  orbitals  involved  belong  to  shells  with  different  principal  quantum  numbei. 
Thus  the  triplet  state  remains  the  ground  state  for  the  C  atom.  Low  spin  ground 
states  seem  to  be  favored  for  all  the  negative  ions  of  due  to  the  many  possibilities 
of  correlating  two  electrons  with  different  spins.  The  results  of  this  section  are 
consistent  with  the  observation  of  in  gas  phase  [  4 1  ]  and  with  other  theoretical 
results  (42).  Of  course,  methods  which  only  include  multiplet  structure  predict  a 
triplet  ground  state.  Some  ESR  experiments  seem  to  indicate  a  low  spin  ground 
state  for  Cm  [43]  whereas  others  contradict  this  observation  [44]. 

For  CasCw)  the  /|„  level  is  completely  filled  and  the  t2g  level  occupied  by  four 
electrons.  It  is  important  that  there  is  correlation  stabilization  also  in  this  case.  For 
Cm  the  important  configuration  is  provided  by  the  t]g  -*■  t\„  replacement.  In  our 
calculations  we  find,  however,  that  the  'A^  state  is  not  sufficiently  stabilized  to 
become  the  ground  state.  The  off-diagonal  interelectronic  matrix  elements  are  about 
the  same  but  the  energy  splitting  between  /|^  and  i2u  is  larger  than  between  /lu  and 
tig.  This  calculation  is  based  on  the  orbitals  of  Cm  •  There  is  a  large  increase  of  the 
tiu  energy  compared  to  the  hg  energy  between  Cm  and  Cm  due  to  occupation  of 
the  tiu  orbital.  Our  calculations  are  incomplete  in  the  sense  that  we  have  not  op¬ 
timized  the  geometry  of  Cm  which  may  be  expected  to  lower  the  t2„  energy  compared 
toii^.  Furthermore,  a  number  of  important  correlation  effects  are  still  omitted  from 
our  calculations. 

Configurations  of  the  type  <pl  -*■  <pI  or  tpl  -►  (pi,  with  <pg  and  (pg  localization  to 
the  same  region  of  space,  are  important  for  pairing  since  it  gives  the  electrons  a 
possibility  to  avoid  each  other.  A  simple  analysis  shows  that  the  two  electrons  with 
different  spins  tend  to  be  on  different  sides  of  the  Cm  molecule.  The  spatial  part  of 
the  two-electron  wave  function  may  be  written  as 


'*'(ri,r2)  =  [C(rtP„{r,)<p„(r2)  -  Ci(pg{T,)<pg{r2)] , 


(10) 


660 


LARSSON  AND  RODRIGUEZ-MONGE 


where  Co  and  Ci  are  positive  numbers  and  and  ipg  antisymetric  and  symmetric 
orbitals,  respectively.  From  Eq.  (10)  follows  that 

l^(ri,-ri)|  - ’^'(ri.r,)  =  2C,^5^(r,)ip^(r,)  >  0  .  (11) 

In  our  calculation  Ci  is  as  large  as  0.5,  which  means  a  case  of  strong  Coulomb 
correlation.  Fermi  correlation,  which  is  in  operation  in  the  triplet  states,  permits 
one  electron  to  be  at  the  equator  while  the  other  one  with  the  same  spin  is  at  one 
of  the  poles.  In  the  C  atom  such  a  singlet  pairing  cannot  occur  due  to  the  absence 
of  2d  orbitals,  and  therefore  Hund’s  rule  is  valid. 


Valence  Bond  and  Ionic  States 


Let  us  consider  two  C^o  molecules  A  and  B  along  the  x-axis  both  with  filled 
^i«(>’)  orbitals.  There  are  two  additional  electrons  which  are  distributed  on  the  two 
tiu(x)  orbitals  and  <p\,  on  the  sites  A  and  B,  respectively.  The  latter  two  electrons 
form  four  states:  a  triplet  state,  a  singlet  valence  bond  (vb)  state  ^vb,  and  two 
singlet  ionic  states  and  ^b^ 

^vb(l,2)=  [1/2V(1  +5^)](^a¥>b  +  ¥’b^a)(«l3-/3«),  (12) 

^a(  1,2)  =  ^a^a( a/3  “  /5a )/  V2  and  ^b  =  Wb( a^  -  ^a )/  V2  . 

S^a  and  ^'b  correspond  to  the  states  after  disproportionation.  The  energy  for  dis¬ 
proportionation  in  gas  phase  is 


A£=<^a|Hl'Fa>-<’I'vb|/f|^vb>=  C/-./ab>0.  (13) 

7ab  -  [1.89£/27.21  +  =  0.93  eV  ( Mataga-Nishimoto  approximation). 

7ab  thus  plays  an  important  role  in  the  decrease  of  the  effective  U  compared  to  the 
formal  on-site  repulsion. 

In  condensed  phase  there  is  a  stabilization  of  the  ionic  state  due  to  the  polarization 
of  the  medium  which  further  contributes  to  the  decrease  of  the  “effective  U."  We 
may  use  the  Born  expression 


P(r)  = 


2R 


(14) 


where  R  is  the  radius  of  a  sphere  around  the  Ceo  centers  and  z  the  charge.  In  Eq. 
( 3 )  c  is  the  dielectric  constant  of  the  medium  at  infinite  frequency  ( inductive  po¬ 
larization;  e  =  n^)  or  at  zero  frequency  (atomic  polarization ) .  The  energy  difference 
between  the  the  ionic  state  and  the  vb  state  ^“effective  U")  is  now 


A£=  t/- yab  +  2P(«)-F(«-  !)-£(«+  1)=  t/- yab-2P(l).  (15) 

Using  c  =  2.5  and  /?  =  5  A,  we  obtain,  for  exa  nple,  2P(  1 )  equal  to  1.7  eV;  using 
c  =  10,  we  obtain  2F(  1 )  equal  to  2.6  eV.  Consequently,  Af  by  Eq.  (4)  is  slightly 
negative  even  for  inductive  polarization.  These  results  are  consistent  with  a  cal¬ 
culation  of  the  Madelung  energies  [25]. 

Assuming  orthogonality  between  the  orbitals  on  sites  A  and  B,  the  Hamiltonian 
matrix  for  the  symmetric  2-electron-2-site  problem  may  be  written  as  [  1 1  ] 
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1 AE  +  k(q  -I-  qo)- 

Kab 

ilHab  \ 

1 

AE  +  k(q  -  ^o)’ 

V2//ab  . 

(16) 

\  nHab 

^Hab 

kq-  +  Kab! 

Here  //ab  is  the  off-diagonal  matrix  element  for  the  orbitals  (Pa  and  ^b-  A'ab  is  the 
exchange  integral  <  ^a  1 1  /'■12 1  ^b),  which  may  be  neglected  since  the  and  ^b  are 
localized  on  different  Qo  spheres.  For  ^  =  0  the  system  is  symmetric  and  the  matrix 
of  Eq.  (16)  may  be  transformed  to  one  corresponding  to  the  basis  functions  ( 'I'a 
-  'I'b)/V2  and  (^a  +  'I'b)/V2  and  'I'vb: 

^a  -  /AE+kql  0  0  \ 

^a  +  'I'b  0  AE+kql  2//ab  .  (17) 

'I'vb  \  0  2//ab  0  / 

There  are  different  important  cases  for  the  solution  of  Eq.  ( 16),  depending  on  the 
relative  values  of  U  -  2P,  and  Aab.  If  Aab  is  neglected  and  if  V  -  J^b  is  much 

larger  than  2  P,  Sl'vb  will  be  the  ground  state.  On  the  other  hand,  if  2  Z’  is  larger  than 
U  -  Jab;  the  ionic  state  ^'a  +  will  be  the  ground  state.  If  AE  =  0,  the  latter  state 
is  lowered  2|//abl  compared  to  ^a  -  ^b-  The  coupling  determines  the  degree  to 
which  the  singlet  ion  pair  will  localize  on  A  or  B.  Thus  if  |  //ab  I  is  reasonably  large 
there  will  be  a  possibility  for  barrierless  electron  pair  transfer.  In  Cm)  \Hab\  should 
be  about  0. 1  eV,  as  discussed  above. 

Pair  Motion 

In  the  case  of  single-electron  transfer,  different  values  of  the  ratio  coupling  to 
reorganization  energy,  p  =  lllab!  K  lead  to  three  possible  regimes  of  localization, 
essentially  as  outlined  by  Robin  and  Day  [14].  In  particular,  if  p  is  close  to  unity, 
the  electrons  localize  according  to  the  instantaneous  nuclear  positions.  If  p  is  much 
smaller  than  1 ,  trapping  is  possible.  If  p  >  1 ,  delocalization  occurs  as  in  the  Creutz  - 
Taube  5+  complex  [28-30]. 

The  two-elec»ron  case  is  different  due  to  the  appearance  of  the  vb  slates  (singlet 
and  triplet).  The  situation  may  be  described  as  follows: 

Case  I.  AE  is  positive  and  large.  The  ground  state  is  a  singlet  vb  state  with  a  deep 
minimum  for  ^  =  0  [eq.  (16)].  This  is  the  ordinary  case  of  a  strong  covalent  bond, 
for  example,  the  H2  molecule.  The  singlet-triplet  splitting  is  large. 

Case  2.  \\'eakly  bonded  vb  ground  slate.  The  magnetic  coupling  is  equal  to  half 
the  singlet-  triplet  splitting  and  has  nothing  to  do  with  the  coupling  Hab  in  Eq.  ( 16). 

Case  3.  “Negative  U"  compounds  ( AE  <  0 ) .  The  ionic  states  ( also  called  “charge 
density  wave”  states)  are  stable  compared  to  the  vb  stale.  The  energy  minima  for 
the  ground  state  in  Eq.  ( 16)  occur  for  values  close  to  ^  =  ±^0.  Disproportionation 
takes  place,  for  example,  in  BaBiOj,  where  there  are  Bi'*  and  Bi'^  sites  but  no 
Bi'*^  sites.  There  is  a  large  activation  energy  for  transfer  of  the  electron  pair  from 
one  site  to  another  ( ^-0.2  eV  in  BaBiO^).  The  intermediate  vb  type  slate,  corre- 
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spending  to  Bi'*^  sites,  is  high  in  energy,  and  it  is  likely  that  the  top  of  the  energy 
barrier  is  at  ^  =  0.  It  is  also  possible  that  there  is  a  small  depression  for  q  =  0, 
which,  however,  is  higher  in  energy  than  in  the  ionic  minima  close  to  ^  =  ^o- 

Case  4.  The  activation  energy  for  pair  transfer  between  the  sites  tends  to  zero. 
The  direct  coupling  between  and  (^ab)  is  small,  but  there  is  an  indirect 
coupling  between  +  ’J'b  and  the  VB  state,  as  we  see  in  Eq.  ( 17),  which  leads  to 
a  large  energy  splitting  between  ^a  +  ^b  and  ~  ^b-  Thus,  if  the  VB  state  is 
sufficiently  close  in  energy  to  the  ionic  states  and  at  the  same  time  l//abl  is  large, 
the  activation  barrier  for  transfer  between  the  ionic  state  disappears. 

We  have  thus  established  in  a  simple  model  that  four  possible  cases  occur  for  a 
dimeric  two-electron  system.  Case  2  may  be  divided  into  a  singlet  and  a  triplet 
ground  state.  There  is  experimental  evidence  for  cases  1-3.  The  author  is  not  aware 
of  any  binuclear  complex  for  case  4,  although  it  can  probably  be  made  just  as  its 
delocalized  single  electron  counterpart,  the  Creutz-Taube  5+  complex.  We  may 
notice  that  very  subtle  energy  differences  lead  to  very  different  types  of  behavior. 


Conclusion 

If  the  activation  energy  is  due  to  reorganization  energy  when  an  electron  or 
electron  pair  moves  between  sites  the  conductivity  problem  may  be  treated  in  a 
cluster  model.  As  discussed  by  Hush  [29],  a  large  coupling  between  the  sites  reduces 
the  actn  ation  energy  [  5  ] .  This  coupling  has  nothing  to  do  with  the  coupling  which 
determines  the  singlet-triplet  splitting  in  a  binuclear  cluster  but  is  half  the  energy 
separation  between  two  electronic  states  of  the  cluster  of  different  parity  at  an 
avoided  crossing. 

In  the  two-electron  et  case  the  direct  coupling  between  the  diabatic  wave  functions 
corresponding  to  the  ionic  states  is  very  small  [A^ab  in  Eq.  ( 16)].  However,  since 
the  symmetric  ionic  state  interacts  with  the  vb  state,  corresponding  to  one  electron 
on  each  site,  the  pair  transfer  may  occur  activationless  in  the  cases  when  the  vb 
state  has  a  low  energy.  This  is  not  always  the  case,  and  a  good  example  is  provided 
by  BaBiOj.The  existence  of  a  pairing  cannot  by  itself  guarantee  superconductivity, 
and  the  reason  is  simple;  If  the  vb  state  has  a  too  high  energy,  the  electron  pair 
cannot  move  from  one  site  to  another  without  activation.  The  system  is  then  a 
semiconductor  or  an  insulator.  The  existence  of  a  large  band  width  improves  the 
situation,  since  the  larger  |//ab|,  the  larger  is  the  interaction  with  the  vb  state. 

The  situation  is  similar  to  the  one  of  the  Mott  insulators.  If  doped,  the  latter  may 
be  regarded  as  a  mixed  valence  compound  where  the  sites  exchange  electrons  ac¬ 
cording  to  (in  Li20-doped  NiO): 

Ni^-^  +  Ni-'*^  -►  Ni’"  +  Ni’"  , 

The  electron  exchange  follows  the  rules  of  the  second  section.  If  the  coupling  2 1  //^  I 
is  not  large  enough  to  offset  the  reorganization  energy,  according  to  Eq,  (7)  the 
system  will  be  only  nonmetallic,  no  matter  what  the  magnitude  of  the  band  gap  is. 
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Alkali  compounds  of  the  type  A3C60  form  interesting  cases  for  electron  pair  trans¬ 
fer,  particularly  since  the  participating  negative  ions  and  their  interactions  are  well 
defined.  The  energy  surfaces  are  mapped  out  by  the  alkali  ions  moving  in  their 
cavity  sites  and  to  a  smaller  extent  by  the  carbon  atoms  of  Qo  •  The  activation 
barrier  for  one  or  two  electron  transfer  depends  quadratically  on  the  bond-length 
change.  In  A3C60  the  metal  ions  have  different  equilibrium  positions  depending  on 
the  charges  of  the  surrounding  negative  C50  ions  [32].  A  larger  alkali  ion  has  a 
smaller  distance  change  between  the  possible  equilibria  and  hence  more  shallow 
energy  minima  than  a  smaller  one.  Thus  activationless  electron  pair  transfer  is 
more  likely  for  a  larger  ion  than  for  a  smaller  one,  and  this  is  consistent  with 
experimental  data  [19,32,45]. 

We  have  thus  shown  in  a  bicenter  cluster  model  that  the  condition  for  activa¬ 
tionless  pair  transfer  may  in  fact  be  satisfied  in  alkali  and  alkaline  earth  doped  Qo- 
The  extension  to  an  infinite  system  has  not  been  considered,  but,  since  the  band 
width  in  the  systems  studied  is  generally  small,  we  expect  the  cluster  mode!  to  give 
accurate  predictions.  The  natural  way  for  an  extension  to  the  infinite  case  appears 
to  be  to  apply  the  theory  of  antisymmetrized  products  of  geminals  (apg)  [46,47]. 
The  essential  part  of  the  pairing  concept  of  the  Bardeen-Cooper-Schrieffer  (bcs) 
theory  remains  in  the  theory  used  here  with  the  electronic  wave  functions  expressed 
in  configuration  space.  The  lattice  coupling  is  to  modes  which  promote  transfer  of 
an  electron  pair  between  sites. 

The  appearance  of  superconductivity  is  strongly  dependent  on  the  number  of 
valence  electrons  introduced  by  doping  and  the  valence  state  of  the  metal  ion.  A 
condition  for  appearance  of  superconductivity  is  obviously  that  sites  be  available 
so  that  the  electron  pairs  can  move.  Qo  or  K^Qo  are  therefore  unable  to  support 
conductivity  or  superconductivity.  The  reason  is  that  the  hu  and  ri„  levels,  respec¬ 
tively,  are  completely  filled.  It  is  thus  important  that  the  highest  molecular  orbitals 
(MO)  is  close  to  half-filled  on  the  average.  In  the  copper  oxide  superconductors, 
the  3d*-3d configurations  seem  to  give  the  superconducting  state  (whereas  3d’- 
3d’  give  an  antiferromagnetic  state)  and,  in  BaBi03,  the  6s°~6s^  configuration 
[  1 2  ] .  In  the  latter  case,  there  is  a  stabilization  of  the  6s^  configuration  due  interaction 
with  the  6s^  6p^  configuration  (pairing),  which  permits  the  two  electrons  with 

different  spins  to  correlate.  A  parity  doublet  of  the  type  that  exists  in 
tig)  or  BaBiOs  (6s-6p)  gives  a  nice  pairing  mechanism,  but  apparently,  by  comparing 
to  the  Cu  oxide  case,  a  parity  doublet  is  not  a  necessary  condition  for  pairing. 

Finally  we  may  discuss  some  other  suggestions  to  explain  superconductivity  in 
Qc.  Chakravarty  et  al.  [  22  ]  and  Friedberg  et  al.  [  25  ]  have  discussed  pairing  of  the 
same  type  as  discussed  here  and  in  Ref.  [  1 1  ] .  In  particular,  the  latter  work  describes 
in  a  nice  way  the  pairing  in  C^o.  which  indeed  is  of  exactly  the  same  type  as 
discussed  here  and  in  Ref.  [11].  Reference  [  25  ]  discusses  in  great  detail  the  Madelung 
energy  as  well  as  some  other  important  aspects  of  the  superconductivity  problem. 

The  vibronic  coupling  for  the  two-electron  case  has  been  discussed  in  a  detailed 
way  in  Refs.  [15]  and  [16]  for  the  two-site  case  and  in  Ref.  [48]  in  the  four-site 
case.  In  the  present  paper,  we  have  used  a  simplified  approach  where  the  nuclear 
motion  is  treated  classically. 
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We  have  shown  that  there  is  a  strong  internal  correlation  effect  in  the  negative 
ions  but  not  in  €50  itself,  and  we  believe  that  the  magnitude  of  correlation  in  C^o 
is  quite  unimportant.  Schluter  et  al.  studied  the  Jahn-Teller  distortions  in  the  Hg 
state  [49].  The  conclusion  that  -43Q0  is  not  a  “negative  U"  compound  is  disagree¬ 
ment  with  the  results  obtained  here  and  in  Refs.  [1 1,22,25].  Baskaran  and  Tosatti, 
on  the  other  hand,  have  considered  the  possibility  of  a  “negative  C/”  state  and  also 
stressed  the  importance  of  correlation  effects;  but  the  numerical  values  of  their 
parameters  are  different  from  ours,  and  they  have  given  no  details  regarding  the 
character  of  the  correlation  effects  [  "’3].  Zhang  et  al.  have  discussed  a  model  [24] 
which  to  some  extent  resembles  the  one  of  the  present  paper  [11,12].  They  also 
consider  the  motion  of  the  alkali  ions  to  be  important  to  induce  the  electronic 
motion. 
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Abstract 

In  this  work,  ah  milio  coupled  and  uncoupled  Hartree-Fock  polanzabihtv  calculations  are  carried  out 
on  finite  and  infinite  polyacetylene  chain  models  We  describe  tbe  polymeric'  procedures  which  are 
extensions  to  infinite  systems  of  the  molecular  Sum  Over  Slates  (sos)  and  Random  Phase  Approximation 
t  RPA )  methods.  The  latter  docs  include  the  field-induced  electron  rcorganizational  effects  whereas  the 
former  docs  not.  The  inclusion  of  these  effects  is  particularly  important  in  computing  the  longitudinal 
component  of  the  dipole  polarizability  tensor  of  poly  acetylene  chains  at  an  accurate  level  since  the  ratio 
of  the  coupled /uncoupled  results  is  between  2  and  4  for  the  conjugated  systems  studied.  In  order  to 
evaluate  the  asymptotic  longitudinal  polanzabilitics  per  unit  cell  we  have  used  both  finite  oligomeric 
calculations  followed  by  extrapolation  procedures  and  direct  polymeric  methods  The  difficulty  of  using 
the  extrapolation  procedures  due  to  the  unknown  analytical  form  of  the  evolution  of  the  longitudinal 
polanzability  per  unit  cell  illustrates  the  convenience  of  using  our  direct  method  treating  infinite  polymeric 
systems.  We  obtain  a  good  agreement  between  our  infinite  chain  results  and  the  largest  oligomeric  values 
We  also  discuss  the  choice  of  atomic  basis  set.  the  possibility  of  using  scaling  factors,  and  the  crucial  role 
of  using  a  correct  bond  length  alternation  value.  ;c  I993  John  Wiley  &  Sons,  Inc 

Introduction 

Since  the  discovery  of  very  large  nonlinear  optical  responses  associated  with 
conjugated  organic  systems,  a  lot  of  theoretical  studies  have  been  undertaken  in 
order  to  predict  the  optical  properties  and  to  understand  the  structure-property 
relationship  of  these  materials  [1],  Due  to  electron  delocalization,  the  component 
of  the  polarizability  tensor  directed  along  the  backbone  of  conjugated  systems  is 
enhanced  [2],  As  is  the  case  for  mechanical  properties,  where  the  values  obtained 
for  a  copolymer  do  not  generally  correspond  to  the  values  that  are  the  weighted 
sum  of  their  homopolymer  contributions,  in  increasingly  large  oligomers  the  lon¬ 
gitudinal  polarizability  increases  exponentially  until  a  linear  evolution  is  reached. 
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That  corresponds  to  an  increase  of  the  polarizability  per  unit  cell  until  a  saturation 
point  is  reached.  This  saturation  point  corresponds  to  the  polarizability  per  unit 
cell  of  the  infinite  periodic  system.  The  knowledge  of  the  value  at  this  saturation 
point  would  be  interesting  for  the  design  of  new  promising  materials  for  optics. 

The  slow  convergence  behavior  of  the  longitudinal  polarizability  per  unit  cell 
with  respect  to  increasing  the  chain  length  of  conjugated  systems  makes  this  a 
quantity  difficult  to  attain.  A  first  approach  to  obtain  this  asymptotic  value  is  to 
follow  the  evolution  of  the  longitudinal  polarizability  in  larger  and  larger  oligomers 
and  then  to  extrapolate  to  the  infinite  polymer  limit.  An  alternative  procedure 
consists  of  the  direct  calculation  of  the  asymptotic  value  by  performing  the  calcu¬ 
lation  on  the  polymer  system,  i.e.,  taking  advantage  of  its  translational  symmetry. 
As  Kin  man  has  said  [3];  “Finite  oligomers  are  most  effective  for  the  treatment  of 
local  features,  while  crystal  orbitals  are  best  for  delocalized  features".  Due  to  its 
slow  convergence  for  conjugated  systems,  the  longitudinal  polarizability  corresponds 
to  a  delocalized  feature  and  ideally  requires  the  use  of  a  direct  method  to  obtain 
the  asymptotic  value  per  unit  cell.  This  is  particularly  the  case  for  ab  initio 
procedures,  in  contrast  to  semi-empirical  calculations,  because  for  the  former,  the 
computational  treatment  of  very  large  oligomers  is  beyond  of  reach. 

The  aims  of  this  work  are  ( i )  to  describe  a  general  and  suitable  way  of  computing 
the  asymptotic  values  of  the  polarizabilities  per  unit  cell  of  infinite  periodic  systems, 
including  the  correct  choice  of  geometrical  parameters  and  mastering  the  atomic 
basis  set  effects,  (ii)  to  compare  the  finite  oligomer  procedure  followed  by  various 
extrapolation  techniques  with  the  direct  method  tor  computing  the  asymptotic 
polarizability  per  unit  cell  of  infinite  systems,  and  (iii)  to  provide  a  good  strategy 
to  estimate  the  polarizabilities  per  unit  cell  of  polymeric  systems  correctly.  The 
extended  tt  conjugated  systems,  polyacctylene  chains,  are  used  as  example  due  to 
their  prototypical  nature  and  the  important  optical  responses  they  exhibit. 

Our  work  is  organized  as  follow: 

•  Firstly,  we  summarize  the  previous  calculations  of  the  polarizabilities  of  in¬ 
creasingly  large  polyacetylene  chains  made  either  at  an  empirical,  semi-em- 
pirical  or  ab  initio  level.  This  summary  defines  by  itself  the  difficulties  associated 
with  the  calculation  of  the  asymptotic  polarizability  values  of  stereoregular 
conjugated  polymers  at  an  acceptable  level  of  accuracy. 

•  In  the  following  part,  the  coupled  and  uncoupled  procedures  to  compute  the 
polarizability  of  finite  and  infinite  systems  are  briefly  described.  In  particular, 
we  remind  the  reader  of  basic  equations  of  the  Random  Phase  Approximation 
( RPA )  method  applied  to  stereoregular  polymers.  The  Summations  Over  States 
(SOS)  method  is  presented  as  an  approximation  of  the  RPA  procedure. 

•  The  results  of  computations  on  polarizabilitiis  of  finite  polyacetylene  chains 
are  presented.  The  bond  length  alternation  parameter  Ar  is  demonstrated  to 
play  an  important  role  in  computing  the  lon^Hudinal  component  of  the  po¬ 
larizability  tensor.  We  compare  the  suitability  of  different  functional  form  for 
extrapolating  to  the  infinite  chain  limit.  We  discuss  also  the  choice  of  the 
atomic  basis  set  and  the  role  of  scaling  factors  in  correctly  estimating  the  po¬ 
larizabilities. 
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•  Finally,  taking  advantage  of  the  translational  symmetry  of  polyacetylene,  wc 
perform  band  structure  calculations  and  compute  directly  the  sos  and  rpa 
longitudinaf  polarizabilities  per  unit  (%11.  We  compare  these  values  with  the 
polarizabilities  obtained  from  the  finite  oligomer  calculations. 

This  work  is  in  fact  the  third  article  of  a  series  on  polarizability  calculations  of 
finite  and  infinite  polyacetylene  chains:  the  first  article  employed  the  simple  non¬ 
iterative  Hiickel  .approach  [41  by  using  Pariscr-Parr-Pople  Hamiltonians,  the 
electron-electron  Coulomb  interactions  arc  taken  explicitly  into  account  in  the 
second  work  [5].  In  this  work,  we  treat  all  electrons  in  an  ab  initio  Hariree-Fock 
procedure. 

Summary  of  the  Previou.<(  Polarizability  Calculations  of  Polyacetylene  Chains 
Hiickel 

The  first  studies,  performed  at  the  Hiickel  level  on  regular  chains  (all  bond  lengths 
are  the  same),  predict  the  Ctoluiion  of  the  longitudinal  polarizability,  a^.,  to  be 
proportional  to  [6]  or  [7]  with  N  the  number  of  carbon  atoms.  Hence, 
the  longitudinal  polarizability  per  unit  cell  of  the  infinite  chain  is  infinite.  By  con¬ 
sidering  a  nonzero  bond  length  alternation  (Ar),  a  numerical  asymptotic  stabili¬ 
zation  of  the  lor.gitudinal  polarizabilities  per  unit  cell  appears  a^.  the  chain 
grows  [4,8]. 

Semi-Empirical  and  Parametrized  Methods 

In  the  next  step,  the  electron-electron  Coulomb  interactions  arc  taken  explicitly 
into  account  firstly  by  using  Pariser-Parr-Pople  (PPP)  Hamiltonians.  This  results 
in  a  decrease  of  ««  of  one  order  of  magnitude  [  4,9-12  J.  For  the  oligomers  ranging 
from  2  to  20  carbon  atoms,  de  Melo  and  Silbey  [10]  derived  an  expression  for  the 
evolution  of  the  longitudinal  polarizability  per  carbon  atom  with  resjpcct  to  N,  the 
number  of  carbon  atoms:  a«/A^  =  4.63yv®”  which  does  not  include  any  stabilization 
of  a„/A^  as  increases.  However,  work  by  Soos  and  Hayden  [11]  took  care  of  this 
stabilization  and  by  extrapolation  of  the  curve  representing  the  inverse  of  the  lon¬ 
gitudinal  polarizability  per  site  (A^/a,,)  with  respect  to  the  inverse  of  the  number 
of  sites  ( 1  /N),  they  found  an  asymptotic  value  of  a„/A^  for  N  equal  to  infinity  to 
be  equal  to  50.0  a.u.  A  recent  work  of  Viilesuzanne  et  al.  [12]  concerned  chains  as 
long  as  200  — CH =CH  —  units  and  predicted  asymptotic  o„/A^  values  ranging 
from  43.7  a.u.  to  283.7  a.u.  per  double  bond  depending  upon  the  PPP  parametri- 
zation  used.  More  recently,  we  computed  the  asymptotic  longitudinal  polarizability 
per  unit  cell  within  PPP  approximation  using  the  rpa  procedure  applied  to  infinite 
periodic  systems  [5].  By  using  parameters  which  provide  correct  excitation  energies 
for  the  polyene  chains,  we  evaluated  cttilH  to  be  139.1 !  a.U.  per  double  bond. 

At  the  semi-empirical  Intermediate  Neglect  of  Differential  Overlap  (INOO)  level 
of  approximation,  Kirtman  [  13]  computed  the  a„  of  finite  polyene  chains  containing 
up  to  62  carbon  atoms  and  determined  after  extrapolation  an  asymptotic  value  of 
1  lO.S  a.u.  per  double  bond.  Kurtz  used  the  Finite  Field  method  [14]  with  different 
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semi-empirical  hamiltonians  (MNDO,  AMl,  and  pm3)  to  compute  the  average  po¬ 
larizabilities  of  oligomers  up  to  C40H42  [  1 5  ] .  His  extrapolated  average  polarizabilities 
per  unit  cell  are  56.28,  61.00.  and  54.15  a.u.  with  the  MNDO,  AMl,  and  PM3  par- 
ametrizations,  respectively.  In  addition,  from  Valence  Effective  Hamiltonian  (veh) 
calculations  of  the  averaged  a  value  defined  there  as  l/3(axx  +  «rr)>  neglecting 
thus  the  perpendicular  a,  ,,  component,  the  saturation  point  for  a  has  been  calculated 
by  Shuai  and  Bredas  to  be  approximately  1 16.1  a.u.  [16]. 

The  results  of  these  semi-empirical  and  parametrized  calculations  demonstrate 
the  large  sensitivity  of  the  computed  polarizabilities  to  the  parametrizations  em¬ 
ployed  and  thus  the  need  to  use  more  suitable  ab  initio  calculations  which  are, 
however,  more  difficult  to  carry  out  on  large  oligomers. 

Ab  initio  Calcidations 

The  first  ab  initio  work  on  extended  chains,  realized  by  Bodart  et  al.  [2]  with 
the  minimal  STO-3G  atomic  basis  set  focused  on  the  strong  influence  of  Ar  on  the 
longitudinal  polarizability.  By  following  the  extrapolation  procedure  of  Hurst  et 
al.  [17],  their  asymptotic  a 2:  per  double  bond  are  105  a.u.  and  170  a.u.  for  Ar 
equal  to  0.  lOA  and  0.05 A,  respectively.  Such  an  extrapolation  procedure  appeared 
to  alleviate  the  difficulty  to  perform  ab  /n/f/o  polarizability  calculations  on  sufficiently 
large  oligomers  to  reach  the  saturation  point  of  In  an  important  Coupled 
Perturbed  Hartree  Fock  (cphf)  [18]  study,  Hurst  et  al.  [17]  determined  the  po¬ 
larizabilities  of  polyene  chains  ranging  from  C5Hg  to  C22H24  using  the  minimal 
STO-3G,  extended  6-3 IG,  6-3 IG*,  and  6-3 IG  +  pd  atomic  basis  sets.  These 
authors  made  a  least  square  fit  of  the  polarizability  values  to  the  equation 
“log(a/A^)  =  a  +  b/N  +  cjN^"  so  that  the  extrapolated  value  is  given  by  10“. 
They  obtained  extrapolated  asymptotic  values  of  38.9,  60.3,  57.0,  and  60.8  for  the 
average  polarizability  and  105.9,  163.3,  154.8,  and  160.5  for  the  longitudinal  po¬ 
larizability  using  the  STO-3G,  6-3 IG,  6-3 IG*,  and  6-3 IG  +  PD  basis  sets,  respec¬ 
tively.  Their  geometrical  parameters  were  optimized  in  the  C2h  symmetry  using  the 
6-3  IG  atomic  basis  set.  The  parameters  of  the  central  unit  cell  of  C22H24  are:  dc=c 
=  1.338  A,  dc-c  =  1-450  A,  dc-u  =  1  077  A,  ac=c-c  =  124.3°,  uc.-.c-h  =  1 19.2°. 


Hartree-Fock  Methods  to  Compute  the  Coupled  and  Uncoupled  Polarizabilities 
of  Finite  and  Infinite  Chains 


Molecular  Procedures 

At  the  Hartree-Fock  level,  the  electronic  states  of  closed-shell  systems  are  Slater 
determinants  constructed  from  doubly  occupied  molecular  orbitals  represented  by 
linear  combinations  of  atomic  orbitals.  The  most  common  methods  to  compute 
the  polarizabilities  of  molecules  work  in  the  space  of  the  atomic  orbitals  [19]. 

In  the  Summation  Over  States  (sos)  approach,  based  on  the  standard  time- 
independent  perturbation  theory,  the  polarizability  tensor  is  the  second-order  term 
in  the  perturbation  expansion  of  the  energy  as  a  power  series  in  the  applied  external 
electrical  field  [  20  ] .  The  expression  for  the  different  components  of  the  polarizability 
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is  given  in  terms  of  the  unperturbed  wavefunctions  and  energies.  Commonly,  one 
uses  a  Hartree-Fock  ground  state  together  with  its  singly  excited  determinants  for 
the  excited  states.  Hence,  an  element  of  the  polarizability  tensor  reads  as  [21]: 

«Ar  =  4  2,  2,  -  (  1  ) 

,  a  Cl 

where  the  </>,  and  0a  represent  doubly  occupied  and  unoccupied  molecular  orbitals 
respectively,  e,  and  Ca  their  corresponding  orbital  energies. 

The  CPHF  [18]  procedure  is  equivalent  to  an  analytical  differentiation  of  the  field- 
dependent  Hartree-Fock  equation.  A  similar  procedure,  called  Finite  Field  (FF) 
[14],  consists  in  the  numerical  differentiation  of  the  dipole  moment  with  respect 
to  the  applied  external  electric  field.  In  this  procedure,  the  field-induced  electron 
reorganizational  effects  are  included  in  a  way  which  is  fully  consistent  in  terms  of 
adjustments  in  the  average  two-electron  interactions.  For  this  reason  it  is  described 
as  a  coupled  Hartree-Fock  procedure  whereas  the  SOS  approach  is  said  to  be  un¬ 
coupled. 


Polymer  Procedures 

In  restricted  Hartree-Fock  theory,  the  many-electron  wave  functions  of  the  closed- 
shell  one-dimensional  periodic  systems  are  .approximated  by  Slater  determinants 
which  are  constructed  from  doubly-occupied  crystalline  orbitals.  These  crystalline 
orbitals  are  single-particle  states  defined  in  the  Linear  Combination  of  Atomic 
Orbitals  (lcao)  formalism  by: 

""  7/;  =7  2  2  Cp„{k)Xp(7  -  Rp-jae,) 

V(2A+l);_x  p., 

=  r)  (2) 

where  n  and  k  are  the  band  index  and  the  wave  vector  or  quasi  momentum  associated 
with  the  particle,  respectively.  Thus,  two  indices  label  a  crystalline  orbital. 
(2A^  +  I)  is  the  (odd)  number  of  unit  cells  considered  {N  -*■  x)  in  the  band 
structure  calculation  that  corresponds  also  to  the  number  of  ^-states  in  one  band 
or  to  the  periodicity  [(2iV  -F  1  )a]  of  the  crystalline  orbitals  imposed  by  the  Bom- 
Karman  cyclic  boundary  conditions.  1  /  V2iV  -l-  1  is  then  the  normalization  factor, 
a  the  unit  cell  length,  and  u„(k,  r )  is  a  function  having  the  same  periodicity  as  the 
direct  lattice.  The  Cp„{k)  terms  are  the  A:-dependent  lcao  coefficients.  Xp{r  -  Rp 
-  jac:).  abbreviated  to  Xj,(T),  is  the  pth  atomic  orbital  centered  in  the 7th  unit 
cell.  In  practice,  these  Xj,(r)  functions  are  chosen  to  be  contractions  of  Gaussian 
functions,  e;  is  the  unit  vector  in  the  periodicity  direction.  The  polymeric  LCAO 
coefficients  Cp„{k)  and  their  associated  energies  c„(k)  are.  respectively,  the  eigen¬ 
vectors  and  eigenvalues  of  the  matrix  equation: 


F{k)C{k)  =  Sik)C{k)c(k) 


(3) 
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where  F{A:)  and  S(^■)  are  the  ^-dependent  Fock  and  overlap  matrices.  The  standard 
theory  of  band  structure  calculations  is  described  in  several  articles  [22]  and  is 
reviewed  in  two  recent  monographs  [23]. 

Recently,  we  have  shown  the  difficulty  to  evaluate  the  longitudinal  polarizabilities 
per  unit  cell  of  infinite  periodic  systems  with  the  Finite  Field  and  CPHF  procedures 
[  5  ] .  This  is  due  to  the  unbounded  character  of  the  dipole  moment  operator.  A  first 
method,  developed  by  Otto  [24],  only  considers  the  periodic  part  of  the  dipole 
moment  operator  and.  following  the  procedure  established  by  Ladik  [25].  consists 
of  deriving  the  field-dependent  crystal  orbitals  which  lead  to  the  evaluation  of  a--l 
N.  The  approach  we  follow  uses  the  RPA  procedure  to  circumvent  the  problem  of 
the  unbounded  character  of  the  dipole  moment  operator.  This  method,  based  on 
propagator  techniques  [26],  is  equally  efficient  at  computing  the  static  polarizability 
and  the  frequency-dependent  polarizabilities  and  may  be  adapted  to  include  electron 
correlation  at  the  desired  level.  Though  working  differently,  the  rpa  procedure 
gives  identical  results  to  the  Finite  Field  and  Coupled  Perturbed  Hartree-Fock 
methods.  Since  two  labels  characterize  the  polymeric  one-particle  states,  in  second 
quantization,  the  operator  manifold  for  rpa  applied  to  periodic  infinite  systems  is 
then  given  by: 

{T^}  =  {Q\Q}  =  (4) 


where  a*(a)  is  a  creation  (annihilation)  operator.  The  subscript  labels  a,b, . . .  (/, 
y , . . .)  correspond  to  unoccupied  ( occupied )  bands.  Thus  the  particle-hole  operator 
Qlkik  ’  that  acts  on  the  ground  state  Slater  determinant  replaces  the  occupied  crystal 
orbital  0,  (A')  by  the  virtual  crystal  orbitals  4>a{k).  By  partitioning  the  operator 
space  into  0and  and  by  removing  spin  from  the  expression,  since  we  are  dealing 
with  the  dipole  moment  operator  r  which  is  a  singlet  operator,  the  frequency- 
independent  longitudinal  polarizability  of  a  closed-shell  infinite  periodic  system  is 
given  by: 


«;..(rpa)  =  2(Q* 


(5) 


where  the  asterisk  (*)  indicates  the  complex  conjugated  values.  According  to 
Blount’s  procedure  [27].  the  z-dipole  matrix  elements  are  defined  by  the  following 
integration  which  is  only  non  zero  for  A  =  A’: 


(0„(A')|.-|0,(A))  =  /5u'(h„(A)  ^  H,(A)j  =  /5AA'fi«,(A). 


(6) 


This  allows  us  to  take  only  the  vertical  transitions  into  consideration.  The  dipole 
transition  strengths  n„,(  A)  are  defined  by  a  space  integration  of  the  product  between 
ii*{k)  and  the  gradient  according  to  A  of  the  function  u,{k).  Barbier  et  al.  [28] 
provided  an  analytical  expression  of  fiaifA)  and  more  recently,  we  published  a 
better  numerical  form  [29]  for  computation.  The  other  different  matrix  elements 
are  defined  by: 
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(A{k,  =  (Caik)  -  e,{k))6,^,jd„,-  +  2(4,Ak)Mk)\4>j(k')Mk'))  (7) 

-  {Mk)<i>h(k')\4>j(k'U,{k)) 

Chemist’s  or  Mulliken  notation  has  been  chosen  to  define  the  two-electron  integrals 
between  crystalline  orbitals: 

{<l>a(k)<f,.(k)\4>Ak')Mk')) 

=  JJ  dTf  dr2(l>t{k,  'rt)<t>,(k.  r,) 72)<i>h{k',  r2).  (9) 

A  recent  study  demonstrates  the  nice  converging  behavior  [30]  of  the  different 
direct  lattice  sums  involved  in  the  evaluation  of  these  two-electron  integrals  between 
crystalline  orbitals.  The  first  step  consists  of  solving  the  following  linear  equation 
system: 


fAX  +  BY  =  fi 
1b*X  +  A*Y  =  fi*.. 


The  matrices  A.  B,  Q  are  of  infinite  dimension  since  there  are  an  infinite  number 
( 2A'  +  1 ,  A’  -►  X )  of  A- values  and  thus  an  infinite  number  of  A-states  in  each  band. 
Moreover,  there  is  an  equation  for  each  triplet  formed  by  a  A-value  and  two  band 
indices.  This  triplet  represents  a  particle-hole  excitation  that  is  vertical  in  order  to 
preserve  the  momentum.  As  is  the  case  in  many  polymeric  techniques,  the  infinite 
sum  over  k  is  transformed  into  an  integration  in  the  first  Brillouin  zone: 


(2A+  l)fl 
2ir 


occ  unocc  rw/a 

1  1 

j  h 


{A(k,k'))a.MX,Ak') 


+  (B{k,  k')),,,,Y,Ak')  dk'  =  n„,(A) 


(2N  +  na 


2'k 


OCC  unwc  rir/'’ 

-22  {B*(k,k'))a..„X„(l<') 

,  h 


+  {A*{  k.  k'))a,MjY„{k')  dk'  =  (11) 


By  partitioning  A(A,  A')  into  Ao(A,  A')  and  A|(A,  A')  according  to: 


(^o(A,  A'))„,,,,  =  Mk)  -  e,{k))6a^,5(k  -  A')  (12) 

(.-1,(A,  A'))„,,,„  =  2(<l>Jk)Mli)\<t>Ak')Mk’))  -  {4>Ak)Mk')\(l>Ak’)<t>Ak)) 

(13) 


where  we  have  used  the  relation  between  the  discrete  Kroneker  and  continuous 
Dirac  delta  functions: 
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5aa-. 


2ir 

(2N+  l)a 


d{k- 


k'). 


(14) 


Finally,  the  linear  equation  problem  takes  the  following  fonn: 

(2A^+  l)i 

■  Z.  . 

j  /> 


(2^^+ 

(Ca{k)  -  e,(k))Xa,ik)  +  - - - - 2  Z  {Ai(k,  k'))a,,i,jXt„(k') 

Ik  ,  h 


I 


+  {B(k,  dk’  =  «„,(*) 

occ  unocc  (*w/a 


(2N  r 

(ea{k)-e,{k))Yo,(k)  +  ^  .  2  2  (B*(k,k'))a.Mk') 

Zir  t  j-x/a 


+  (Af(k,  k’)),,.fyy/>Ak')  dk'  =  Q%(k).  (15) 


At  this  stage,  trapezoidal  quadrature  is  used  to  obtain  \{k')  and  Y  (A').  The  second 
step  consists  of  the  following  scalar  product  that  provides  the  longitudinal  polar¬ 
izability  of  the  polymer: 


a....(RPA)  =  2(n*{A') 


(16) 


Again,  the  infinite  sum  over  k  is  replaced  by  integration  in  the  first  Brillouin  zone. 
This  straightforwardly  provides  the  longitudinal  polarizability  per  unit  cell: 


«,,(RpA)  y  ( r  ° 
(2A'-t-  1)  "  7  [J-x/. 


■  unocc 

-  2  ^l{k’)X,.ik')  +  naAk')Ya,{k') 


(17) 


By  making  the  approximation  where  matrix  B  is  zero  and  A  is  restricted  to  Ao, 
the  matrices  become  diagonal,  every  coupling  between  the  crystal  orbitals  disappears 
and  the  polarizability  reads  as: 


a-.-(SOS)  ^  ^  f  f'/"  Ala.{k’)\' 

(2N+  1)  7  7  cAk')-eAk') 


(18) 


This  is  the  expression  of  the  uncoupled  Hartree-Fock  SOS  polarizability  per  unit 
cell  [29).  As  in  this  case,  there  is  no  coupling  between  the  different  vertical  tran¬ 
sitions,  the  coupling  and  thus  the  matrices  Ai  and  B  are  responsible  for  the  field- 
induced  electron  reorganizational  effects.  The  polaiizabilities  per  unit  cell  are  ob¬ 
tained  by  using  a  post-scF  calculation  that  takes  place  after  the  band  structure 
calculation  effected  by  the  plh  program  [31). 


Ab  initio  Application  to  Polyacetylene  Chains 

When  interested  in  a  quantitative  prediction  of  the  polarizability  value,  correct 
geometrical  parameters  have  to  be  used,  especially  the  bond  length  alternation 
which  plays  a  crucial  role  in  influencing  polarizability.  In  the  work  of  Hurst  et  al. 
[17),  Ar  is  equal  to  0.1 12  A.  This  is  larger  than  the  experimentally  determined 
value  of  0.08  A  [32)  and  creates  a  decrease  of  the  polarizability.  As  explained  in 


HARTREE-FOCK  POLARIZABILITY  CALCULATIONS 


675 


the  work  of  Villar  et  al.  [33],  the  use  of  other  atomic  basis  sets  does  not  decrease 
the  bond  length  alternation  up  to  0.08  A.  In  fact,  Ar  is  equal  to  0. 100  A  [  34] ,  0. 1 1 5 
A  [35].  and  0.127  A  [33]  by  using  a  8s4s/4s.  closely  related  4-3 IG,  and  6-3 IG** 
atomic  basis  sets,  respectively.  Such  large  Ar  is  due  to  the  lack  of  electron  correlation 
in  the  Hartree-Fock  method.  Recently,  Suhai  [  36  ]  optimized  the  geometry  of  po¬ 
lyacetylene  by  using  crystal  orbital  calculations  including  correlation  energy  up  to 
the  second  order.  Its  use  of  an  extended  basis  set  augmented  with  polarized  functions 
provided  a  Ar  value  close  to  experiment:  0.082  A.  Firstly,  in  order  to  obtain  the 
more  reliable  polarizability  values,  we  used  the  parameters  of  Suhai  represented  in 
Figure  1  for  a  unit  cell.  Secondly,  to  highlight  the  importance  of  a  correct  choice 
of  Ar,  geometrical  parameters  obtained  at  the  Hartree-Fock  level  with  the  3-2 IG 
atomic  basis  set  are  also  used.  In  this  system,  represented  in  Figure  1,  Ar  is  equal 
to  0.123  A. 

Oligomer  Calculations  and  Extrapolation  Procedures 

The  GAUSSIAN  90  [37]  program  was  used  to  calculate  the  components  of 
polarizability  tensor  of  the  a-  and  b-type  increasingly  large  polyacetylene  chains 
using  the  minimal  sto-3G  [ 38 ],  split-valence  3-21G  [39],  and  extended  6-3 IG** 
[40]  atomic  basis  sets.  The  polarizabilities  evaluated  by  the  coupled  and  uncoupled 


Figure  I.  3D  representation  of  the  polyacetylene  chains.  The  bond  length  and  bond 
angles  of  the  unit  cell  are  expressed  in  A  and  respectively.  The  optimizations  have  been 
earned  out  at  the  (a)  mp2  level  with  extended  +  polarized  basis  set  by  Suhai  [361.  (b) 
Hartree-Fock  level  with  the  split  valence  3-2 1 G  atomic  basis  .set. 
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Table  I.  Longitudinal  (o..-)  and  average  (a)  polarizabilities  in  a.u  of  increasingly  large  polyacetylene 
chains  computed  at  the  sos  level  with  the  STO-3G.  3-21G.  and  6-3IG**  atomic  basis  sets  (1  a.u.  of 
polarizability  ^  I  648  10“*'  C^m^J"'  =  0.1482  A’).  The  (a)  geometrical  parameters  have  been  used 

(see  Fig.  1 ). 


N 

STO-3G 

3-2 IG 

6-3  IG** 

«« 

a 

Of.- 

a 

a 

1 

15.38 

9.64 

24.72 

15.88 

24.88 

17.73 

2 

42.40 

22.91 

60.25 

35.02 

61  08 

38.33 

3 

78.94 

39.30 

107.89 

58.01 

108.07 

62.53 

4 

P.1.78 

57.83 

163.55 

83.69 

162.38 

89.18 

5 

168.45 

11.64 

224.42 

III. 11 

221.32 

mil 

6 

217.33 

98.19 

288.51 

139.60 

283.01 

146.4<J 

7 

267.42 

119.14 

354.50 

168.72 

346.30 

176.13 

8 

318.15 

140.31 

421.58 

198.21 

410.46 

206.06 

9 

369.20 

161.58 

489.27 

227.91 

10 

420.41 

182.91 

557.29 

257.71 

II 

471.71 

204.26 

625.49 

287.57 

12 

523.05 

225.63 

693.78 

317.46 

Hartree-Fock  procedures  are  listed  in  Table  1  to  IV  for  the  longitudinal  component 
and  the  average  value  invariant  upon  rotation.  Figures  2  to  5  represent  the  evolution 
of  the  polarizability  per  unit  cell  expressed  by  [  a::{N)  -  «....( V  -  I )] .  This  expression 
provides  faster  [15]  convergence  for  the  longitudinal  polarizabilities  per  unit  cell 
than  the  more  frequently  used  relative  polarizability  [au/V]  although  for  both 
formulae,  the  asymptotic  values  are  the  same.  As  already  mentioned  elsewhere  [5], 
the  CPHF/SOS  or  coupled/uncoupled  ratio  of  the  longitudinal  polarizability  is  largei" 


Table  II.  Longitudinal  (a,,)  and  average  (a)  polarizabilities  in  a.u  of  increasingly  large  poly  acetylene 


chains  computed  at  the  sos  level  with  the  STO-3G,  3-21G.  and  6-31G**  atomic  basis  sets.  The  (b) 
geometrical  parameters  have  been  used  (see  Fig.  1). 

N 

STO-3G 

3-2IG 

6-3  IG** 

(Xl2 

a 

«« 

a 

a-. 

a 

1 

14.80 

9.05 

23.37 

15  18 

23  83 

16.98 

2 

37.36 

20.79 

54.99 

.32.77 

55.78 

35.98 

3 

67.30 

34.78 

95.55 

53.22 

95.77 

57.62 

4 

101.26 

50.20 

141.80 

75.59 

141  01 

80.98 

5 

137.51 

66.38 

191.39 

99.07 

189.0it 

105.32 

6 

175.09 

83.00 

243.01 

123.23 

238.85 

130.24 

7 

213.33 

99.85 

295.74 

147.76 

289.48 

155.47 

8 

251.84 

1 16.73 

34900 

172.39 

340.42 

180.72 

9 

290.22 

133.61 

402.22 

197.08 

to 

328.67 

150.54 

455.58 

221.83 
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Table  III.  Longitudinal  («....)  and  average  (a)  polarizabilities  in  a  u  of  increasinglv  large  polyacetylene 
chains  computed  at  the  cphf  level  with  the  STO-3G.  3-21G.  and  6-31G**  atomic  basis  sets.  The  (a) 
geometrical  param  ;ters  have  been  used  (see  Fig.  I ). 


A’ 

STO-3G 

3-2  IG 

6-3  !G** 

O' 

a 

a.. 

rt 

1 

17.42 

11.19 

28.72 

19.04 

30.16 

20.91 

2 

51.84 

27.26 

76.83 

42.55 

80.03 

46.03 

3 

105.57 

49.79 

150.67 

74.5; 

155.98 

79.73 

4 

176.79 

78.19 

248.78 

114  72 

256.39 

121.61 

5 

262.77 

111.56 

368.59 

162.15 

378.55 

170.78 

6 

360.65 

148.94 

507.08 

215.87 

519.32 

226.20 

7 

467.82 

189.45 

601.18 

274  83 

675.58 

286.83 

8 

582  13 

232.36 

828.10 

338.10 

844.50 

351.71 

9 

701.85 

277.10 

1005.39 

404.87 

10 

825.67 

323.21 

1191.02 

474.44 

II 

952.59 

370.38 

1383.33 

546  26 

12 

1081  85 

418.33 

1580.99 

619.88 

for  the  less  alternating  and  more  conjugated  system.  For  the  transversal  and  per¬ 
pendicular  directions,  the  calculated  values  are  nearly  identical.  Since  the  major 
component  of  the  polarizability  is  the  longitudinal  component  due  to  the  electron 
delocalization  along  the  chain,  the  rpa  procedure  is  more  adapted  to  give  the 
suitable  estimates  of  the  polarizabilities  of  the  most  interesting  compounds.  More¬ 
over,  the  use  of  the  split-valence  3-2 IG  atomic  basis  set  considerably  increases  the 
polanzability  values  obtained  with  the  minimal  STO-3G  basis  set  whereas  the  ad¬ 
dition  of  polarization  functions  do  not  bring  any  significant  improvement. 


Table  IV.  Longitudinal  (oj;)  and  average  (a)  polarizabilities  in  a.u.  of  increasingly  large  polyacetylene 
chains  computed  at  the  cphf  level  with  the  STO-3G.  3-2 IG.  and  6-3 IG**  atomic  basis  sets.  The  (b) 
geometrical  parameters  have  been  used  (see  Fig,  1 ). 


STO-3G  3-2  IG  6-31G** 

CPHF  -  -  - 


N 

(t 

or 

or,. 

« 

1 

17.11 

10.67 

30.73 

I8..34 

29.15 

19.15 

2 

47.24 

25.26 

73.55 

40.11 

74.33 

43..36 

3 

92  29 

44.71 

136.69 

68.53 

139.01 

73.20 

4 

149  90 

68.35 

218.20 

103.02 

222.36 

109.05 

5 

216.99 

95.21 

314.84 

142.55 

320.37 

149.70 

6 

291.38 

124.52 

423.78 

186.19 

430.46 

194.70 

7 

371.10 

155.63 

541.34 

232.73 

549.74 

242,68 

8 

454.08 

187.85 

666.72 

282.21 

674.93 

292.61 

9 

539.44 

220.87 

797.32 

333.09 

10 

626.70 

254.54 

932.15 

385  42 
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Eigurc  2.  Evolution  of  the  sos  longitudinal  polarizability  per  unit  cell  (in  a.u.)  in  in* 
creasingly  large  polyacetylene  chains  with  dilTcrcnt  atomic  basis  sets.  The  geometry  (a) 

has  been  used. 


As  depicted  in  Figures  2  to  5,  the  convergence  of N)  —  a~z(  ~  • )  is  reasonably 
rapid  using  the  SOS  method,  whereas  it  is  very  slow  by  using  the  cphf  technique. 
Hence,  in  the  case  of  the  coupled  Hartrce-Fock  technique,  oligomeric  calculations 
are  not  sufficient  to  get  the  asymptotic  value  and,  firstly,  extrapolation  procedures 
arc  used  to  alleviate  it.  These  extrapolation  procedures  consist  in  making  a  least- 
squares  fit  of  the  polarizability  per  unit  cell  values  to  an  analytic  function  able  to 
present  a  stabilization  behavior  when  increasing  the  chain  length.  A  \/N  power 
scries  representation 


Rgurc  3.  Evolution  of  the  SOS  longitudinal  polarizability  per  unit  cell  (in  a.u.)  in  in- 
crcaringly  large  polyacetylcnc  chains  with  different  atomic  basis  sets.  The  geometry  (b) 

has  been  used. 
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Nianber  of  unit  oellt 

Figure  4.  Evolution  of  the  CPHF  longitudinal  polarizability  per  unit  cell  (in  a.u.)  in  in¬ 
creasingly  laige  polyacetylene  chains  with  dilTetent  atomic  basis  sets.  The  geometry  (a) 

has  been  used. 


az.-(iV)-a„(iV- l)  =  a  +  ^  +  ~  (19) 

has  been  proposed  by  Kirtman  et  al.  [13,41]  to  fit  the  longitudinal  polarizability 
per  unit  cell,  while  Hurst  et  al.  [17]  used  the  same  expansion  to  fit  the  logarithm 
of  the  polarizability  per  unit  cell.  Thus,  the  asymptotic  extrapolated  values  are  given 
by  a  and  10“,  respectively,  when  TV  =  oo .  In  these  procedures,  b  is  generally  negative 
whereas  c  is  positive  [17].  The  slow  decaying  behavior  of  the  1  /N  function  maintains 
important  the  contribution  of  the  blN  term  for  large  values  of  TV.  Therefore,  the 


0  2  4  (  t  10  12 

Number  of  anil  onlU 


Figure  5.  Evolution  of  the  CPHF  longitudinal  polarizability  per  unit  cell  (in  a.u.)  in  in¬ 
creasingly  large  polyacetylene  chains  with  different  atomic  basis  sets.  The  geometry  (b) 

has  been  used. 
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value  of  a  is  overestimated.  Adding  a  fourth  term  is  the  l/N  power  series  repre¬ 
sentation  to  provide 

-  a,AN  -  =  «  +  A  +  ^  +  ^  (20) 

as  fitting  function  does  not  reduce  the  asymptotic  value  greatly,  and  some  values 
even  increase.  The  asymptotic  extrapolated  values  of  the  cphf  longitudinal  polar¬ 
izabilities  per  unit  cell  obtained  by  the  procedures  of  Kirtman  et  al.  [13,41]  and 
Hurst  et  al.  [17]  with  a  four-term  l/N  power  series  representation  are  listed  in 
Table  V.  Following  Kirtman’s  experience  [3],  we  have  used  at  least  twice  as  many 
theoretical  data  points  as  optimized  parameters  to  perform  the  most  accurately  the 
least-squares  fittings.  The  imprecision  on  these  extrapolated  values  corresponds  to 
their  fluctuations  by  removing  the  d/N^  term  in  the  fitting  function.  The  largest 
sensitivity  of  the  extrapolation  procedure  in  the  case  of  the  most  polarizable  systems 
(smallest  Ar)  using  the  most  suitable  method  (cphf  which  takes  into  account  the 
field-induced  electron  reorganizational  effects)  explicitly  shows  the  need  of  using 
polymeric  methods  in  order  to  avoid  dealing  with  extremely  large  systems  but  still 
want  to  investigate  correctly  the  field  of  polymers  for  optics.  In  order  to  take  into 
account  more  correctly  the  decaying  behavior  of  the  difference  between  the  asymp¬ 
totic  longitudinal  polarizability  per  unit  cell  of  the  infinite  system  and  these  of  the 
finite  oligomers,  we  have  used  a  function  like: 

azziN)  -  a.AN  -  1)  =  a  -  (21 ) 

possessing  an  exponential  decreasing  behavior.  Table  V  shows  that  this  results  in 
smaller  extrapolated  values  than  with  the  previous  procedures. 

Polymer  Calculations 

Using  the  STO-3G,  3-2 IG,  and  6-3 IG**  atomic  basis  sets,  we  have  performed 
band  structure  calculations  with  the  plh  program  [  3 1  ] .  A  particular  feature  of  the 
PLH  program  consists  in  the  inclusion  of  the  long-range  coulombic  interactions 
which  are  accounted  for  via  a  multipole  expansion  technique  including  all  the 
monopole-quadrupole  and  dipole-dipole  interactions.  The  number  of  unit  cells  in 


Table  V.  Asymptotic  extrapolated  longitudinal  polanzabilities  per  unit  cell  of  polyacetylene  in  a.u. 
using  various  extrapolation  procedures  (see  text). 


Ar 

Basis  set 

Kirtman  et  at.  [13,41] 

Hurst  etal.  [17] 

This  work 

0.082  A 

STO-3G 

147.5  ±  2.6 

139.7  ±  10.1 

136.5 

3-2  IG 

261.8  ±  4.8 

247.9  ±  14.7 

220.7 

6-3  IG” 

274.5  ±16.1 

326.8  +  8.9 

244.3 

0.123  A 

STO-3G 

108.0  ±  1.6 

102.7  ±  1.2 

91.3 

3-2  IG 

188.4  ±  3.6 

186.0+  0.9 

151.6 

6-3  IG” 

192.9  ±  8.3 

215.3  ±  2.0 

158.6 
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Table  VI.  Asymptotic  values  of  the  energy  per  unit  cell,  uncoupled  (sos),  and  coupled  (RPa)  longitudinal 
polanzabilities  per  unit  cell  in  a.u.  of  the  infinite  polyacetylene  chain  having  geometry  (a). 


EIN 

a  JN  (SOS) 

a../N  {rpa) 

STO-3G 

-75.945636 

51.37 

132.6 

3-2  IG 

-76.458416 

68.38 

/ 

6-3 1G« 

-76 891349 

65.27 

/ 

the  short  and  intermediate  range  regions  are  21  and  45,  respectively  in  order  to 
ensure  proper  converging  results.  In  the  band  structure  calculation,  we  set  the 
threshold  for  the  two-electron  integrals  at  10“’  a.u.  The  criterion  for  convergence 
on  the  density  matrix  elements  is  fixed  at  10~*.  In  the  rpa  calculations  of  the 
asympxotic  longitudinal  polarizabilities  per  unit  cell,  we  restricted  the  short-  and 
intermediate-range  regions  [30]  to  13  and  25  and  to  7  and  13  for  the  minimal 
ST03G  and  split- valence  3-2 IG  atomic  basis  sets,  respectively.  The  number  of  k- 
points  in  the  first  Brillouin  _one  has  been  fixed  at  2 1 .  These  restrictions  make  our 
code  viable  which  at  this  preliminary  stage  requires  an  enormous  amount  of  disk 
space  and  memory.  However,  as  discussed  in  a  recent  article  [30],  such  choices  of 
the  parameters  lead  only  to  a  decrease  of  0.5  to  1 .0%  of  the  asymptotic  polymeric 
longitudinal  polarizabilities  per  unit  cell. 

The  asymptotic  values  of  the  energy  per  unit  cell,  the  coupled  and  uncoupled 
polarizabilities  per  unit  cell  are  given  in  Tables  VI  and  VII  for  both  types  of  geo¬ 
metrical  parameters.  As  demonstrated,  the  sos  polymeric  values  are  very  close  to 
the  largest  oligomeric  values  that  you  could  compute  from  results  of  Tables  1  and 
II.  The  coupled  Hartree-Fock  polymeric  values  may  be  compared  to  the  extrapolated 
results  of  Table  V.  The  agreement  is  better  with  the  extrapolated  values  obtained 
from  fitting  with  the  exponential  decaying  function  than  with  the  1  /V  power  series. 
All  these  results  highlight  the  nice  extrapolation  property  of  our  direct  SOS  and  rpa 
methods  and  in  consequence  their  convenience  to  investigate  the  field  of  polymers 
presenting  large  optical  responses. 

Basis  Set  Effects  and  Derivation  of  Scaling  Factors 

For  quantitative  predictions  of  the  polarizability  values  by  using  ab  initio  pro¬ 
cedures,  an  important  point  is  to  master  the  basis  set  deficiency  which  leads  to  an 


Table  VIL  Asymptotic  values  of  the  energy  per  unit  cell,  uncoupled  (sos),  and  coupled  (rpa)  longitudinal 
polarizabilities  per  unit  cell  in  a.u.  of  the  infinite  polyacetylene  chain  having  geometry  (b). 


EIN 

aJN  (SOS) 

a-.JN  (RPA) 

STO-3G 

-75.947424 

38.47 

89.4 

3-2  IG 

-76.459727 

53.46 

150.1 

6-3 1G« 

-76.892465 

51.08 

/ 
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Table  Vlll  Influr-nce  of  the  atomic  basis  set  on  the  electnc  dipole  polanzabihty  tensor  components 
of  ethylene,  trans-butadiene,  and  all-trans  hexatriene  in  atomic  units  computed  at  the  sos  level  and 
comparison  with  experimental  data.  The  geometrical  parameters.  [..)  have  been  used. 


STO-3G 

3-21G 

6-3 1G‘* 

Sadlej 

Exp. 

Ethylene 

<»« 

11.54 

17.81 

19.42 

23.64 

28.26  [45] 

2.00 

5.11 

8.89 

21.05 

26.10(45] 

Oj.- 

15.38 

24.72 

24.88 

28.21 

34.26  [45] 

a 

9.64 

15.88 

17.73 

24.30 

28.47  [45] 

Trans-butadiene 

Ctxx 

22.38 

34.88 

36.77 

43.75 

“» 

3.83 

9.93 

17.14 

37.43 

42.40 

60.25 

61.08 

66.81 

a 

22.87 

35.02 

38.33 

49.33 

58.23  [46] 

All-trans  hexatriene 

Ctxt 

33.32 

51.44 

54.16 

63.69 

5.64 

14.71 

25.37 

53.52 

a.s 

78.94 

107.89 

108.07 

115.84 

a 

39.30 

58.01 

62.53 

77.68 

underestimation  of  the  polarizabilities,  particularly  in  the  directions  where  the  system 
is  not  extended.  This  problem  may  be  alleviated  by  using  linear  scaling  factors 
between  minimal  and  extended  basis  sets  and  the  experimental  values  as  it  was 
proposed  by  Chablo  and  Hinchliffe  [42]  and  already  used  by  Bodart  et  al.  [43]  for 
unsaturated  hydrocarbon  chains. 

We  compare  in  Tables  Vlll  and  IX  the  polarizability  components  of  ethylene, 
trans-butadiene,  and  all-trans  hexatriene  computed  with  atomic  basis  sets  ranging 


Table  IX.  Influence  of  the  atomic  basis  set  on  the  electric  dipole  polarizability  tensor  components  of 
ethylene,  trans-butadiene,  and  all-trans  hexatnene  in  atomic  units  computed  at  the  cphf  level  and 
comparison  with  experimental  data.  The  geometneal  parameters  (a)  have  been  used. 


STO-3G 

3-2 IG 

6-3 IG” 

Sadlej 

Exp. 

Ethylene 

13.52 

22.47 

23.68 

27.65 

28.26  [45] 

2.64 

5.93 

8.89 

25.15 

26.10(45] 

17.42 

28.72 

30.16 

34.59 

34.26  [45] 

a 

11.19 

19.04 

20.91 

28.60 

28.47  [45] 

Trans-butadiene 

25.14 

39.98 

42.02 

48.19 

«» 

4.81 

10.84 

16.04 

38.38 

0-. 

51.84 

76.83 

80.03 

89.70 

rt 

27.26 

42.55 

46.03 

58.76 

58.23  [46] 

All-trans  hexatnene 

36.83 

57.39 

60.16 

68  00 

6.70 

15.65 

23  04 

52  69 

rt.. 

105.57 

150.67 

155  98 

172.48 

a 

49.79 

74.57 

79.73 

97  72 
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from  the  minimal  sto-3G  to  the  split-vaicnce  3-3 IG,  extended  6-3 IG**  and  the 
medium-polarized  basis  set  of  Sadlej  [44]  constructed  to  provide  accurate  estimates 
of  dipole  moments  and  electric  dipole  polarizabilities.  The  well-known  fact  of  the 
poor  estimation  of  the  perpendicular  polarizability  is  again  obvious.  Only  the  me¬ 
dium-size  polarized  basis  set  of  Sadlej  provides  a  very  good  agreement  with  the 
experimental  data.  Therefore,  they  may  serve  as  reference  value  in  the  determination 
of  scaling  factors.  It  is  important  to  highlight  that  the  orientation  of  the  molecules 
is  the  same  as  the  orientation  of  the  polymeric  systems:  in  the  case  of  the  Suhai 
geometrical  parameters,  the  double  bonds  are  inclined  by  29.02°  with  respect  to 
the  longitudinal  periodicity  axis. 

As  the  a(3-21G)/a( Sadlej)  ratio  is  converging  by  going  from  ethylene  to  all- 
trans  hexatriene,  acceptable  scaling  fac  ors  may  be  extracted  from  the  CPHF  results 
on  all-trans-hexatriene.  These  scaling  factors  are  0.84, 0.30,  and  0.87  for  the  trans¬ 
versal,  perpendicular,  and  longitudinal  components  of  the  dipole  polarizability  ten¬ 
sor,  respectively.  In  consequence,  we  are  able  to  scale  our  3-2 IG  values  of  the 
polyacetylene  chain  presenting  a  0.082  A  bond  length  alternation  in  order  to  provide 
a  better  estimate  of  the  exact  polarizabilities  per  unit  cell.  When  N  tends  to  infinity, 
the  values  for  ax^/W,  ctyJN,  and  axe.  21.9  a.u.,  16.0  a.u.,  and  253.7  a.u., 
respectively. 


Summary  and  Conclusions 

In  this  article,  we  have  computed  ab  initio  coupled  and  uncoupled  Hartree-Fock 
asymptotic  polarizabilities  per  unit  cell  of  finite  and  infinite  polyacetylene  chains. 
We  have  illustrated  the  convenience  of  using  direct  method  treating  infinite  polymers 
to  obtain  these  asymptotic  values  with  respect  to  finite  oligomeric  calculations 
followed  by  extrapolation  procedure.  Indeed,  such  extrapolation  procedures  are 
problematic  because  the  correct  form  of  the  longitudinal  polarizability  per  unit  cell 
according  the  number  of  unit  cells  is  not  known.  Therefore,  these  procedures  are 
very  sensitive  to  the  analytical  form  of  the  fitting  function  and  to  the  appearance 
or  not  of  the  saturation  pattern  in  the  largest  oligomers  treated.  Based  on  the  po¬ 
larization  propagator  techniques,  the  RPA  method  used  is  an  extension  to  infinite 
periodic  systems  of  the  molecular  procedure,  taking  advantage  of  the  translational 
symmetry  of  the  polymer.  The  SOS  method  is  a  major  approximation  of  the  RPA 
procedure,  and  does  not  take  into  account  the  field-induced  electron  reoiganizational 
effects  included  in  the  latter.  In  addition,  the  choices  of  atomic  basis  set  and  geo¬ 
metrical  parameters  have  been  discussed  and  shown  to  be  .‘ry  important  to  provide 
quantitative  estimates  of  the  polarizabilities. 

At  this  stage,  the  coupled  procedure  is  still  restricted  to  prototype  systems.  How¬ 
ever,  suitable  computational  procedures  exist  which  can  be  applied  to  improve  our 
code  enabling  access  to  the  coupled  polarizabilities  of  compounds  of  greater  practical 
interest  [47]. 

Future  steps  in  the  development  of  methods  to  compute  the  polarizabilities  of 
infinite  periodic  systems  would  involve  including  the  vibrational  contributions  to 
the  polarizability,  taking  into  account  the  electron  correlation  effects  and  treating 
the  frequency-dependent  character  of  the  polarizability. 
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Abstract 

A  simple  model  Hamiltonian  is  used  to  calculate  the  limit  of  zero-voltage  resistance  in  a  one-dimensional 
chain  of  one-state  sites.  The  end  atoms  in  the  chain  are  considered  to  be  chemisorbed  to  the  surface  of 
the  metal  electrodes  and  their  interaction  with  the  continuum  of  metal  states  (jellium  model)  is  treated 
as  in  the  Anderson-Newns  chemisorption  theory  11,2). 

For  this  model,  an  exact  solution  can  be  obtained  for  the  zero-voltage  resistance  within  the  formalism 
developed  by  Doyen  and  co-workers  [3]  for  the  calculation  of  the  current  in  Scanning  Tunneling  Mi¬ 
croscopy  (STM ).  Applications  of  the  results  described  in  this  article  to  the  problem  of  molecular  imaging 
in  STM  are  presented  elsewhere  [4,5].  ©  1993  John  Wiiey  &  Sons.  Inc 

Introduction 

The  problem  of  interpreting  STM  images  in  the  long  distance  regime,  where  the 
distance  between  the  tip  and  the  surface  is  larger  than  5  A,  is  reasonably  well 
understood,  at  least  for  clean  metal  and  semiconductors  surfaces  [6].  In  this  regime, 
the  dominant  forces  are  of  the  van  der  Waals  type  and  the  current  is  well  described 
within  the  Tersoff-Haman-Bardeen  theory  [  7,8  ].  The  use  of  first-order  perturbation 
theory  seems  to  be  well  justified  when  the  separate  existence  of  two  quantum  sub¬ 
systems,  e.g.,  the  tip  and  the  surface,  can  be  assumed.  Such  an  assumption  implies 
a  weak  overlap  of  the  wave  functions  characterizing  the  two  quantum  subsystems. 

Under  the  assumptions  described  above  it  is  possible  to  derive  an  expression  for 
the  tunnel  current  which  depends  on  the  surface  density  of  states  (SDOS)  at  the 
Fermi  level  evaluated  at  the  position  of  the  tip  [7].  The  STM  image  would  then  be 
just  a  contour  map  of  the  SDOS.  This  interpretation  however  cannot  give  account 
of  a  number  of  STM  images  of  clean  surfaces  and  adsorbed  species.  This  indicates 
the  need  for  a  non-perturbative  approach  to  calculate  the  current  in  the  strong  tip- 
surface  interaction  regime.  This,  in  turn,  requires  considering  the  tip,  sample  and 
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electrodes  as  a  single  quantum  system  whose  electronic  structure  must  be  determined 
as  part  of  the  problem  [4, 5, 9-1 1], 

In  this  article  we  use  the  non-perturbative  theory  developed  by  Doyen  and  co¬ 
workers  [  3, 1 2  ] .  The  STM  current  is  viewed  as  a  scattering  process,  and  the  Lippman- 
Schwinger  equation  is  employed  to  calculate  the  scattering  amplitude.  A  Green’s 
function  technique  is  used  to  calculate  the  electronic  structure  and  the  one-electron 
density  of  states  (DOS)  of  the  system.  Specifically,  we  assume  that  the  electrode- 
tip-molecule-electrode  system  is  described  by  a  finite  one-dimensional  tight-binding 
model  for  the  molecule  and  the  tip,  and  a  continuum  of  one-electron  states  for  the 
metal  electrodes.  The  model  also  takes  account  of  the  interaction  of  the  chemisorbed 
atoms  with  the  continuum  of  metal  states  by  using  the  Newns-Anderson  model  of 
chemisorption  [1,2],  which  for  our  purposes  amounts  to  considering  a  semi-elliptical 
shape  for  the  metal  band.  A  similar  approach,  but  within  the  context  of  density 
functional  theory  and  the  Tersoff-Haman  model  for  the  current,  has  been  used  by 
Lang  [13-15]. 

This  simple  model  has  the  adv.intage  of  being  exactly  solvable  in  terms  of  the 
model  parameters:  the  orbital  energies  and  the  interaction  potentials  between  sites 
in  the  tip-molecule  chain  and  between  the  end  sites  and  the  electrode  surfaces. 
This  could  in  principle  allows  us  to  gain  some  physical  insight  into  a  very  complex 
problem.  An  application  of  this  model  to  STM  imaging  of  organic  samples  which 
are  bulk-insulators  has  been  described  elsewhere  [4.5]. 

The  Model 

The  system  we  aim  to  describe  consists  of  two  semi-infinite  metallic  electrodes, 
an  atomic  tip  and  a  molecule.  The  tip  is  in  contact  with  one  of  the  surfaces  and 
the  molecule  is  assumed  to  lay  between  the  tip  and  the  other  surface. 

The  tip  and  the  molecule  are  considered  as  a  collection  of  one-particle-state  sites, 
coupled  through  a  nearest-neighbor  interaction,  so  that  only  the  first  and  last  sites 
in  the  molecule-tip  chain  interact  directly  with  the  surfaces.  This  description  cor¬ 
responds  essentially  to  a  tight-binding  (Hiickel)  model  with  site-to-site  hopping. 

We  denote  the  site-states  by  the  set  { |  a);  a  =  1.2. _ A' } ,  and  the  corresponding 

energies  and  occupation  numbers  by  { E".  n„  \  a  =  1.  2.  ....  A}.  To  keep  the 
treatment  as  general  as  possible,  we  do  not  make  an  explicit  distinction  between  a 
tip-like  site  and  a  molecular-like  site:  in  our  model  they  differ  only  in  the  values  of 
E'!,  and  coupling  potentials. 

The  molecular  sites  can  be  thought  of  as  a  simple  approximation  to  the  actual 
chemical  units,  whose  electrons  are  intervening  in  the  conduction  process,  be  they 
atoms,  bonds,  or  a  combination  of  both. 

The  two  surfaces  are  described  within  the  jellium  model  [  1 6  ] .  ( a  non-interacting 
electron  gas  with  a  positive  continuous  background).  The  one-particle  states  have 
energies  and  occupation  numbers  denoted  by  [  £"(/).  «,(!)} '  where  i  and  /  are 
continuous  indices  that  correspond  to  states  1  k, )  and  |  k, )  of  electron  momentum 
k,  and  k/.  respectively. 

The  Hamiltonian  for  the  non-interacting  surface-tip-molecule-surface  system, 
within  our  model,  is  then: 


GREEN’S  FUNCTION  CALCULATION 
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AT 

//"=  2«,£r+ 2  2n/£:/.  d) 

I  a»I  / 


H"  is  diagonal  in  the  orthonormal  basis  { la),  |k, ),  |k/) } . 

The  interaction  between  these  sub-units  is  given  as  a  sum  of  hopping  terms: 


N 

ys\a]a,  +  2  K.a+ial.a„+i  +  2  kVsflla;  +  complex  conjugate  .  (2) 

/  a=  I  j 

where  a]  and  a,  are  ordinary  fermion  creation  and  annihilation  operators,  Fsi, 
Vss  stand  for  the  matrix  elements  of  F  in  (2)  between  sites  1  (N)  and  the  corre¬ 
sponding  surfaces  ( assumed  independent  of  jellium  state ) ,  and  V„,„+ 1  is  the  matrix 
element  of  V  taken  between  adjacent  sites  in  the  chain.  These  matrix  elements 
represent  the  strength  of  the  coupling  between  the  different  subunits  of  our  model. 
The  Hamiltonian  is  then  simply  written  as 

//  =  //"+ K.  (3) 


STM  Current 

Following  the  treatment  of  Doyen  and  co-workers  [3,12],  the  current  is  viewed, 
within  a  single-particle-picture,  as  a  scattering  process.  An  electron  incident  from 
the  interior  of  the  metal-tip  escapes  from  the  barrier  and  has  a  certain  probability 
of  scattering  through  the  sample  molecule,  into  the  metallic  substrate  where  it  is 
detected. 

Let  1  /  )  =  I  k, )  be  the  initial  state  of  the  tunneling  electron,  |  /+  )  the  scattering 
state,  and  I/)  =  | k/)  the  final  state  where  the  electron  is  detected.  li)and  |/) 
are  eigenstates  of  //",  whereas  |/-F)  is  an  eigenfunction  of  the  full  Hamil¬ 
tonian  (3). 

The  probability  p(/)  that  an  electron  with  momentum  k/  is  detected  at  time  I  is: 

/?(0=  K/l /+>!'.  (4) 

The  current  can  be  expressed  as: 

J(0  =  =  \(fli  +  )l'  .  (5) 

According  to  Lippman’s  generalization  of  Ehrenfest's  theorem  [  1 7, 1 8  ] ,  the  current 
in  the  limit  of  low  applied  voltage  fV,  can  be  written  as 

J  =  — ^2  l<./'|I’|/+>|-6(^r-£f)5(A-,-£,).  (6) 


where  the  sum  runs  over  all  the  states  between  the  Fermi  levels  (E/  )  of  the  two 
surfaces.  The  resistance,  R,  in  the  zero-voltage  limit  is  then  R  =  W/j. 
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Evaluation  of  the  Matrix  Elements 

Our  starting  point  is  the  Lippman-Schwinger  equation: 

|/+>=  |/>  +  G‘>K|/+>,  (7) 

where  is  the  Green’s  function  corresponding  to  the  non-interacting  Hamiltonian 

(1) 

G'’(2)  =  (2-HT',  (8) 

z  being  a  complex  variable.  In  the  basis  of  eigenstates  of  H°,  this  operator  is  diagonal 
with  diagonal  matrix  elements  given  by: 

(7^(z)  =  <n|G‘’|n>  =  — ,  (9) 

2  —  Jb„ 

where  |n)  is  any  site  or  jellium  state. 

A  resolution  of  the  identity  can  be  written  as: 

1  =  2  l/></l  +  2  i;><7l  +  2  l«><«l .  (10) 

I  J  Of"  I 

By  using  the  orthonormality  condition,  and  inserting  the  resolution  of  the  identity 
twice  in  Eq.  (7),  one  obtains: 

</!(+>  =  I /■+>.  (11) 

We  now  make  use  of  the  facts  that  0°  is  a  diagonal  operator,  that  the  operator 
Fin  (2 )  has  non-zero  matrix  elements  only  between  adjacent  sites,  and  that  jellium 
states  are  non-interacting,  to  obtain: 

</!/+>  =  (?;F^5<iV|/+>.  (12) 

One  can  use  the  Lippman-Schwinger  equation  again,  and  the  conditions  described 
above  tc,  obtain: 

<A|/+>  =  (?^{F;v(^-i,<A-  l|/+>  +  2:  ^Vs</|/+>},  (13) 

/ 

where  there  are  now  two  terms  because  site  N  interacts  with  both  the  surface  and 
site  {N-  1 ).  By  inserting  expression  ( 12)  in  ( 13),  one  gets: 

(14) 


where 


l  =  i:GfV%s.  (15) 

N  f 


In  a  similar  way,  we  obtain: 


GREEN’S  FUNCTION  CALCULATION 


691 


{N- l\i+}  =  G%-^{VN.nN-2)(N-2\i+)  +  V^!,_^^^(N\i+)}  .  (16) 


By  inserting  (14)  into  (16),  one  obtains; 


<iV-  l|/+>=- 


GN-iG^VNiN-l) 

I -Gj/l 

N 


(N-2\i+y. 


By  introducing  Eq.  ( 17)  in  ( 14),  one  arrives  at: 

</!/+> - 2|,+  )  .  (18) 

N 

This  procedure  can  be  repeated  to  obtain  the  matrix  element  (A^  -  2|/+),  so 
that  (/I  /■+)  is  finally  expressed  solely  in  terms  of  the  parameters  of  the  model. 

For  the  sake  of  simplicity,  and  to  show  the  final  form  of  the  equations,  let  us 
consider  the  case  N  =  2,  for  which  one  obtains: 


</l'+>  = 


GfV2sGWnG-{Vs, 


G<iG"2Vh 

\-G^l 

2 


By  redefining  on-site  renormalized  Green’s  functions  as: 


1  -G?Z 


1  - 


1  -  G?  Z  ’ 


Eq.  ( 19)  can  be  written  as: 

</!/+>  =  G?V2sGW^2G1Vs,  .  (23) 

Eq.  (23)  can  be  readily  generalized  for  arbitrary  N  as: 

</|/-^>  =  G>’F)v5CX-Ka.-,;,GX.-,F^_2^-,-  •  -GTK^,  , 
where  the  definition  of  G)'  is  as  in  Eq.  (22)  and 


(24) 
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"  \-G%l 

N 

G'k 

1  -  GkGk^,Vl^,t  ’ 


1  ^  a:<a^. 


Once  the  matrix  element  (/|/+)  is  known,  the  t-matrix  element  (/|  K|/-t-)  of 
Eq.  (6)  can  be  obtained  by  using  once  again  the  Lippman-Schwinger  equation: 


</|K|/+>  = 


</!'+> 


For  computational  purposes  it  is  convenient  to  use  the  diagonal  representation 
of  the  bare  Green’s  function,  Eq.  (9),  to  recast  Eqs.  (25)  and  (26)  as: 


Gk(E)  = 


E-E%-1 

N 


Gk(E)  = 


E  -  Ek  -  Gk.iVi^.r 


\  £K<N. 


The  case  N  =  1  does  not  conform  to  definitions  (28)  and  (29).  We  include  it 
for  completeness: 


</|K|/+>  = 


E-  E‘{- 1-1’ 

R  L 


A'*  1  . 


where  R  and  L  stand  for  the  “right”  and  “left”  surfaces. 

The  calculation  of  the  self-energy  contribution  2  at  is  performed  following  Newns 
[2]  as: 


where 


2  (c)  =  A(c)  -  /A(c) , 


A(c)  =  2i8^(l  -c^)'/^ 


|e|  <  1 

A(c)=  i  2fl^[c  +  (e^- I)'/^]  c<-l.  (33) 

20^[c-ie^-  l)*'^]  e  >  1 

The  parameter  /3  is  proportional  to  the  surface-site  interaction  potential,  i.e.,  /3  = 
VsK  and  c  is  a  dimensionless  quantity  such  that  the  metal  band  width  is  2  in 
units  of  e. 
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All  the  approximations  leading  to  Eqs.  (31 ),  (32),  and  (33)  are  described  in 
Newns’s  article.  It  suffices  to  quote  here  that  although  they  are  obtained  for  a  one¬ 
dimensional  tight-binding  model  one  can  also  assume  that  for  a  three-dimensional 
jellium  system,  the  metal  DOS  has  a  semi-elliptical  shape,  in  which  case  the  same 
set  of  equations  is  obtained. 

It  is  thus  evident  that  Eqs.  (24)  and  (27)  allow  us  to  express  the  current  solely 
in  terms  of  the  parameters  of  the  model;  the  site-site  and  site-surface  interaction 
potentials,  the  site  energies  and  the  relative  position  of  the  Fermi  levels  of  th  •  two 
surfaces.  The  metal  eigenfunctions  enter  only  through  the  functions  and  Zi . 

Density  of  States 

In  addition  to  the  current  itself,  it  is  of  importance  in  the  description  of  the 
electronic  transport  process  to  be  able  to  calculate  the  local  DOS.  In  a  one-electron 
approximation,  as  the  one  we  are  employing  here,  the  availability  of  states  in  every 
position  of  the  chain  at  a  value  of  energy  of  the  incoming  electron  equal  to  the 
Fermi  energy  of  the  metal,  is  a  local  requirement  for  conduction  to  take  place. 

The  DOS  at  site  i  is  given  by  [191; 

P,(£)  =  --lm[(7,(£)],  (34) 

TT 

where  G,(£)  is  the  on-site  projected  Green’s  function; 

G,(£)  =  </|G(£)|/>.  (35) 

In  order  to  calculate  the  total  Green’s  function  for  the  system  we  start  from 
Dyson’s  equation; 

G  =  G"+G"FG.  (36) 

The  calculation  of  the  matrix  elements  G,  goes  along  a  similar  line  as  the  cor¬ 
responding  one  for  the  current.  We  illustrate  it  for  the  =  2  case  from  which  the 
general  expression  can  then  be  derived. 

From  (36)  and  using  the  fact  that  the  potential  connects  only  nearest-neighbor 
sites,  and  that  metal  eigenstates  are  non-interacting,  we  obtain; 


Gii  -  G'l'ip  +  Fj:G2i  +  2  1  uGm  (37) 

G22  =  G'lsj^l  +  F:iG,2  +  2  F2/G,2  (38) 

G21  =  G22  F21G11  +  2  (39) 

/ 

G|2  =  G'n  F12G22  +  2  £uGa2  (40) 

k 

Gy, -Gy,F,2G2,  (41) 

G/2  ~  G'jiVfiG^i  (42) 

Ga2  =  Gii’*KA,G,2  (43) 

G*,  =  GIVING,,.  (44) 
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By  substituting  (41 )  in  (39)  one  obtains: 

^  GizViiGu 

O21  -  ■;  „  ■ 

i  —  "2^22 

Subsequent  substitution  of  (44)  and  (45)  in  (37)  renders: 

G  gn 

A  similar  procedure  can  then  be  followed  to  obtain: 

G  g22 

''  ,  V'uGiiGh  .  ■ 

r-s.G?. 

For  arbitrary  N,  the  on-site  projected  Green’s  function  has  the  form: 


_ _ Hi _ 


A'  ’ 


2+1  “2+1— A' 


where  n,-i^i  and  fi;+i-.Arare  defined  as: 


^^2-1  — I  ~ 


Vj.2.j.,GUGf-2  1  -  Vj.2,,. ,  G% ,  GU  ^2-2- 1 
1  -  VU.J-2GUGU 


V]iG'iG^ 

1  -  G?2, 


^^;+l— A'  “ 


^j*\,]+lG1+\G%2  1  —  I^+|,^+2g2’+|g7+2^2+2-*Al 

1  -  F,^2.2+3g2‘'+2g;+3 


(50) 


.  _  ^ ^n-\.nG%-\G% 

1  “  Gs'^n 

Some  Limiting  Cases 

In  order  to  verify  the  exactness  of  our  results  for  the  resistance  and  the  local  DOS, 
it  is  of  importance  to  include  some  comparisons  to  results  which  have  been  obtained 
independently  for  both  quantities. 

Sharvin  Point  Contact  Limit  and  the  Value  of  Resistance  in  Resonance 

This  is  the  limit  of  quantum  resistance  corresponding  to  point  contact  between 
two  atoms.  It  has  both  been  observed  experimentally  in  STM  measurements  as  a 
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function  of  distance  [20]  and  derived  theoretically  [21,22]  to  be  /j/ 2  =  12.9  kf2 

(tt  in  atomic  units).  This  is  precisely  the  value  we  obtain  when  a  molecular  energy 
is  in  resonance  with  the  energy  of  the  detected  in-coming  electron.  This  energy  is 
equal  to  the  Fermi  energy  of  the  metal  which  we  can  set  arbitrarily  equal  to  zero 
as  measured  by  the  dimensionless  variable  c  in  Eqs.  (32)  and  (33).  We  illustrate 
below  the  calculation  for  =  1,  but  in  fact  the  result  is  independent  of  N.  For  N 
odd,  the  resistance  has  the  same  value  as  for  N  =  1 .  For  N  even,  the  resistance  is 
slightly  larger.  This  result  is  in  agreement  with  the  well-known  fact  that  a  tight- 
binding  Hamiltonian  has  an  energy  spectrum  which  is  symmetric  around  the  non¬ 
interacting  orbital  energies  [19].  For  N  odd  there  is  one  state  with  energy  exactly 
equal  to  that  value.  If  the  non-interacting  orbital  energies  are  chosen  equal  to  zero, 
then  the  chains  with  odd  will  always  meet  the  resonance  condition,  whereas  those 
with  even  N  will  have  their  orbital  energies  displaced  from  zero. 

It  is  also  interesting  to  notice  that  the  independence  of  the  current  on  the  length 
of  the  chain  is  in  agreement  with  the  well-known  result  that  the  transmission  coef¬ 
ficient  for  tunneling  in  a  one-dimensional  system,  evaluated  at  resonance,  is  in¬ 
dependent  of  barrier  thickness  [23-25].  We  should  also  add  that  the  result  is  in¬ 
dependent  of  the  strength  of  the  site-site  and  site-surface  coupling,  as  long  as  these 
potentials  are  chosen  symmetrically  with  respect  to  the  center  of  the  chain. 

Figure  1  shows  the  resistance  at  resonance  as  a  function  of  N,  the  upper  and 
lower  values  correspond  to  even  and  odd  N,  respectively. 

In  order  to  show  how  the  Sharvin  limit  is  achieved,  we  take  the  case  N  =  \. 
From  Eq.  (30)  we  obtain: 


Figure  I .  Resistance  (a.u)  at  resonance  versus  number  of  sites  in  the  chain.  The  upper 
and  lower  values  correspond  to  even  and  odd  N,  respectively. 
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\(f\V\i+\\^- 

IV  1  [i.'-£-»_Re(2)]^  + [lm(2)]^’ 

(51) 

where 

2  —  2^  +  22,. 

(52) 

At  resonance,  one  has: 

l</|I'U+>l  {iin(Sjj)-Hm(2z.)]^‘ 

(53) 

Using  the  identity: 

lim  .  =  W  1  iir6{x) , 

,.-0*  X-  ly  \xj 

(54) 

we  obtain  the  result: 

3 

M 

II 

1 

1 

(55) 

and  similarly: 

Im(2i)  =  -^2  5(£:-  . 

(56) 

I 


By  carrying  out  the  double  summation  in  (6),  and  allowing  for  an  extra  multi¬ 
plicative  factor  of  2  which  takes  account  of  the  electron  spin  degeneracy,  and  taking 
2;?  =  2/.,  we  obtain  j  =  1  /ir  which  renders  the  Sharvin  limit  for  R. 


Local  COS  for  a  Periodic  One-Dimensional  Chain 

The  local  DOS  for  a  one-dimensional  periodic  chain  with  a  tight-binding  Ham¬ 
iltonian  is  [19]: 


where  V  is  the  site-site  interaction  potential.  In  our  finite  chain,  we  should  recover 
this  behavior  for  a  long  enough  chain  and  for  sites  far  away  from  the  surfaces.  That 
this  is  indeed  the  case  is  shown  in  figure  2,  where  the  DOS  at  the  central  position 
for  a  chain  of  1 00  sites  is  plotted  as  a  function  of  e.  The  curve  shows  the  characteristic 
square  root  singularities  of  the  DOS  for  the  infinite  periodic  system  given  by  (57). 


Conclusions 

We  have  been  able  to  obtain  exact  solutions  for  the  tunneling  current  and  DOS 
for  a  model  of  electron  transfer  through  a  finite  chain  which  may  be  of  interest  for 
the  analysis  of  molecular  imaging  in  stm  in  the  regime  of  moderate  to  strong 
interactions  between  the  atomic  tip  and  the  sample.  The  model  can  also  be  applied 
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Figure  2.  Local  density  of  states  (arbitrary  units)  in  the  center  of  a  chain  of  100  sites  as 
a  function  of  the  dimensionless  energy  t.  The  non-interacting  site  energies  are  all  equal  to 
zero  and  the  potentials  are  symmetric  with  respect  to  the  center  of  the  chain. 


to  the  study  of  the  influence  of  molecular  disorder  on  the  STM  image.  Investigations 
in  this  direction  are  currently  under  way,  where  disorder  is  introduced  through 
variations  in  the  orbital  energies  and  the  site-site  potentials. 

As  a  test  of  the  exactness  of  our  results  we  have  successfully  recovered  some 
known  results  for  one-dimensional  systems  regarding  DOS  and  tunneling  current 
in  resonance. 
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Abstract 

We  have  calculated  the  first-,  second-,  and  third-order  contr.bution  to  the  refraaive  inde*  of  a  dilute 
aqueous  solution  of  three-level  molecules  irradiated  by  an  intense  one-frequency  external  field.  Each 
solute  molecule  is  described  by  two  coupled  harmonic  curves  of  electronic  energies  with  the  same  force 
constant  and  with  minima  displaced  in  energy  and  nuclear  coordinate.  We  assume  that  the  intramolecular 
coupling  mixes  only  the  states  corresponding  to  two  near  degenerate  of  the  three  levels  considered. 
Important  linear  and  nonlinear  contnbutions  of  the  solute  molecules  to  the  refractive  index  have  been 
found,  mostly  in  the  so-called  weak  and  intermediate  coupling  regions  ©  1993  John  Wiley  &  Sons,  Inc 

Introduction 

Three-level  models  of  atoms  and  molecules  interacting  with  an  intense  electro¬ 
magnetic  field  allow  the  description  of  additional  effects  to  those  already  described 
by  the  simple  and  fruitful  two-level  models  [1-4],  In  a  previous  work  we  have 
studied  this  last  type  of  model  describing  the  two  states  as  vibrational  ones  belonging 
respectively  to  two  coupled  harmonic  potentials  with  the  same  force  constant  and 
with  minima  displaced  in  energy  and  nuclear  coordinates  [5],  In  this  way  we  have 
included  in  the  two-level  model  a  typical  trait  of  the  complexity  of  the  molecular 
structure,  while  preserving  some  of  the  simplicity  of  this  approach.  We  have  found 
that  the  intramolecular  coupling  mixing  the  two  states  considered  has  a  significant 
effect  on  the  first-,  second-,  and  third-order  optical  susceptibilities  [5]. 


International  Journal  of  Quantum  Chemistry.  Quantum  Chemistry  Symposium  27, 699-/11  (1993) 
c  1993  John  Wiley  &  Sons.  Inc.  CCC  0020-7608/93/010699-13 
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In  this  article  we  study  the  first-,  second-,  and  third-order  contribution  to  the 
refractive  index  of  a  dilute  water  solution  of  molecules,  each  of  them  described  as 
a  three-level  system  irradiated  by  an  intense  one-frequency  external  field.  On  the 
other  hand,  we  assume  that  an  intramolecular  coupling  mixes  only  the  states  cor¬ 
responding  to  the  two  near  degenerate  of  the  three  levels  considered,  according  to 
the  scheme  illustrated  in  Figure  1. 

In  the  next  section  we  will  describe  a  simplified  version  of  the  typical  vibronic 
model  for  molecules  [6-8],  allowing  a  not-too-intricate  calculation  of  the  optical 
susceptibilities  up  to  third-order  in  the  external  field  amplitude.  Finally,  in  the  last 
section,  we  will  discuss  our  results  for  the  first-,  second-,  and  third-order  contribution 
to  the  refractive  index  of  a  dilute  water  solution  of  the  molecules  considered.  Ac¬ 
cording  to  our  results  important  linear  and  nonlinear  contri,.,utioi<s  of  the  solute 
molecules  to  the  refractive  index  occur  mostly  in  the  so-called  weak  and  intermediate 
coupling  regions. 

The  Three-Level  Model  of  a  Molecule  with  Intramolecular  Coupling 
between  Two  Near  Degenerate  States 

The  Hamiltonian  describing  a  molecule  can  be  written  as 

H=Ts+T,+  V(q,Q)+V'(q)  (1) 

when  q  and  Q  are  the  electronic  and  nuclear  coordinates;  TV  and  Tf  are  the  nuclear 
and  kinetic  energy  operators;  V(q,Q)  is  the  potential  energy  term  arising  from 
electron-electron,  electron-nucleus,  and  nucleus-nucleus  interactions  within  the 


Energy 


Figure  I.  A  three-level  model  of  a  molecule  in  which  two  levels  belong  to  one  of 
the  harmonic  electronic  potentials.  The  intramolecular  coupling  v  mixes  the  states 
corresponding  to  the  nearest  levels  belonging  to  different  electronic  potentials.  The 
weak,  intermediate  and  strong  coupling  cases  correspond  to  v  S.  v  —  S  and  v  P  S, 

respectively. 
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molecule;  and  V'{q)'\sz  residual  term  depending  only  in  the  electronic  coordinates. 
The  wave  functions  describing  the  three  states  considered  of  our  system  are 

=  Mq-,  Q)<f>iAQ)  (2) 

and 

=  ct,MQ\Q)MQ)  +  CUi{q\QH2i(Q)  ,  (3) 

where  ■^\(q\Q)  and  are  the  electronic  wave  functions  corresponding  to  the 

left  and  right  harmonic  potentials  in  Figure  1.  The  harmonic  vibrational  wave 
functions  of  states  associated  to  the  electronic  wave  functions  4/\(q',Q)  and 
'PiiQ'yQ)  are  denoted  as  and  4>2i{Q),  where  j,  k,  and  /are  vibrational 

quantum  numbers.  In  the  scheme  illustrated  in  Figure  1  these  quantum  numbers 
are  j  =  0,  k  =  1,  and  /  =  0.  The  three  energy  levels  of  our  system,  expressed  in 

units  of  h(j}/2,  where  w  is  the  frequency  of  the  harmonic  oscillators,  are  given  by 

£0  =  2(7 +1/2)  (4) 

and 

£"  =  ( l/2)l(£u  +  £2/)  ±  V(£u  -  £2/)'  +  ,  (5) 

where  £ia  =  2(/:  +  1/2),£2/  =  2{/  +  1/2)  +  Vq,  and  the  intramolecular  coupling 
parameter  v  =  where  m,  n  =  1,2,  and  m¥^n. 

The  corresponding  wave  functions  to  the  energies  £0  and  are  respectively  the 
Born-Oppenheimer  products  given  in  Eq.  (2).  and  [5] 

=  Cfdhid-.QHiAQ)  ±  Af,MQ\Q)<l>2i{Q)] ,  (6) 

where 

CfA=  l/ll  +  Uffcl']''"  and  Afk  =  {E,k-E^)/v(<t>jd<t>2i}- 

The  criteria  of  weak,  intermediate,  and  strong  coupling  cases  [6-8]  are  v  S, 
u  ~  S  and  v  l>  S,  respectively,  where  S  =  (mw/  ft  )0o,  "j  is  the  mass  corresponding 
to  the  vibrational  mode  described  by  the  coordinate  Q,  and  Qo  is  the  relative  dis¬ 
placement  between  the  two  minima  of  the  electronic  potentials  (Fig.  1).  The  vi¬ 
brational  overlapping  factors  (</>ikl<p2/)  in  the  above  expressions,  where  /:  =  0,  1 
and  /  =  0,  can  be  calculated  using  the  Pekarian  formula  [9] 

(7) 

where  we  have  the  cases  (n  =  1  and  n’  =  2)  and  (n  =  2  and  n'  =  1 ).  Given  the 
symmetry  of  the  starting  harmonic  potentials  corresponding  to  the  system  without 
intramolecular  coupling,  we  take  the  permanent  dipolar  moments  associated  to  the 
electronic  states  described  respectively  by  our  basis  wave  functions  'P\{q\Q)  and 
'i>2i<l\Q)  as  zero. 

The  transition  dipolar  moments  corresponding  to  our  three-level  system  are  given 
by 
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fk)- -  ~C'\kA'iktii2(.~Qo)(,<l>\o\‘l>2oy  ,  (8) 

IM)+  -  ~CUA\ktt-\2(.~Qo)(,<l>\o\<l>2o')  »  (9) 

M-+  =  ~C'\kC\kt^\2{~Qo)(^<i>2o\<t>l\')[A\k  +  ^r*r]  •  (10) 

Since  cur  ground  state  coincide  with  one  of  the  basis  states  its  permanent  dipolar 
moment  is  zero.  On  the  other  hand,  the  permanent  dipolar  moments  respectively, 
associated  to  the  mixed  states  -  and  +  will  not  vanish,  and  are  given  by 

/i —  =  ik)^A  ikllt2{~Qb)(,<i>2o\<f>liy  (11) 

and 

At++  =  “2(Ct*)^>4j*f£i2(— (3o)('^2oI<^ii)-.  (12) 

Having  obtained  the  expressions  for  the  energy  of  the  three  levels  involved,  and 
the  corresponding  transition  and  permanent  dipolar  moments,  we  are  now  prepared 
to  calculate  the  optical  susceptibilities  up  to  third-order  in  the  external  field  am¬ 
plitude  E,  in  term  of  which  the  polarization  is  usually  written  as  [4] 

/»=  X'‘>£+ X‘^>£2  + (13) 

The  first-order  susceptibility  for  our  system  will  be  given  by  [4] 


_ 1 _  _ 1 _ ■ 

£«-£■«-  £r  -  iEs  ^  E„-  E„  +  Ec+  iEs 

(14) 

where  m,n  =  0,-,+,  and  N  is  the  number  density  of  molecules,  p  mm  is  the  thermal 
equilibrium  value  of  the  fraction  of  molecules  in  the  state  m,  E^  =  ftw//0.5fiw, 
and  £j  =  hym„/0.5hw,  oyand  y„„  being,  respectively,  the  frequency  of  the  external 
field  and  the  phenomenological  damping  rate  of  the  density  matrix  element  pmn- 
We  have  assumed  the  same  value  for  all  the  imn’s  involved. 

The  second-  and  third-order  optical  susceptibilities  are  respectively  given  by  [4] 


X  <"(£.)  = 


N 


O.Sfiw 


2  pLpmm 


X‘^'(2£c)  = 


N 


(O.shwy 


(pmm  Pvv^)f^nvf^vmPmrt 


IE„  -  £m  -  2£,  -  /£,][[£.  -Em-E,-  /£,]] 


_ (pt'f^  phn)P'mP’i!mP’mn _ 

[E„  -  £m  -  2£<.  -  /£,][[£„  -  £„  -  £<•  -  /£,]]  ’ 

(15) 


and 
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X'^>(£,) 


N 


io.shuy 


X  2 

nmv/ 


f  „(0)  _  (0)^ 

\Hnim  Hw  / 


[E„  -E„-Ec-  iE^][Ei  -  E„-  2Ec  -  lEs] 
X  E,,  -  iEs] 


I  „(0)  _ 

-(Pvv  Pll  H 


PmnPvmPhPnl 


/  JO)  „(0)\ 
-(Pll  Ptv) 


+(pi2’-p{.2') 


[£„  -  E„,  -  Ec-  iEs]lEi  ~E„-  IE,.  -  iE^] 

X 

[£„  -  £m  -  £,.  -  /£j1[£„  -  £/ -  2£c  -  iE,] 

X  [£t,  -  £/-  £r-  /£J 

_ MmnM/mMni?Mt7 _ 

(£„  -  E„-  Ec  -  iEs][E„  -El-  lEc  -  iE,] 
X[E„-  Ec  -  Ec  -  iEs], 


N 


(O.Shcoy 


X  2 

nmi/ 


-  pj?') 


(£„  -  £„,  -  £,.  -  /■£,][£/  -  E„-  iE,] 

X  [£,.  -  E„  -  Ec  -  iE,] 


,  (0)  <0)> 
-(Pir  -  PU  ) 


_/  ^(0)  _  ^(0)\ 
(  P  //  P  IT  ) 


+(pl.?>-pj.n 


PmnP  rmP  /  i“P  n! 


[£„-£„-£(.-  /£,){£/  -  £„,  - 

X  [£/  —  £i,  —  £,.  —  /£,] 

_ PmnPncPlmPvl _ 

[£„-£„-  £r  -  /£,][£„  -El  -  /£,] 

X  [£,-£/-£,  -  /£,] 


[  PmnPimPnvPtI 


[E„  -  E„-  Ec  -  /£,][£„  -El  -  iEs] 

X  [£„  -  £,,  -  £,.  -  /■£,] 


(O.Shwy 


X  2 


nmi/ 


(o«>)  _  „(0K 

\Pmm  Per  I 


_ PniPivPvmPmn _ 

[£„  -  E„  -  Ec  -  /£,][£/  -  £m  -  iE,] 
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X  [£„  -  Em  +  Ec-  iEs] 

(Q)  _  (0)v  _ _ 

PU  ) 

X  [£/  -£„  +  £■(.-  /£■,] 


_/„(0)  -(0)\ 
(p//  ~  Pll!  ) 


+(pi2>-pl.°,’) 


[£„  -E„-E,-  /£,][£„  -  -  /£,] 

[£„  -  Em-  Ec  -  iE,][E„  -  E,-  iEs] 

X  [£„  -  £„  +  £f  -  /£■,], 

(16) 


where  n,v,  I  -  0,  -,  +,  and 


X”>(3£.)  = 


^  1/  /ni  (0)^ 

L  ’  (£.  -  £.  -  3£,  -  IE.] 


-(^r-pS’) 


/  (O'  (0)i 

-(pit'  -  Pll  ) 


,  /  (0)  (0)\ 
+  (p//  -  pin  ) 


X  [£„  -  E„-  2Ec  -  /£,] 

X  [£/  -  Em  -  Ec  -  iEs] 

_ ^^mn^^nvl^^rn^■vl _ 

[£„-£„-  3£,.  -  /£j] 

X  [£v.  -  Em  -  2Ec  -  iEs] 

X  [Ec  -  El  -  Ec  -  iEs] 

_ PmnPvmPniPiv _ 

[£„  -  Em  -  3Ec  -  iEs] 

X  [£„  —  Ec  —  2Ec  —  iEs] 

X  [£/  -  Ec  -  Ec  -  iEs] 

_ _ 

[E„  -Em  -  3Ec  -  iEs] 

XlE„-  Ec-  2Ec  -  iEs] 

X  [£„  -  El  -  Ec  -  iEs] 


where  m,  n,  n,  /  =  0,  +. 

With  the  above  expressions  for  x  ‘  X  and  X  we  will  calculate  in  the  next 
section  the  first-,  second-,  and  third-order  contributions  to  the  refractive  index 
coming  from  the  solute  three-level  molecules  in  a  dilute  water  solution. 


s 


Figure  2.  First-order  refractive  index  ij"’(£.-)  =  iji(£f)/£-o  for  a  water  (rjo  =  1.33241 ) 
solution  in  which  the  solute  are  three-level  molecules,  u  =  700  cm"',  5  =  0.1  and  Es  = 
0.02.  In  (a)  we  consider  K,  =  1.9  and  in  (b)  Fq  =  1.5. 


Refractive  Index  Contributions  in  a  Dilute  Water  Solution 

From  the  Maxwell  equation  in  which  enters  the  polarization, 

^2jr-  ld^E__4nd^P 
dt^  di 


Figure  3.  The  same  as  Figure  2  except  that  in  (a)  Vo  =  2.1  and  in  (b)  Vo  =  2.5. 


and  making  the  slowly-varying-amplitude  approximation  [4],  it  can  be  shown  that 
for  one-,  two-,  and  three-photon  dispersive  processes,  the  corresponding  refractive 
indexes  of  a  dilute  water  solution  of  the  molecules  considered,  are  respectively 

»?.(fc)  =  [»;?  +  47rReX,ff(£,)]'/^  (19) 

V2(2Ec)  =  [775  +  4t  £  Rex<^’(2£r))'/^ ,  (20) 


Figure  4.  The  refractive  index  7),(£’f),  Eq.  ( 19).  when  we  tum-on  the  external  field 
amplitude  to  a  typical  value  of  £  =  10’  statvolt/cm  (a)  I'o  =  2. 1.  (b)  I'o  =  2.5. 


and 

773(3£,)  =  [77^  +  47r  Rex'^'(3£V)]''-  (21) 

where  tjo  is  the  solvent  refractive  index,  and  the  effective  susceptibility  Xcfr(  £’< )  can 
be  calculated  from  Eqs.  (14)  and  (16),  according  to 
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Figure  5  The  second-order  refractive  index  i(2(2£'f).  Eq.  (20).  for  the  same  parameters 

as  in  Figure  4. 


Figure  2  illustrates  the  first-order  refractive  index  rj •'’(£,)  =  J?i(£'c)/f-o  for  a 
dilute  water  (r/o  =  1.33241 )  solution  in  which  the  solute  corresponds  to  a  concen¬ 
tration  of  10~^  M  (A^  =  6.022  X  10'^  molec/cm^),  w  =  700  cm"',  5  =  0.1,  £j  = 
0.02,  mi2(~(2o)  =  5  X  10“'*  esu  cm,  and  T  =  300  K.  In  Figure  2(a)  and  2(b)  the 
energy  separation  between  the  minima  of  the  two  harmonic  potentials  are,  respec¬ 
tively,  Vo  =  1.90  and  Vo  =  1.50,  which  correspond  to  a  sepaiation  between  the  two 
levels  belonging  to  different  harmonic  potentials  in  Figure  1  of  0. 1  and  0.5,  re¬ 
spectively.  The  range  of  variation  of  £,  (the  photon  energy  of  the  external  field) 
and  the  intramolecular  coupling  v  is  in  both  cases  between  0  and  1.  We  can  ap¬ 
preciate  in  these  figures  a  sizable  variation  of  the  first-order  refractive  index  with 
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Figure  6.  The  third-order  refractive  index  7)j(3£f).  Eq.  (21 ),  for  the  same  parameters  as 

in  Figure  4. 


both  Ec  and  v  in  the  resonance  region.  On  the  other  hand,  there  is  an  asymmetry 
between  the  two  signals  that  separate  from  each  other  as  the  coupling  v  increases. 
In  this  case  the  lower  frequency  signal  is  larger  than  the  higher  frequency  one.  We 
appreciate  the  inverse  effect  in  Figure  3(a)  and  3(b)  which  correspond  to  the  same 
paramete.s  as  the  previous  figures,  except  that  now  Fq  =  2.1  and  F,  =  2.5,  respec¬ 
tively.  i.e.,  the  energy  level  belonging  to  the  right  harmonic  potential  (Fig.  1)  is 
now  over  instead  of  below  the  nearest  level  of  the  left  harmonic  potential  by  the 
quantities  0.1  and  0.5.  The  undulations  presented  by  in  Figures  2  and  3 

when  we  fix  E^  in  the  resonance  region  and  vary  i'  occur  because  of  the  equivalence 
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between  fixing  v  and  varying  Ec  or  vice  versa,  in  connection  with  the  shape  of  the 
signal. 

In  Figures  4(a)  and  4(b)  we  represent  the  resulting  refractive  index  n  once 
we  superimpose  on  the  first-order  term  the  effect  of  the  third-order  correction  to 
the  optical  susceptibility,  x  *^’(£'c),  when  we  turn  on  the  external  field  to  a  typical 
intense  value  of  £  =  10^  statvolt/cm  [see  Eqs.  (19)  and  (22)].  We  can  appreciate 
a  strong  modification  of  the  refractive  index  in  the  resonance  region  due  to  this 
third-order  effect  on  the  external  field  amplitude.  On  the  other  hand,  we  have  again 
an  asymmetry  between  the  signals  that  separate  as  the  intramolecular  coupling  v 
increases.  We  can  also  appreciate  large  variations  of  the  refractive  index  77i(£c) 
when  we  fix  Ec  in  the  resonance  region  and  vary  the  intramo' ocular  coupling  v. 
We  have  also  calculated  j?i(£c)  for  the  same  parameters  as  in  Figures  4(a)  and 
4(b),  except  that  now  Vq  =  1 .90  and  1 .5,  instead  of  2. 1  and  2.5.  We  have  obtained 
analogous  modification  as  in  passing  from  Figure  2  to  Figure  3. 

In  Figures  5(a)  and  5(b)  are  illustrated  the  second-order  contribution  to  the 
refractive  index,  Eq.  (20),  for  the  same  parameters  as  in  Figures  4(a)  and  4(b), 
respectively.  Now,  the  variation  of  mC^Ec)  with  both  £<.  and  v  are  much  smaller 
(by  a  factor  of  ca.  1  / 1000)  than  the  variation  of  i}i(£f ).  Still  smaller  are  the  variations 
of  )?3(3£c)  illustrated  in  Figures  6(a)  and  6(b).  This  result  can  be  expected  on  the 
ground  that  to  j?i(£c),  milEc)  and  vA'iEc)  contribute  respectively  one-,  two-  and 
three-photon  processes  once  we  tum-on  an  intense  external  field. 


Concluding  Remarks 

Using  a  simple  model  of  three-level  molecules  with  intramolecular  coupling  be¬ 
tween  the  two  states  corresponding  to  different  electronic  potentials,  we  have  found 
the  following  results  for  the  refractive  index  of  a  dilute  water  solution  of  these 
molecules  irradiated  by  an  intense  one-frequency  external  field. 

i )  A  strong  contribution  to  the  first-order  refractive  index  coming  from  the  third- 
order  correction  in  the  external  field  amplitude  for  one-photon  process  when  we 
tum-on  an  intense  external  field  of  £  =  10^  statvolt/cm  [see  Figs.  4(a) 
and  4(b)]. 

ii )  Large  variation  of  the  refractive  index  for  one-photon  processes  ??  i  ( £c)  when 
we  fix  Ec  in  the  resonance  region  and  vary  the  intramolecular  coupling  v. 

iii)  Since  the  refractive  index  is  related  with  the  absorption  coefficient,  the  vari¬ 
ation  found  above  for  j?i(£c).  with  both  £<•  and  v  will  induce  modification  of  the 
absorption  coefficient  for  the  kind  of  systems  we  have  studied. 
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Abstract 

An  extended  version  of  the  Klopman-Salem  equation  of  chemical  reactivity,  which  incorporates  the 
effect  of  an  external  electrostatic  field  in  both  charge  and  orbital  contributions,  has  been  applied  to 
analyze  the  reactivity  pattern  of  ambident  metal  enolates  towards  acylaUon  reactions  in  different  solvents. 
The  results  are  consistent  with  the  experimental  observation  that  the  proponion  of  C-acylation  product 
IS  enhanced  for  solvents  of  low  polanty  where  a  relevant  orbital  control  contribution  is  predicted  Com¬ 
parison  between  the  cisltrans  proportion  of  C-  and  O-acylation  products  are  also  qualitatively  accounted 
for  by  the  FMO  analysis.  ©  1993  John  Wiley  &  Sons.  Inc 

Introduction 

Enolate  anions  are  members  of  a  group  of  ambident  nucleophiles  that  react  with 
alkylating  agents  or  acylating  agents  to  form  products  with  a  new  bond  either  at 
the  alpha-carbon  or  at  oxygen  [1,2].  From  a  spectroscopic  examination  of  metal 
enolates,  the  lithium  salts  are  found  to  exist  as  enolates  structures  in  which  either 
contact  ion  pair  or  solvent  separated  ions  may  be  present. 

The  existence  of  these  metal  enolates  as  solvent-separated  ions  is  favc.  ’d  by  (i) 
increasing  the  polarity  of  the  solvent,  (ii)  the  presence  of  lithium  or  sodium  rather 
than  zinc  or  magnesium  cations,  and  (iii)  by  use  of  the  trans  rather  than  the  cis 
stereo-isomer  of  the  enolate  [3]  [see  Fig.  1(a)]. 

In  acylation  reactions  with  acetic  anhydrides  as  acylating  agents,  it  is  observed 
that  the  major  product  of  the  kinetically  controlled  processes  are  usually  the  O- 
acylated  derivatives  corresponding  in  structure  and  stereochemistry  to  the  starting 
enolate  [4]..  However,  it  has  been  noted  that,  even  with  reaction  procedures  that 
results  in  kinetically  controlled  acylation,  mixtures  of  C-  and  0-acylated  products 
may  result.  The  proportion  of  C-acylation  appears  to  be  enhanced  by  the  use  of 
relatively  nonpolar  solvents  [5]. 


•  Contribution  No.  8  from  Centro  de  Mecanica  Cuantica  Aplicada  (CMCA). 
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The  experimental  observation  on  metal  enolates  are  specially  suitable  to  test  the 
usefulness  and  reliability  of  our  recently  developed  extension  of  the  KJopman- 
Salem  equation  of  chemical  reactivity  in  solution  phase  [6],  First  of  all,  the  co¬ 
existence  of  C-  and  0-acylation  products  appears  to  be  strongly  related  to  the  vari¬ 
ation  of  the  solvent  polarity.  Secondly,  the  theoretical  procedure  described  in  Ref 
[6]  may  provide  some  qualitative  explanations  about  the  relative  relevance  of  the 
charge  control  vs,  the  otbital  control  contributions  for  a  given  solvent. 

In  this  work  we  present  the  result  of  the  solution  FMO  calculations  in  the  series 
of  methyl-,  n-butyl-.  and  phenyl-enolates,  including  the  cis/trans  stereo-isomers, 
for  a  wide  region  of  solvent  polarity  represented  parametrically  by  the  dielectric 
constant.  Specific  effects  of  lithium  are  also  incorporated. 


Model  and  Calculations 


The  Extended  Klopman-Salem  Equation 


It  is  possible  to  have  an  expression  for  the  interaction  energy  between  a  nucleophile 
and  an  electrophile  in  the  presence  of  an  external  electrostatic  field,  by  using  the 
Klopman-Salem  frontier  molecular  orbital  (FMO)  expression  [7.8] 


and 


£.n.(l) 


gNu(l)QEl(1)  2[c;g„(  1  )cg|(  1  )^]- 


^  QNu(0(2Ei(e)  ^yy  2[ci;„(e)cg,(0/3]- 
i  7  f//(0  -  ei(c) 


(1) 


(2) 


for  gas  and  solution  phases,  respectively. 

£,nt,  Qk  -  f//,  and  e/,  are  the  interaction  electrostatic  energy,  the  net  charge,  the 
energy  of  highest  occupied  molecular  orbital  of  the  nucleophile  (HOMO),  and  the 
lowest  unoccupied  molecular  orbital  of  the  electrophile  (LUMO)  in  gas  phase  ( 1 ) 
and  in  the  presence  of  a  polarizable  medium  (c).  represented  by  an  effective  dielectric 
constant  e.  R  represents  the  intemuclear  distance  between  the  active  sites  of  the 
interacting  electrophile /nucleophile  pair.  From  Eqs.  ( 1 )  and  (2),  it  became  possible 
to  derive  a  general  expression  for  the  change  in  the  interaction  energy  from  gas  to 
solution  phase  [6].  The  resulting  final  expression,  after  introducing  a  first-order 
variation  in  the  net  charges  for  the  system  in  solution.  Ga(c)  =  (2a(  1 )  +  ^Qk{  1  )• 
is 


/  1\(2nu(1)(2ei(1)  g[QNu(l)QEi(l)] 

\  cj  R  tR 

\ir 

+  1  {S’/:{c)[cMc)]-  -  S'/.{\)[ch{\)]'}  .  (3) 


which  is  expressed  in  terms  of  the  unperturbed  net  charges  of  the  atomic  active 
centers  of  the  nucleophile /electrophile  pair  and  0ei.  the  LCAO  coefficients  and 
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the  electrophilic  superdelocalizability  of  the  homo  of  the  nucleophile  [9]  in  gas 
( 1 )  and  solution  (e)  phases,  Se- 


occ 

5?(1)  =  2  2 

m 


[cg(i)]^ 

end) 


(4) 


and 


S'e'(c)  =  2 


r  e^ic)  ’ 


(5) 


respectively. 

Equations  (3),  (4),  and  (5)  have  been  derived  previously  [6],  based  on  a  theo¬ 
retical  procedure  that  allows  one  to  describe  the  monoelectronic  orbital  energy 
shifting  induced  by  an  electrostatic  external  field  [10].  Equation  (3)  also  contains 
implicitly  the  solvation  energy  as  given  by  the  Bom  model.  It  is  worth  mentioning 
here  that  in  spite  of  the  fact  that  Eq.  (3)  was  obtained  by  a  perturbation  theory 
procedure  [6],  the  effect  of  the  polarizable  medium  is  variationally  introduced  in 
the  calculation  of  monoeleci  onic  orbital  energies  as  well  as  in  the  lcao  coefficients 
[  1 1 , 1 2  ] .  It  is  also  worth  mentioning  that  Eq.  ( 3 )  keeps  the  structure  of  the  Klopman- 
Salem  equation:  The  first  and  second  terms  are  the  original  charge  control  contri¬ 
bution  and  the  corresponding  charge  polarization  correction,  whereas  the  third 
term  represents  the  corrected  orbital  control  contribution. 

Method  and  Calculations 

We  select  the  series  of  methyl-,  n-butyl-,  and  phenyl-lithium  enolates  belon^ng 
to  a  wide  series  of  metal  enolates  extensively  studied  by  House  et  al.  [2,3]  with 
great  detail.  For  this  small  series,  results  concerning  the  proportion  of  alpha-C-  vs. 
O-acylated  product,  for  a  wide  variety  of  solvents  and  including  both  cis/trans 
stereoisomers  are  reported. 

The  calculation  of  properties  involved  in  Eq.  (3)  were  obtained  from  the  SCRF- 
CNDO/2  method  described  in  Refs.  [11,12].  The  electrophilic  agent  was  acetic 
anhydride.  The  ion  pair  structure  of  enolate  was  taken  as  the  contact  ion  pair 
shown  in  Figure  1(a).  Standard  bond  angles  and  distances  were  considered  in 
all  the  calculations  done..  The  solvent-separated  ion  pair  was  modeled  through 
the  enolate  anion. 

The  calculation  of  the  isolated  ion  pairs  and  enolate  anions  were  completed  with 
calculations  where  the  electrophile  was  allowed  to  approach  the  enolate  frame  at  a 
perturbing  distance  of  about  2.0  A  as  shown  in  Figure  1(b).  Calculations  with 
distances  greater  than  2.0  A  were  also  performed,  and  they  did  not  show  any  relevant 
distortion  with  respect  to  the  variation  of  the  properties  involved  in  Eq.  (3)  from 
gas  solution  phases. 

;  the  best  characterized  reactions  in  House  et  al.  experimental  work  [3]  (in 
itiTiis  of  the  variety  of  solvents  used)  was  the  trans-phenyl  enolate,  we  will  focus 
first  our  attention  on  this  system. 
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Figure  1.  Molecular  structures  considered  for  calculations  of  molecular  properties:  (a) 
lithium  enolate  ion  pairs;  (b)  nucleophile/electrophile  pair  interaction. 


Results  and  Discussion 

We  first  became  interested  in  determining  the  region  of  solvent  polarity  (measured 
by  the  variation  of  the  dielectric  constant  c  from  1 .0  to  10.0),  at  which  the  interaction 
energy  variations  given  by  Eq.  ( 3 ) ,  for  the  nucleophile  /  electrophile  pair,  were  more 
favorable  for  the  reactions  under  study.  Under  the  approximation  that  /3c-o/ 
jSc-c  =  0.8  [13],  the  results  displayed  in  Figure  2  were  obtained  for  trans-phenyl 
enolate  anion  and  trans-phenyl  enolate/lithium  ion  pair.  It  shows  the  variation  of 
the  total  energy  for  the  electrostatic  interaction  between  the  nucleophile/electrophile 
pair,  for  the  reaction  at  the  oxygen  and  the  alpha-carbon  sites  as  a  function  of  the 
dielectric  constant.  It  may  be  seen  that  the  interaction  with  the  solvent  favors  the 
reaction  at  both  sites.  As  e  increases,  the  O-acylated  product  becomes  predominant. 
Saturation  of  solvent  effects  is  predicted  for  c  >  3.0.  Comparison  of  the  differential 
reactivity  at  the  oxygen  and  alpha-carbon  sites  for  both  the  anion  (solvent-separated 
ions)  and  the  ion  pair  shows  that  the  enhancement  in  favor  of  the  O-acylation 
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Figure  2.  Nucteophile/electrophile  electrostatic  interaction  energy  variations  for  trans- 
phenyl/Li*  ion  pair  and  rrans-phenyl  enolate  with  respect  to  solvent  polarity  (e).  O- 
acylation  in  ion  pair  (■),  0-acylation  in  enolate  ( • )  alpha-C-acylation  m  ion  pair  (O)  and 
alpha  C-acylation  in  enolate  (+). 


product  is  greater  for  the  enolate  anion.  In  summary,  these  results  suggest  that,  for 
increasing  solvent  polarity,  a  major  proportion  of  0-acylated  product  is  to  be  ex¬ 
pected.  Moreover,  from  Figure  2,  it  may  be  seen  that  at  low  solvent  polarity  solvent 
separated  ions  may  be  present,  in  agreement  with  the  experimental  results  reported 
in  Ref.  [3].  This  result  also  permits  us  to  stablish  a  low  polarity  region  1.0  <  e  < 
3.0,  where  the  C-acylation  products  may  coexist  with  the  O-acylation  products. 

Figure  3  displays  the  variation  of  the  interaction  energy  with  the  solvent  polarity 
for  the  m-phenyl-enolate/ lithium  ion  pair  and  c/5-phenyl-enolate  anion,  repre¬ 
senting  the  solvent  separated  ions,  respectively.  In  this  case,  the  reaction  at  the 
oxygen  enolate  site  remains  favored  with  respect  to  the  alpha-carbon  acylation. 
Comparison  with  Figure  2  shows  that  the  difference  in  reactivity  at  the  O-  and 
alpha-C  is  smaller  than  the  corresponding  difference  of  the  franwsomer.  Also,  no 
variations  in  reactivity  at  both  C-  and  O-  sites  is  observed  for  the  c/s-phehyl  enolate 
in  its  ion  pair  and  anionic  forms. 

Figure  4  summarizes  the  relative  reactivity  at  the  oxygen  and  alpha-carbon  sites 
for  the  cis/trans  stereoisomers  for  a  varying  solvent  polarity.  The  following  may 
be  observed:  First  of  all,  reaction  at  the  oxygen  site  appears  to  be  invariant  with 
respect  to  the  cisitrans  conformation  and  also  independent  of  whether  or  not  dis- 
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Figure  3.  Nucleophile/electrophile  electrostatic  interaction  energy  variations  for  as- 
phenyl/Li*  ion  piair  and  c«-phenyl  enolate  with  respect  to  solvent  polarity  (c).  O-acylation 
ion  pair(a),  O-acylation  in  enolate  (-l.alpha-C-acylation  in  ion  pair  (O),  and  alpha-C- 
acylation  in  enolate  (+). 


sociation  is  present.  Secondly,  if  we  assume  that  for  e  <  3.0  the  lithium  enolate 
displays  a  contact  ion  pair  structure,  the  C-acylation  reaction  shows  a  significant 
difference  with  respect  to  the  cis/trans  conformations.  For  c  >  3.0,  the  difference 
between  reactivity  at  oxygen  and  alpha-carbon  increases  and  becomes  constant. 
Therefore,  in  view  of  the  experimental  results  reported  by  House  et  al.  [  3  ] ,  we  may 
suggest  dissociation  for  e  >  3.0,  and  under  this  condition,  the  oxygen  site  will  react 
more  readily  than  the  alpha-carbon  site,  in  the  dissociation  region.  On  the  other 
hand,  variation  of  the  reactivity  pattern  at  the  alpha-carbon  will  be  relevant  in  the 
low  polarity  region  (e  <  3.0),  where  significant  differences  in  reactivity  are  predicted 
for  the  cis/trans  conformations.  These  predictions  are  reinforced  by  the  experimental 
data  reported  by  House  et  al.  [3]:  the  O-acetyl  derivative  in  DME,  a  moderately 
polar  solvent,  shows  a  87%  yield  for  the  ds-phenyl  derivative,  compared  to  95% 
yield  for  the  frans-phenyl  derivative.  On  the  other  hand,  C-acetyl  derivative  in 
Et20,  a  modest  polar  solvent  as  compared  to  DME,  shows  a  26%  yield  for  the  cis- 
phenyl  enolate,  compared  to  the  <10%  yield  for  the  trans  product  in  the  same 
solvent  (Et20), 

Frontier  Molecular  Orbital  Analysis 

To  discuss  the  previous  results  in  the  frame  of  the  FMO  theory,  we  start  by  in¬ 
troducing  the  following  criteria  with  reference  to  Eq.  (3).  The  first  two  terms  of 
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Figure  4.  Relative  reactivity  at  the  oxygen  and  alpha-carbon  sites  for  the  cis/lrans  ste¬ 
reoisomers  for  the  solvent  polarity  variation;  /ran-r-C-acylation  (  +  ),  c/.r-C-acylation  (O). 
//•tf/r.r-O-acylation  (•),  and  crs-O-acylation  {■). 


Eq.  (3)  represent  the  c//ar;?e  co/i/roZ  contribution  to  the  variation  of  the  nucleophile/ 
electrophile  electrostatic  interaction  energy,  whereas  the  third  term  represents  the 
orbital  control  contribution. 

In  order  to  compare,  in  the  simplest  way,  the  relative  reactivity  at  the  alpha- 
carbon  and  oxygen  sites,  we  introduce  the  following  definitions: 

5A£S  =  (6) 

and 

.  (7) 

With  definitions  (6)  and  (7),  we  assess  the  relative  contribution  of  the  charge 
control  term  £^Eq  and  the  orbital  control  term  A£Sb  for  reaction  at  the  alpha- 
carbon  site,  having  the  corresponding  charge  and  orbital  contributions  at  oxygen 
as  a  reference  state.  These  definitions  also  entail  the  following  criteria:  If  hLEg  > 
0,  then  the  charge  contribution  at  the  alpha-carbon  site  will  be  less  relevant  than 
the  corresponding  charge  contribution  at  the  Oxygen  site.  On  the  other  hand,  if 
l>E%rty  <  0,  then  the  orbital  contribution  at  the  alpha-carbon  site  will  be  more  relevant 
than  the  corresponding  orbital  contribution  at  the  Oxygen  site. 

In  Figure  S  a  plot  of  the  relative  charge  control  vs.  the  relative  orbital  control 
for  different  values  of  the  dielectric  constant  is  shown  for  methyl,  /i-butyl,  and 
phenyl  derivatives. 

As  expected,  bAE'g  is  always  positive  and  ^A£^  is  always  negative.  According 
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Figure  5.  Relative  charge  vs.  orbital  control  contributions  for  C-acylation  with  respect 
to  solvent  polarity  for  phenyl-  (O),  methyl-  (■),  and  n-butyl-  (A)  enolates  derivatives 
Numbers  over  the  curves  indicate  different  values  of  c. 


to  these  trends,  we  may  conclude  that  charge  control  contribution  will  mostly  favor 
reaction  at  the  oxygen  site,  whereas  orbital  control  contribution  will  enhance  carbon 
acylation  products. 

As  compared  to  methyl-  and  butyl-derivatives,  5A£orb  for  fran^-phenyl  is  con¬ 
sistently  less  favorable,  along  the  complete  range  of  solvent  polarity  considered. 
We  may  predict  then  that,  for  mixtures  of  C-acylated  and  O-acylated  products, 
methyl  and  n-butyl  derivatives  will  show  a  greater  proportion  of  C-acylated  products 
than  /rani-phenyl  derivatives.  This  prediction  is  consistent  with  the  experimental 
data  reported  in  Ref.  [3],  where  an  C-acylated  product  yield  <  1%  was  observed 
for  this  derivative.  On  the  other  hand,  the  methyl  and  butyl  derivatives  are  predicted 
to  display  a  very  close  product  yield  for  alpha-C-acylation  which  is  also  consistent 
with  what  is  experimentally  observed  in  Ref.  [3]  (4%  and  9%  product  yield  for  n- 
butyl  and  methyl  derivatives,  respectively). 

Concluding  Remarks 

The  experimental  reactivity  pattern  shown  by  a  series  of  lithium  enolates  towards 
acylation  reactions  was  used  as  benchmark  case  for  the  extended  Klopman-Salem 
equation  of  chemical  reactivity  in  .olution  phase.  The  following  conclusions  may 
be  drawn  from  the  present  work. 

(i)  Two  polarity  regions  where  lithium  enolates  may  be  present  either  as  contact 
ion  pair  or  solvent-separated  ions  were  identihed.  In  terms  of  this  criterion, 


C-  vs.  O-ACYLATION  OF  METAL  ENOLATES 


721 


the  conditions  for  the  alpha-carbon  vs.  0-acylation  reactions  were  estab¬ 
lished. 

(ii)  The  relative  differential  reactivities  along  series  of  compounds  may  be  ex¬ 
plained  in  terms  of  charge  znd  orA/ta/ contributions.  Alpha-carbon  acylation 
products  are  to  be  expected  for  low  polarity  solvents,  that  is,  when  the 
lithium  enolate  presents  a  contact  ion  pn/r  structure.  In  other  words,  reaction 
at  the  alpha-carbon  appears  to  be  assisted  by  the  presence  of  lithium  at  the 
neighborhood  of  the  oxygen  site,  and  it  is  enhanced  along  a  series  of  sub¬ 
stituent  as  the  orbital  control  term  increases. 

(iii)  General  trends  concerning  the  proportion  of  the  alpha-carbon  vs.  oxygen 
acylation  products  in  different  solvents  were  successfully  accounted  for  by 
the  present  formalism. 
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Abstract 

Results  of  a  pilot  study  on  the  applicability  of  the  quantum  defect  orbital  method  for  describing 
electronic  transitions  between  molecular  Rydberg  states  are  reported  Oscillator  strengths  and  Einstein 
emission  coefficients  for  the  tnatomic  hydrogen  molecule  have  been  calculated.  The  results  are  in  good 
agreement  with  the  data  derived  from  more  sophisticated  theoretical  approaches  ©  1993  John  Wiley  & 
Sons.  Inc. 


Introduction 

The  Rydberg  molecules  are  characterized  by  a  repulsive  ground  state,  a  series 
metastable  excited  states,  and  a  stable  cation  [  1  ] .  If  the  energy  gap  between  the 
electronic  ground  state  and  the  first  excited  state  is  sufficiently  large,  all  excited 
states  are  of  predominantly  Rydberg  character.  An  excellent  review  on  Rydberg 
molecules  has  been  recently  published  by  Herzbeig  [  1  ] ,  and  an  extensive  compilation 
of  the  relevant  experimental  data  by  Jacox  [2],  Prototypes  for  the  diatomic  and 
for  the  polyatomic  Rydberg  molecules  are  He2  and  H3,  respectively.  These  two 
species  have  received  most  attention  from  both  experimental  and  theoretical  spec- 
troscopists  and  are  commonly  used  for  testing  new  methods. 

The  existence  of  metastable  Rydberg  states  of  H3  was  deduced  by  Herzberg  |1], 
and  by  Vogler  [3].  Herzberg  [1]  also  determined  the  D3h  geometry  of  the  underlying 
H3  core.  This  pioneering  work  inspired  several  theoretical  [4-10]  and  experimental 
[11-15]  studies  which  involve  transitions  between  the  bound  Rydberg  states.  Most 
of  the  theoretical  studies  on  the  H3  spectrum  have  been  based  on  frozen-core  Har- 
tree-Fock  (HF)  calculations  [4-7]  or  have  used  limited  configuration  interaction 
(ci)  expansions  [8].  Highly  correlated  calculations  have  been  reported  only  recently 
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[9,10].  Petsalakis  et  al.  [9]  have  examined  electronic  transition  moments  among 
several  low-lying  Rydberg  states  at  various  geometries.  Diercksen  et  al.  [10]  have 
calculated  energies  for  the  ground  and  several  Rydberg  states  by  a  large-scale-mul¬ 
tireference  configuration  interaction  with  different  basis  sets  at  the  Dsh  geometry. 
They  have  found  that  the  sequence  of  the  energy  levels  depends  on  the  size  of  the 
basis  set,  and  obtained  excitation  energies  that  are  in  good  agreement  with  the 
observed  spectrum.  However,  these  authors  [10]  point  out  that  even  for  a  3-electron 
system,  reliable  results  may  be  obtained  only  for  low-lying  states,  and  only  if  both 
the  one-electron  basis  set  and  the  configuration  interaction  expansion  are  extended 
to  the  limits  permitted  by  the  most  advanced  computing  facilities.  Therefore,  de¬ 
veloping  semiempirical  schemes  to  study  such  spectra  seems  to  be  worth  some 
effort. 

The  electronic  spectra  of  the  Rydberg  molecules  consist  of  typical  Rydberg  pro¬ 
gressions.  It  is  expected  that  the  Rydberg  states  may  be  described  as  quasi-hydrogenic, 
with  the  spherical  atomic  core  replaced  by  a  molecular  cation  which  imposes  “in¬ 
ternal  crystal  field”  splittings  appropriate  to  its  symmetry.  This  suggests  a  possibility 
of  modeling  the  electronic  structure  of  molecular  Rydberg  states  by  the  methods 
originally  developed  to  describe  the  Rydberg-like  series  in  atoms.  In  particular,  the 
concept  of  quantum  defect  has  been  used  within  the  context  of  the  multichannel 
quantum  defect  theory  [  1 8  ]  to  study  Rydberg  states  in  diatomic  molecules  [19,20]. 
It  has  also  been  applied  to  some  polyatomic  systems,  for  which  the  mixing  between 
different  electronic,  vibrational,  and  rotational  channels  has  been  studied  [21].  The 
Jahn-Teller  effect  in  the  (np)^E'  Rydberg  series  of  H3  and  Dj  has  also  been  studied 
by  this  technique  combined  with  existing  ab  initio  results  [22] .  In  some  of  the  ab 
imtio  studies  mentioned  above  [4-7]  quantum  defects  have  been  extracted  from 
the  HF  energies  computed  for  the  low-lying  Rydberg  states  and  then  used  to  construct 
the  quantum  defect  hydrogenic  orbitals  to  describe  highly  excited  Rydberg 
states  [  7 ] . 

In  the  present  article  a  pilot  study  on  the  application  of  the  quantum  defect 
orbital  (QDO)  method  [16]  for  describing  electronic  transitions  in  Rydberg  molecules 
is  reported.  The  method  has  been  applied  to  the  calculation  of  transition  probabilities 
in  H3. 


Method 

The  QDO  approach  belongs  to  the  class  of  model  potential  semiempirical  methods. 
It  allows  to  generate  valence,  Rydberg,  and  continuum  orbitals  as  analytical  solutions 
of  a  one-electron  radial  equation  that  contains  the  quantum  defect  as  an  empirical 
parameter.  The  model  potential  has  the  form 

V(r)  =  [X( A  +  1 )  -  1  ( 1  +  1  )]/2r^  -  Znet/r ,  ( 1 ) 

where  Znet  is  the  nuclear  charge  acting  on  the  valence  or  Rydberg  electron  at  large 
radial  distances.  The  parameter  X,  which  appears  both  in  the  effective  potential  and 
in  the  radial  solutions,  determines  the  screening  aspects  of  the  former.  It  is  related 
to  the  quantum  defect,  5,  and  the  orbital  quantum  number,  1 .  through  the  expression 
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X  =  I  -  6  +  c  (2) 

where  c  is  an  integer  determined  in  such  a  way  that  the  radial  orbital  are  normalizable 
and  possess  a  correct  nodal  pattern.  The  eigenvalue  of  the  corresponding  Hamil¬ 
tonian  ( i.e.,  the  energy  of  the  Rydberg  electron )  is  independent  of  c.  since  it  depends 
on  the  non-integer  part  of  5  only.  The  quantum  defect  is  obtained  from  the  following 
equation 

£„  =  r-Znet/[2(«-5)=]  (3) 

where  E„  is  the  discrete  spectrum  energy  (taken  either  from  experiment  or  calculated 
using  another  method),  7  is  the  ionization  energy,  and  n  is  the  principal  quantum 
number.  The  quantum  defect  creates  the  non-integer  part  of  the  parameter  (2).  It 
connects  the  potential  ( i )  with  the  empirical  data  and.  as  a  consequence,  brings 
about  an  implicit  account  for  electron  correlation,  spin-orbit  coupling,  and  other 
effects  which  are  explicitly  neglected  in  this  model. 

The  transition  moment  is  defined  as 

M,^{R,\Q{r)\Rj).  (4) 

where,  in  the  simplest  case  of  dipole  transitions  and  the  dipole-length  formulation, 
Q(r)  =  r.  It  is  important  to  note  that  in  this  case  A/,,  may  be  expressed  in  a  closed 
analytic  form.  Taking  into  account  the  effects  of  the  core  polarization  (i.e..  a  part 
of  the  correlation  effects  between  the  core  and  the  Rydberg  electron )  results  in  a 
modification  of  the  transition  operator  0(r)  [16].  It  has  recently  been  demonstrated 
[23]  that  in  the  case  of  atoms 

(2(r)  =  r{l  -(o/r^)[l  -  exp(-/-/r. )]' }  ,  (5) 

where  a  is  the  dipole  polarizability  of  the  core  and  r,  is  a  cut-off  parameter,  correctly 
descnbes  the  polarization  of  the  core  and  allows  the  theory  to  retain  its  analytic 
character. 

We  assume  that  the  molecular  Rydberg  electron  may  be  described  by  a  hydrogenic 
Schnjdinger  equation  with  the  effective  potential  given  by  Eq.  ( 1 ).  The  modifications 
introduced  to  the  theory  are  the  following.  The  principal  and  the  angular  momentum 
quantum  numbers  correspond  to  the  appropriate  orbital  in  the  united  atom  limit. 
The  energies  used  to  determine  the  quantum  defects  include  the  splittings  due  to 
the  non-spherical  molecular  core.  In  Eq.  (5)  the  core  polarizability  parameter  is 
estimated  from  molecular  data. 


Results 

The  energies  for  the  lowest  energy  levels  have  been  derived  from  the  measurements 
by  Watson  quoted  by  Petsalakis  et  al.  [9]  and  from  the  ionization  energy  (29562.6 
cm- 1 )  obtained  by  Helm  [13].  The  last  author,,  in  a  series  of  experimental  studies 
[12-15]  established  the  assignment  of  the  discrete  spectrum  and  the  effective  quan¬ 
tum  defects.  In  the  present  work  the  quantum  defects  have  been  calculated  according 
to  Eq.  (3).  They  are  compared  with  the  ones  by  Helm  [12]  and  those  derived  by 
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Other  authors  from  theoretical  energies  in  Table  I.  Also,  the  quantum  defects  for 
lithium,  i.e.,  for  the  united  atom  limit  of  H3  are  given.  The  differences  between  the 
quantum  defects  from  different  sources  arise  from  the  differences  in  the  energy 
values.  No  experimental  energies  for  the  electronic  states  higher  than  those  listed 
in  Table  1  are  known.  Therefore,  the  quantum  defects  for  the  higher  states  have 
been  assumed  to  be  equal  to  the  ones  listed  in  Table  I,  corresponding  to  the  state 
of  the  same  symmetry  and  with  the  highest  energy.  This  assumption  is  consistent 
with  earlier  observations  on  the  behavior  of  quantum  defects  in  atoms. 

The  parameters  defining  the  transition  operator,  Eq.  ( 5 )  have  been  estimated  by 
combining  the  experimental  data  for  the  equilibrium  geometry  of  H  ,  and  the  com¬ 
putational  experience  gained  in  similar  calculations  for  atoms,  that  is,  r<.  has  been 
taken  as  approximately  equal  to  the  core  radius,  and  a  is  the  core  dipole  polariz¬ 
ability.  Thus,  it  has  been  assumed  that  r  =  1.19  a.u.  (the  interatomic  distance  in 
H3  is  equal  to  1.65  a.u.  [24])  and  a  =  5.7  a.u.  (from  the  data  reported  by  Fowler 
etal.  [25]). 

The  QDO  values  of  the  oscillator  strengths  for  the  allowed  electronic  transitions 
are  displayed  in  Table  II.  They  are  compared  to  those  obtained  by  Martin  [6]  with 
the  HF  method.  The  agreement  between  the  QDO  and  HF  results  is  even  better  than 
one  might  expect  from  such  a  highly  simplified  procedure.  Therefore,  the  oscillator 
strengths  for  several  transitions  not  being  studied  by  Martin  [6]  have  also  been 
included  in  the  table. 


Table  I  The  Rydberg  states  of  Hj  at  Djh  geometry,  and  the  corresponding  quantum  defects.  Data 
for  the  united  atom  limit  (lithium)  are  also  included.  All  quantum  defects  are  multiplied  by  10’. 


Hj 


Derived  from  HF  energies  Li 

This  -  Exptl.  Exptl. 


State 

work 

Ref.  (4) 

Ref.  17] 

Ref.  (5] 

Ref  [12] 

Ref  (26] 

’E'(2p) 

483 

407 

407 

48* 

2s 

415 

'A',  (2s) 

113 

84 

73 

8* 

71 

2p 

41 

'A5(2p) 

73 

17 

12 

0* 

80  10 

’E'(3p) 

404 

345 

328 

35* 

3s 

404 

’A'i  (3s) 

103 

71 

89 

7* 

71 

3p 

44 

’a;(3p) 

59 

22 

3 

0* 

’E'  (3d) 

22 

-0 

2* 

3d 

1 

’E‘'(3d) 

-4 

-6 

-29 

-1* 

-10 

'A',  (3d) 

-15 

-15 

00 

1 

-2* 

-6 

*  The  last  digit  of  the  quantum  defect  was  not  given  in  Ref.  [5]. 

For  the  (2p)  ’AJ  state,  the  first  and  the  second  entnes  arc  the  recommended  values  by  Helm  (12]  for 
the  2p-ns  and  2p-nd  transitions,  respectively. 
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Table  II.  Oscillator  strengths  for  the  allowed  electronic  transitions  m  H3  at  Dsh  geometry.  For  each 
entry,  the  first  and  the  second  numbers  are  the  qdo  oscillator  strengths  calculated,  respectively,  without 
and  with  the  core  polarization  correction.  The  third  number  is  the  hf  value  [6].  All  quantities  are 

multiplied  by  10*. 


State 

'E'(2p) 

W,  (2s) 

'A!(2p) 

^A'l  (2s) 

20. 

15. 

26. 

W,  (3s) 

0.98 

0.88 

1.1 

2.2 

2.5 

2.8 

W,  (4s) 

0.65 

0.42 

0.40 

0.48 

0.54 

0.67 

W,  (5s) 

0.021 

0.018 

0.19 

0.22 

W,  (3d) 

2.2 

3.6 

3.8 

18. 

19. 

25. 

'A',  (4d) 

0.63 

1.2 

1.4 

5.2 

5.2 

5.0 

W,  (5d) 

.28 

0.56 

2.2 

2.3 

^E'(3d) 

8.2 

7.4 

5.1 

23. 

23. 

17. 

=E'(4d) 

2.4 

2.0 

1.8 

5.8 

5.8 

3.4 

=E'(Sd) 

0.52 

0.46 

2.4 

2.4 

^E’(3d) 

14. 

14. 

15. 

2.9 

2.9 

0.07 

^E'  (4d) 

4.9 

4.7 

5.4 

0.75 

0.76 

0.022 

^E'(5d) 

3.7 

3.4 

0.32 

0.32 

"E'(3p) 

0.17 

021 

40. 

40. 

35. 

'E'(4p) 

0.063 

4.9 

3.7 

3.7 

’E'(5p) 

1.8 

1.8 

'A5(2p) 

3.8 

4.2 

4.8 

'A5(3p) 

11. 

'A5(4p) 

2.6 

2.8 

2.3 

'a;(5p) 

1.2 

1.3 

The  Einstein  emission  coefficients  calculated  in  the  present  work  and  the  ones 
derived  from  other  theoretical  calculations  are  compared  in  Table  111.  With  the 
exception  of  Raynor  and  Herschbach  (7  j,  who  report  Einstein  coefficients  directly 
in  their  article,  the  remaining  authors  whose  results  are  shown  in  the  table  report 
either  oscillator  strengths  [  6  ]  or  transition  moments  [  4,9  ] .  Therefore,  the  displayed 
Einstein  coefficients  have  been  obtained  from  their  transition  frequencies  or  wave¬ 
lengths  and  from  the  corresponding  transition  moments  or  oscillator  strengths.  The 
overall  agreement  is  very  good.  In  particular,  the  results  of  the  extensive  ci  calcu¬ 
lations  by  Petsalakis  et  al.  [9]  and  the  ones  by  King  and  Morokuma  [4],  who 
carried  out  the  most  careful  choice  of  the  basis  functions  of  all  the  hf  procedures 
[4-7],  and  whose  calculated  lifetimes  agree  well  with  those  observed  by  Figger  et 
al.  [11]  for  the  isotopic  molecule  triatomic  deuterium,  seem  to  be  the  most  re¬ 
liable  ones. 


Conclusions 

Several  transitions  between  Rydberg  states  of  the  triatomic  hydrogen  molecule 
have  been  studied  using  the  non-relativistic  version  of  the  qdo  method.  The  results 
for  oscillator  strengths  and  Einstein  coefficients  for  spontaneous  emission  agree 
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Table  III.  Wavelengths 

in  A  (  he  upper  line)  and  Einstein  emission  coefficients  in 
fthe  lower  two  lines) 

units  of  lO’s  ' 

Transition 

QDO* 

MRDCl'* 

HE* 

hf" 

HF' 

'E'(3p)-^A',  (2s) 

6877 

6878 

6805 

7010 

7051 

5.58 

4.88 

2.52 

5.53 

4.54 

5.63 

^A^(3p)-^A',(2s) 

5516 

5548 

5514 

5731 

5767 

2.21 

2.52 

2.19 

2.12 

1.55 

2.32 

^E'(3d)-=A',  (2s) 

5442 

5468 

5514 

5711 

5759 

0.0131 

0.0167 

0.0493 

0.0109 

0.01 

0.0132 

^A5(2p)-^E'(2p) 

6031 

5526 

6077 

6518 

6460 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

^A',(2s>-'E'(2p) 

6525 

5997 

6925 

7488 

7302 

3.10 

4.59 

3.62 

3.09 

3.21 

2.32 

2A',(3s)-=A5(2p) 

6070 

5641 

6369 

6621 

6554 

0.40 

0.501 

0.460 

0.461 

0.27 

0.46 

2E'(3d)-=A5  (2p) 

5746 

5929 

6016 

6349 

6233 

4.57 

3.27 

1.57 

3.87 

3.23 

4.61 

^A',(3d)-*A5(2p) 

5714 

5800 

6077 

6544 

6295 

3.76 

4.78 

4.52 

3.90 

4.40 

3.40 


•  This  work;  experimental  wavelengths  (as  quoted  in  Table  II  of  Ref.  [9])  and  emission  coefficients 
calculated  without  (second  line)  and  with  (third  line)  core  polarization  correction. 

Petsalakis  et  al.  (9] 

'  Martin  [6]. 

Ring  and  Morokuma  [4]. 

'  Raynor  and  Herschbach  (7). 


fairly  well  with  those  of  other,  more  sophisticated,  theoretical  procedures.  Therefore, 
it  is  expected  that  this  approach  may  be  useful  for  studying  the  spectra  of  other 
Rydberg  molecules.  In  the  case  of  molecules  containing  heavy  atoms,  the  relativistic 
version  of  the  qdo  method  [17]  may  be  more  adequate. 
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Abstract 

The  intermolecular  potentials  for  the  A'  'Z  and  A  states  of  Li  •  •  -  Ne  were  studied  by  a  variety  of 
multiconfiguration,  single  configuration  and  perturbation  methods  (c^SPT2)  The,-1  ^11  excited  state  was 
calculated  to  have  a  well  depth  of  214  cm  '  at  an  internuclear  separation  of  2.26  A  (acpf)  in  excellent 
agreement  with  the  224  cm  '  denved  from  expenmental  data.  A  smaller  well  of  15.8  cm*'  was  found 
for  the  A  ^21  ground  state  at  an  intermolecular  separation  of  5.28  A  (acpf).  These  results  are  in  better 
agreement  with  expenmental  results  than  the  previously  reported  pseudopotential  calculations.  The 
comparison  of  ci  calculations  with  caspt2  results  shows  that  the  latter  is  able  to  give  good  results  for 
interacting  systems.  «  i99.t  John  Wiley  &  Sonv.  Inc. 


Introduction 

Recently  the  results  of  experiments  on  lithium  doped  rare  gas  solids  (Ar.  Kr.  Xe) 
were  reported  [  1  ].  That  research  focused  on  the  properties  of  cryogenically  trapped 
lithium  atoms  and  clusters  prepared  by  laser  ablation  of  solid  lithium  metal,  and 
was  motivated  by  the  potential  of  using  trapped  alkali  metal  atoms  for  chemical 
rocket  propulsion  systems  [2], 

In  general,  the  optical  absorption  spectrum  of  heaver  alkali  metals  trapped  in 
rare  gas  matrices  consist  of  multiple  triplet  absorption  features  [3,4],  There  are 
several  competing  models,  all  involving  multiple  trapping  sites,  which  seek  to  explain 
the  origin  of  this  triplet  structure.  In  contrast,  the  spectroscopic  results  obtained 
for  lithium  doped  rare  gas  matrices  show  only  one  well-resolved  triplet  absorption. 
This  observation  together  with  the  sharp  ESR  lines  reported  in  Ref.  [  1  ]  gives  evidence 
for  the  existence  of  a  single,  well-defined  trapping  site  structure  for  lithium  in  rare 
gas  solids. 

The  structure  of  the  various  trapping  sites  in  a  rare  gas  solid  is  determined  by 
the  potential  experienced  by  the  alkali  metal  atom.  This  potential  can  be  deduced 
experimentally  from  mobility  and  beam  scattering  measurements  as  well  as  emission 
experiments  [5],  or  it  can  be  calculated  using  various  models. 


*  Permanent  Address  Department  of  Chemistry.  University  of  Warsaw.  02-09.5  Pasteura  I.  Warsaw. 
Poland. 

*  Author  to  whom  correspondence  should  be  addressed 


International  Journal  of  Quantum  Chemistry  Quantum  Chemistry  Symposium  27..73I-74I  (1993) 
1993  John  Wiley  &  Sons.  Inc.  CCC  0020-7608/9.3/010731-1 1 


732 


SADLEJ  AND  EDWARDS 


Several  computational  studies  have  used  nonlocal  pseudopotential  methods  based 
on  the  Thomas-Fermi  statistical  model  [6.7]  in  order  to  determine  the  alkali  metal- 
rare  gas  interaction  potential.  These  pseudopotenlial  calculations  are  in  agreement 
with  the  cxperimentallv  deduced  results  for  many  ground-state  alkali  metal-rare 
gas  systems.  However,  for  the  .-1  *11  first  excited  states,  the  calculated  potential 
curves  are  found  to  be  too  shallow  relative  to  experiment.  This  is  alleviated  somewhat 
by  employing  the  Kim-Gordon  core-core  repulsion  model  [8]  rather  than  the 
Gombos-Baylis  model  [9],  but  is  not  eliminated. 

It  IS  also  possible  to  express  the  matrix  perturbation  on  the  ground  and  excited 
states  of  an  isolated  alkali  atom  in  terms  of  two-body  dimer  potentials  [10].  Such 
a  pairwise  additive  potential  was  used  in  Ref  [I]  for  the  interpretation  of  their 
experimental  results. 

Other  experiments  which  require  knowledge  of  the  interaction  potential  are  in¬ 
tradoublet  or  interdoublct  transitions  occurring  in  thermal  collisions  between  alkali 
metal  atoms  and  rare  gas  atoms.  Understanding  these  processes  requires  a  detailed 
knowledge  of  the  interatomic  potential.  The  work  reported  here  is.  to  our  knowledge, 
the  first  ah  iniiia  study  of  the  ground  and  first  excited  state  potential  energy  curves 
for  the  Li  •  •  •  Ne  system.  We  shall  compare  the  results  of  our  calculations  to  the 
pseudopotential  results,  the  only  theoretical  data  available  for  these  states. 

Computational  Method 

We  have  used  contracted  averaged  natural  orbital  (AND)  type  basis  sets  throughout 
this  work.  These  basis  sets  have  been  derived  from  a  density  matrix  averaged  over 
the  atomic  ground  state,  positive  and  negative  ions,  as  well  as  the  atom  in  a  polarizing 
electric  field.  A  detailed  description  of  the  averaging  procedure,  as  well  as  the  explicit 
primitive  sets  used  can  be  found  elsewhere  [11].  These  basis  sets  have  been  specif¬ 
ically  designed  to  account  for  as  much  electron  correlation  as  is  possible  within  a 
given  size.  For  Li  and  Ne.  the  ano  basis  set  is  of  the  form  ( 14.9.4.3. )/[5.4. 3.2.]  and 
gives  a  total  of  92  functions  for  Li  •  •  •  Ne  systems. 

This  basis  is  capable  of  giv  ing  an  accurate  description  of  Li  (’5)  and  Li  {‘P)  at 
both  the  set  and  correlated  levels  of  approximation.  At  the  rhf  level,  the  calculated 
-.S'  •«-  -/'excitation  energy  is  1 .8444  cV,  which  compares  well  with  the  experimental 
value  of  1.8479  eV  [12],  The  calculated  ionization  potential  is  5.342  eV.  in  com¬ 
parison  with  the  experimental  value  of  5.390  eV. 

The  complete  active  space  (cas)  scr  method  [13]  has  been  shown  to  yield  very 
accurate  results  for  spcctro.scopic  properties  of  low-lying  excited  states  of  BH  [14]. 
AIH  [15].  and  SiH*  [16].  The  cassci  approach  generates  a  full  configuration  in¬ 
teraction  (cO  wavcfunction  for  a  certain  number  of  electrons  which  are  distributed 
among  a  given  number  of  active  orbitals.  The  siz'*  of  the  active  electron  space  and 
the  number  of  electrons  considered  determine  the  length  of  the  full  ci  expansion. 

Long  (  I  vectors  can  be  avoided  if  the  clccuun  correlation  effects  outside  the 
c  \ssci  active  space  arc  included  via  perturbation  theory.  Andersson  et  al.  [17] 
have  recently  published  a  second-order  perturbation  scheme,  known  as  caspt2. 
which  uses  the  (  \ss(  t  wavcfunction  as  the  unperturbed  wavcfunction. 
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If  the  Fock  matrix  elements  coupling  the  inactive  and  active  orbitals  and  the 
active  with  the  virtual  orbitals  are  set  to  zero  (block  diagonal  Fock  matrix),  the 
method  is  called  caspt2d.  If  a  general  (nondiagonal)  Fock  matrix  is  used  for  par¬ 
titioning,  the  method  is  called  caspt2n.  This  perturbation  treatment  with  a  single 
state  multiconfiguration  casscf  function  is  aimed  primarily  at  recovering  the  dy¬ 
namic  correlation.  The  conclusion  from  the  series  of  papers  is  that  caspt2  is  capable 
of  giving  accurate  results  for  many  molecular  properties  with  a  considerably  reduced 
computing  effort  [  1 8],  One  motivation  for  the  present  work  has  been  to  test  the 
usefulness  of  these  perturbation  calculations  for  interacting  systems. 

The  efficiency  of  the  second-order  caspt2  method  is  dependent  on  the  choice 
of  the  CASSCF  reference  function.  For  the  present  purpose,  the  choice  of  the  active 
orbital  space  for  the  atoms  Li  and  Ne  was  always  the  valence  ns  and  np  subspace. 
The  major  part  of  the  CASSCF  calculations  in  the  present  study  have  been  carried 
out  with  the  nine  active  valence  electrons  distributed  among  the  active  orbitals 
built  from  25  and  2p  orbitals  of  Li  and  Ne  atoms. 

Another  method  of  treating  the  dynamic  correlation  is  to  perform  the  multirefer¬ 
ence  Cl  calculation  (MRCi)  [19].  SDCI-MRCI  denotes  all  single  and  double  excitations 
from  the  reference  configurations.  With  many  electrons  to  be  correlated,  an  unlinked 
cluster  correction  must  be  added  to  ci  energies;  in  our  case,  we  have  performed  a 
Davidson  (Q)  correction  [20], 

Size  consistency,  which  is  always  important  in  the  calculation  of  binding  energies, 
is  particularly  important  for  weakly  bound  complexes  [21],  The  majonty  of  studies 
of  interacting  systems  have  employed  mbpt  theory,  which  gives  size-consistent 
energies  [22].  In  this  work,  we  have  applied  caspt2  methods  which  are  explicitly 
size-consistent  when  applied  to  CASSCF  reference  functions.  We  have  also  calculated 
interaction  energies  using  both  the  modified  coupled  pair  functional  approach, 
(MCPF)  [23]  and  the  averaged  coupled-pair  functional  treatment  (acpf)  [24],  The 
later  is  found  to  be  size  consistent  for  the  complexes  to  within  0.01  kcal/mol  [25]. 

In  order  to  determine  the  adequacy  of  the  size-consistency  correction  for  truncated 
MRCI  calculations,  we  performed  calculations  for  the  Li  •  •  •  Ne  complex  {R  =  500 
a.u.)  and  the  isolated  atoms,  and  compared  the  results  with  the  results  of  casscf 
and  ACPF  calculations.  The  difference  between  the  supermolecule  energy  (R  =  500 
a.u.)  and  the  sum  of  the  energies  of  the  two  isolated  atoms  provides  a  direct  measure 
of  the  size-consistency  error  in  each  case.  Table  I  shows  the  size  consistency  errors 
obtained  for  the  Li  •  •  •  Ne  ground  state  at  an  internuclear  separation  of  500  a.u. 
The  CASSCF  for  the  minimal  valence  active  space  is  exactly  size  consistent  while 
MRCI  has  a  size  consistency  error  of  0.2  kcal/mol.  acpf  is  nearly  size-consistent 
with  an  error  of  approximately  0.015  kcal/mol.  Del  Bene  and  Shavitt  [25]  have 
also  reported  very  small  size-consistent  errors  for  charged  complexes  with  the  same 
correlated  electrons  in  each  subunit.  We  concluded  from  this  study  that  while  the 
CI  size-consistency  errors  for  our  complex  are  small,  the  interaction  energies  are 
equally  small  and  it  is  necessary  to  calculate  binding  energies  relative  to  the  500 
a.u.  results.  This  is  especially  true  for  the  perturbation  calculations  where  the  size 
consistency  errors  are  relatively  large. 
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Tabu  I  The  size-consistenc\  error  (in  a  u.)  m  the  casscf.  MRCi.and  acpf  calculations  for 

Li  •  ■  •  Ne  system. 


Method 

2v 

(Li-  •  -NcK 

Ml 

=  500  a.u.) 

Li 

Ne 

A 

CASSCF'* 

-l.t5.‘)7‘)2(X) 

__ 

-7.4.32702 

-128.546490 

0.000001 

CASSCI* 

-136.070121 

-1.35.987752 

-7.432702 

-128.687100 

-0.049681 

CASPT2I)'’ 

-  136.259150 

-136.207454 

-7.432702 

-128.825674 

0.000684 

oaspt2n^ 

-136.258476 

-136.205953 

-7.432702 

-128.828912 

-0.003138 

MRCl" 

-136.258918 

-136.195354 

-7.4.32702 

-128.826552 

0.000336 

MRCl"  +  Q 

-1.36.268336 

-1.36.20.3731 

-7.432702 

-128.836072 

0.000438 

ACPI* 

-1.36.266461 

-136.202067 

-7.432584 

-128.833903 

0.000025 

*  Nine  active  electrons  in  C?  1 1 1 1 10)  active  space. 
'’Nine  active  electrons  in  (4|2|2|0)  active  space. 


A  pilot  series  of  calculations  for  the  ground  state  of  the  Li  •  •  •  Ne  system  was 
performed  with  the  seven  active  electrons  distributed  in  an  active  space  of  (3 1 2 1 2  ]  0). 
where  the  numerical  entries  give  the  number  of  active  orbitals  per  irreducible  rep¬ 
resentation  of  the  symmetry  species  a\,  bi,  b\.  and  03,  respectively.  The  Is 
orbital  for  Li  and  the  Is  and  2s  orbitals  for  Ne  have  been  kept  inactive  in  these 
calculations.  This  gives  rise  to  208  configuration  state  functions  (esFs).  In  the  sub¬ 
sequent  scries  of  calculations,  nine  active  electrons  were  distributed  among  the 
(4 1 2 1 2 10)  active  orbitals  belonging  to  Cz,  symmetry.  In  this  case  the  Is  orbitals  of 
Li  and  Ne  have  been  kept  frozen.  Both  the  X  *2  ground  state  and  A  ^11  excited 
state  were  calculated  with  this  choice  of  subspaces  (inactive,  active,  and  secondary 
orbitals)  which  gives  rise  to  606  configurations  for  the  ground  state  and  588  con¬ 
figurations  for  'll  state.  We  present  here  only  the  results  for  this  larger  active  space. 

CASSCF  calculations  were  performed  for  each  spectroscopic  state  in  order  to  obtain 
the  natural  orbitals.  These  calculations  were  then  followed  by  caspt2  and  MRCl 
calculations  in  order  to  include  the  dynamic  correlation  effects.  Reference  CSFs 
were  chosen  as  those  which  appeared  in  the  CASSCF  wavefunction  with  the  weight 
larger  than  7%. 

The  potential  curves  were  evaluated  by  fitting  spline  functions  to  the  computed 
points.  From  these  potentials  the  vibrational  wavefunctions  and  the  vibrational- 
rotational  energy  levels  were  obtained  by  numerically  solving  the  radial  Schrodinger 
equation  for  a  set  of  rotational  quantum  numbers.  The  spectroscopic  constants 
were  determined  by  a  least-square  fit  of  these  energy  levels.  The  computed  spacings 
between  the  rotational  lines  were  used  to  obtain  the  rotational  constants  and  the 
vibrational  constants  were  derived  from  the  computed  band  origins. 

All  calculations  were  performed  using  the  programs  included  in  the  molcas2 
quantum  chemistry  software  [26]  on  an  IBM-3090  at  the  Cornell  Theory  Center. 

Results  and  Discussion 

The  interaction  energies  for  the  X  ^2  and  A  *11  states  are  listed  in  Tables  II  and 
III,  and  the  corresponding  potential  energy  curves  are  shown  in  Figures  1-4.  A 
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Tabi  E  II  Interaction  energy  (in  mHartree)  for  the  ground  state  A  Li  •  •  •  Ne  system  calculated  by 
dillerent  methods  relative  to  /;(Li  ■  •  •  Ne)  H  R  =  500  a.u. 


R 

CASSCF* 

Configuration  interaction' 

CAS 

pt2d 

pt2n 

MRCI 

MRCI  +  0 

MCPF 

ACPF 

6.0 

2.3494 

1.6167 

1.5024 

1  4838 

HQil 

1.4943 

7.0 

0.9783 

0  5311 

mSSSm 

^91^9 

0.4782 

8.0 

0..37I2 

0.1092 

■is 

^9lliu9 

9uIK  £ 

8.5 

0  2228 

0.0216 

0.0054 

9.0 

0  1318 

-0.0234 

iH 

! 

9.5 

0.0767 

-0.0437 

Buwl  ' 

10.0 

0  0438 

-0.0461 

Won!!  !■ 

-0.0562 

-0.0537 

' 

10.5 

0  0246 

-0.0491 

-0.0537 

-0.0445 

-00531 

-0.0531 

\  1 

II.O 

0.0134 

-0.0448 

-0.0408 

-0.0474 

-00481 

BIa8II 

II  5 

0.0069 

-0.0391 

-0.0357 

-00408 

-00418 

B9S! 

12.0 

0  0034 

-0.0331 

9 

-0.0302 

-0.0.342 

-0.0352 

BotS ' ! 

13.0 

0  0004 

-0  0224 

-0.0204 

-0  0228 

-0.0237 

BIfl/ ' 

140 

--0  0003 

-0  0144 

-0.0132 

-0.0147 

-0.0154 

-0.0146 

15.0 

-0  0004 

-0.0090 

^WitW!!9 

-0.0083 

-0.0092 

-0.0096 

16.0 

-0  0002 

-0.0055 

-0.0059 

-0.0051 

-0  0057 

-0  0059 

170 

-0.0001 

-0.0003 

-0.0032 

-0  0035 

-0.0037 

•  Nine  active  electrons  in  (4 1 2 1 2 1 0)  active  space,  two  frozen  orbitals  and  zero  inactive  orbitals. 


Table  III  Interaction  potential  energies  (in  mHartree)  for  the  .1  ’II  excited  state  of  Li  •  •  •  Ne  system, 
calculated  by  different  methods  relative  to  /;(Li  •  •  •  Ne)  for  R  =  500  a.u. 


R 

cassce' 

a" 

CAS 

pt2d 

pt2n 

MRCI 

MRCI  +  Q 

MCPF 

ACPF 

3.0 

1.7449 

RM 

1.6174 

1.5907 

1.6535 

4.0 

-0.4360 

-1.0418 

-0  4052 

45 

0  2255 

-1.0317 

-0  6407 

-0.8525 

5.0 

^91  ^ 

BkSm 

-0.7458 

5.5 

^91 S 

■^1 

^9^^9 

-0.5851 

-0.658' 

-0.5750 

6.0 

^9e  m 

-04658 

-0.4723 

-0.4383 

6.5 

^91 9 

-0.3597 

-0.3518 

-0..341I 

7.0 

^iMii 

-0.2259 

-0.2721 

-0.2719 

8.0 

^91 9 

^9E!9 

-0.1752 

-0.1803 

9.0 

^91 M 

-0.1261 

-0.1053 

-0  1216 

-0.1168 

-0.1211 

10.0 

-0.0835 

-0.072U 

-0.0819 

11.0 

-0.0565 

-0.0.594 

-0.0490 

-0.0551 

BmS 

12.0 

-0  0366 

-0.0352 

-0.0364 

■b 

-0  0254 

-0  01.52 

-0.0146 

-0.0251 

*  Nine  active  electrons  in  (4 1 2 1 2 1 0)  active  space,  one  frozen  orbital  and  one  inactive  orbital. 
Nine  active  electrons  in  (4 1 2 1 2 10)  active  space,  two  frozen  orbitals  and  zero  inactive  orbitals. 
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Figure  I.  LiNe  doublet  sigma  state.  Energy  is  in  mHartree.  separation  is  in  a.u. 


total  of  1 7  points  were  calculated  for  the  X  *2  state  and  a  total  of  1 6  points  fo:  the 
A  ’n  state  were  calculated.  The  interaction  energies  are  given  relative  to  E(R)  - 
£(x)  at  the  following  levels  of  approximation:  casscf.  caspt2.  mrci,  mrci  +  Q 
(cluster  corrected  MRCI  energies  using  a  multiconfigurational  analogue  of  the  Da¬ 
vidson  correction),  mcpf.  and  acpf. 

The  electronic  structure  of  the  X  ‘2  ground  state  is  dominated  by  the  expected 
SCF  electronic  configuration  Li  (li'2s')Nc(l.s‘2.<;‘2/;.^2/t72/7;),  which  has  a  weight 
of  98%  in  the  casscf  wavefunction.  The  lowest  excited  state  is  A  ‘II  with  an  ex- 


Figurc  2.  LiNe  doublet  sigma  state  Energy  is  in  mHartree.  separation  is  in  a.u 
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Figure  3.  LiNc  doublet  pi  state.  Energy  is  in  mHartrce.  separation  is  in  a.u. 


citation  energy  around  1.73  eV  (acpf).  The  electronic  structure  of  the  A  state 
is  dominated  by  the  electronic  configuration  Li  (is“2/?')  Ne  {Is'2s^2pl2pj2pl), 
which  has  a  weight  of  98%  in  the  CASSCF  wavefunction.  In  the  CASPT2  and  MRCI 
calculations  the  A  *11  state  has  a  relatively  deep  well  of  approximately  225  cm"', 
while  the  X  *2  state  shows  an  order  of  magnitude  smaller  binding  and  a  much 
larger  internuclear  separation.  Two  curves  are  only  negligibly  bound  in  the  CASSCF 
calculation. 

The  CASSCF  results  in  the  ano  basis  show  only  negligible  binding  for  both  elec¬ 
tronic  states.  We  believe  that  this  is  due  to  the  lack  of  dynamic  correlation  in  the 


Figure  4  LiNc  doublet  pi  state  Energy  is  in  mHartree.  separation  is  in  a.u. 
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Table  IV.  The  dipole  moments  for  X  and  A  states  of 
Li  •  •  •  Ne  system  calculated  by  different  methods  (in  a.u.,  positive 
dipole  moments  is  defined  as  Li'Ne*). 


R 

A- =2 

A^n 

CAS* 

MCPF* 

CAS*” 

mcpf’’ 

3.0 

0.3800 

0.3691 

4.0 

— 

— 

0.2526 

0.2421 

4.5 

— 

— 

0.1934 

0.1836 

5.0 

— 

— 

0.1441 

0.1354 

5.5 

— 

— 

0.1055 

0.0979 

6.0 

0.3328 

0.3142 

0.0764 

0.0701 

7.0 

0.1564 

0.1432 

0.0398 

0.0354 

8.0 

0.0702 

0.0605 

0.0209 

0.0180 

8.5 

0.0464 

0.0383 

— 

— 

9.0 

0.0308 

0.0241 

0.01134 

0.0094 

9.5 

0.0206 

0.0150 

— 

— 

10.0 

0.0144 

0.0096 

0.0065 

0.0052 

10.5 

0.0097 

0.0053 

— 

— 

II.O 

0  0070 

0.0038 

0.0039 

0.0031 

11.5 

0.0053 

0.0025 

12.0 

0.0042 

0.0018 

0.00249 

0.0019 

13.0 

0.0031 

0.0011 

— 

— 

14.0 

0.0023 

0.0008 

0.00118 

0.0009 

15.0 

0  0021 

0.0008 

— 

— 

16.0 

0.0020 

0.0007 

— 

17.0 

0.0019 

0.0007 

— 

— 

500.0 

0.0018 

0.0007 

-0.0001 

-0.0002 

*  Nine  active  electrons  in  (4|2i2|0)  active  space  with  two  frozen 
orbitals. 

’’Nine  active  electrons  in  (4|2|2|0)  active  space  with  one  frozen 
orbital  and  one  inactive  orbital. 


CASSCF  wavefunction  for  the  active  space  chosen.  This  is  borne  out  by  the  obser¬ 
vation  that  second-order  perturbation  correction  of  this  CASSCF  wavefunction 
(CASPT2)  as  well  as  mrci  calculations  yields  bound  states. 

The  calculated  potential  energy  curves  for  the  X  "Z  ground  state  are  shown  in 
Figure  1  (caspt)  and  Figure  2  (MRCI).  The  results  of  the  present  calculations  can 
be  compared  with  the  experimental  results  of  Dehmer  and  Wharton  [27]  based  on 
absolute  total  scattering  cross  section  measurements.  For  this  state,  the  experimen¬ 
tally  derived  potential  has  a  well  depth  of  ==9.7  cm"'  at  !=5.0  A,  while  the  calculated 
well  depth  is  15.3  cm"'  (mrci  +  Q,  acpf)  at  5.3  A.  The  calculations  reported  here 
may  also  be  compared  with  the  pseudopotential  calculations  reported  in  Ref  [7], 
which  find  a  substantially  less  attractive  well  of  4.5  cm"'  at  5.8  A. 

For  the  A  -fl  state,  our  caspt2  and  mrci  potential  energy  curves  are  shown  in 
Figures  3  and  4.  The  mrci  +  Q  and  acpf  results  give  a  minimum  of  214  cm"'  at 
2.26  A  in  excellent  agreement  with  the  experimental  results  of  224  cm"'  at  2.6  A 
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Table  V.  Molecular  constants  for  the  X  ^2  stateof  Li  •  •  •  Ne  system. 


Method 

R,{k) 

iSrleV) 

Do  (eV) 

CASSCF* 

=8.0 

caspt2d* 

5.2905 

0.00147 

0.00142 

caspt2n* 

5.2894 

0.00i52 

0.00145 

MRCl* 

5  4062 

0.00121 

0.00115 

MRCl‘  +  Q 

5.2721 

0.00199 

0.00177 

MCPF* 

5.3889 

0.00196 

0.00175 

ACRE* 

5.2836 

0.00196 

0.00174 

Theory  [7] 

5.82 

0.000557 

Exp  [27] 

5.116 

0.001089 

4.979 

0001191 

*  Nine  active  electrons  in  (4 1 2 1 2 10)  active  space  and  two  frozen  orbitals. 


[28].  Similar  values  were  obtained  for  the  other  methods  used.  The  previously 
reported  pseudopotential  results  gave  a  somewhat  flatter  curve  with  a  well  depth 
of  only  166  cm“'. 

The  dipole  moments  were  calculated  at  a  number  of  geometries  for  both  states 
and  are  listed  in  Table  IV.  These  dipole  moments  are  all  in  the  direction  Li'Ne"^ 
and  are  quite  small  at  the  equilibrium  separations.  Tables  V  and  VI  list  the  calculated 
spectroscopic  constants  for  both  states. 

Several  factors  were  neglected  in  these  calculations  which  may  affect  the  inter¬ 
action  energy  by  a  small  but  not  necessarily  negligible  amount.  The  most  important 
of  these  is  the  neglect  of  the  core  correlation  effects,  including  core-valence  cor¬ 
relation  results  in  the  contraction  of  the  atomic  valence  shell  and  the  degradation 
of  the  dissociation  energy  by  about  10%  [29].  The  next  most  important  factor  is 


Table  VI.  Molecular  constants  for  the  A  ^11  state  of  Li  •  •  •  Ne  syst''.n. 


Method 

RAk) 

f),(cV) 

£>o(eV) 

0),  (cm  ') 

w,X,  (cm"') 

fi,  (cm"') 

CASSCF* 

=4.8 

caspt2d* 

2.2603 

0.0291 

137 

— 

0  5469 

CASPr2N* 

2.2515 

150 

— 

MRCl” 

2.2716 

103 

-18.18 

MRCl”  +  Q 

2.2628 

139 

-18.44 

0.643 

MCPF* 

2.2335 

188 

-18.35 

0.5887 

ACPF* 

2.2617 

131 

-18.21 

0.5311 

Theory  [7] 

2.116 

Exp  [27] 

*  Nine  active  electrons  in  (4 1 2 1 2 10)  space  with  one  frozen  and  one  inactive  orbital. 
’’  Nine  active  electrons  in  (4 1 2 1 2 1 0)  space  with  two  frozen  orbitals. 
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the  neglect  of  the  polarization  of  the  atomic  cores.  The  effect  of  core  polarization 
on  the  interaction  potential  can  be  related  to  polarizability  [30].  The  polarization 
of  the  Li  core  leads  to  the  decrease  of  polarizability  and  to  a  less  attractne  interaction 
potential  for  Li  •  •  •  Ne.  In  contrast,  the  polarization  of  the  Ne  core  would  lead  to 
an  increased  polarizability  for  Ne  and  thus  a  more  attractive  potential. 

Conclusion 

Ah  initio  potential  curves  have  been  determined  for  the  ground  X  "2  and  the 
excited  A  ’ll  states  of  the  Li  •  •  •  Ne  complex  by  the  casscf.  caSPI  2,  and  Cl  methods. 
The  calculations  correlate  all  valence  electrons  in  active  space  build  from  the  valence 
orbitals  of  Li  and  Ne.  These  calculations  gave  a  well  depth  of  2 14  cm  '  (acpf)  for 
the  A  ’ll  state,  in  excellent  agreement  with  the  224  cm  '  deduced  from  experimental 
data  and  much  better  than  the  pseudopotential  results  of  166  cm  '  [7].  The  much 
less  attractive  X  ’2  state  is  predicted  to  be  bound  by  only  1 5.8  cm  ‘  with  an  equi¬ 
librium  distance  of  5.28  A  using  the  caSPt2d  method,  while  ail  other  methods 
predict  a  much  weaker  interaction  for  this  state.  We  conclude  from  these  results 
that  the  ground  state  of  Li  •  •  •  Ne  is  at  best  only  very  weakly  bound. 

We  have  also  tried  to  illustrate  the  capabilities  of  the  second  order  perturbation 
method  based  on  casscf  reference  function  and  conclude  that  either  caspt2  method 
shows  promise  of  yielding  satisfactory  results  in  comparison  with  the  MRCi  results. 
For  the  A  ’ll  state,  the  Caspt2n  gave  a  well  depth  of 234  cm  '.in  excellent  agreement 
with  the  224  cm"'  derived  from  experiment.  However,  the  caspt2d  value  of  258 
cm' '  is  a  little  bit  too  high.  For  the  X  ’2  state,  the  caspt2  values  of  the  well  depth 
of  1 1  and  12  cm  '  are  closer  to  the  experimental  value  of  9.7  cm  '  than  the  mrci 
+  Q  value  of  15.3  cm"'.  Thus,  we  conclude  that  CASPT2  methods  are  potentially 
capable  of  giving  accurate  results  for  the  interaction  potential  for  the  weak  van  der 
Waals  systems  with  a  considerably  reduced  computational  time. 
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Abstract 

The  systematic  quantum-mechanical  investigation  of  the  stable  geometnes  and  some  energetic  char- 
actenstics  of  the  anionic  silver  clusters  up  to  the  hexamer  are  performed  by  an  all-electron  spin-density 
approach  with  nonlocal  corrections  included.  Calculated  vertical  detachment  energies  are  in  good  agree¬ 
ment  with  the  expenmental  ones  without  any  scaling  procedure.  The  fragmentation  energy  for  the 
channel  Ag^  -►  Ag;.|  +  Ag  shows  very  pronounced  oscillations  for  even-odd  n.  The  obtained  stable 
geometnes  for  Ag„  («  =  2-4)  are  in  agreement  with  previous  calculations  by  Bauschlichei  ct  al.  [26,27] 
To  the  contrary  of  the  case  of  tetramers  and  pentamers  we  found  that  anionic  and  neutral  hexamers 
have  different  stable  geometnes.  ©  1993  John  Wiley  &  Sons.  Inc 

Introduction 

The  past  decade  is  characterized  by  the  intensive  experimental  and  theoretical 
investigations  of  different  kinds  of  clusters  [  1  -3  ] .  One  of  the  main  reasons  for  this 
great  interest  in  cluster  studies  is  their  evident  importance  in  filling  the  gap  between 
the  behavior  of  molecules  and  of  solids.  In  other  words,  it  is  expected  that  the  study 
of  the  properties  of  clusters  will  allow  us  to  understand  the  manner  in  which  the 
microscopic  properties  evolve  towards  the  macroscopic  ones. 

Therefore  it  is  particularly  important  to  study  these  properties  as  a  function  of 
the  cluster  size,  usually  refered  to  as  “size  effects.”  Jortner  et  al.  [4]  have  classified 
size  effects  according  to  their  nature  as  follows:  (a)  quantum  or  energetic;  (b) 
thermodynamic;  (c)  dynamic;  and  (d)  chemical  size  effects. 

The  main  interest  of  the  present  article  will  lie  on  energetic  size  effects.  Among 
these  the  most  interesting  energetic  size  effects  are  those  revealed  by  their  photo¬ 
electron  spectra  (PES).  The  PES  allows  to  obtain  several  cluster  characteristics  such 
as  its  ionization  potentials  (IP),  electron  affinities  (Ea),  vertical  detachment  energies 
(VDE),  and  electronic  transition  energies.  These  characteristics  often  show  a  very 
pronounced  dependence  upon  the  cluster  size.  There  are  extensive  experimental 
investigations  on  alkali  metal  clusters  [5-6]  demonstrating  such  behavior.  A  sub¬ 
stantial  number  of  accurate  theoretical  investigations  have  been  performed  by  Kou- 
tecky  and  co-workers  [7-13]  on  neutral  and  charged  Li„,  Na„,  and  K„  clusters. 
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Recently  published  measurements  of  iPs  for  Li„  clusters  [14]  show  very  good  agree¬ 
ment  with  the  theoretical  values  [  7  ] . 

The  transition  metal  clusters  have  also  received  a  large  amount  of  experimental 
interest  [15-23],  in  particular  in  the  cases  of  silver  and  copper  clusters.  The  rica- 
surement  ofVDEs  have  demonstrated  a  well-defined  even-odd  oscillating  behavior 
for  Cu,;^  and  Ag^  ionic  clusters.  This  means  that  the  \  DEs  reach  alternatively  local 
minima  for  even  «  and  local  maxima  foi  odd  n  clusters.  On  the  contrary  the  vde 
values  for  Pd;^  anionic  clusters  show  only  a  smoothy  increasing  behavior  as  a  func¬ 
tion  of  cluster  size  [18].  In  the  specific  case  of  Ag^  which  will  concern  us  here,  the 
vde  values  have  been  measured  in  two  laboratories  [  1 8, 1 9  ]  using  diverse  ion  source 
conditions.  In  spite  of  that,  their  vde  results  agree  within  the  experimental  error 
limits. 

As  it  concerns  with  the  theoretical  investigations  on  d-shell  metal  clusters,  they 
were  at  first  not  so  abundant  as  on  the  alkali  metal  clusters.  In  recent  years,  however, 
many  precise  ab  initio  calculations  of  the  coinage  metal  clusters  (Cu„,  Ag„,  and 
Au„ )  have  been  performed  by  several  research  groups  [ 24-35  ] ,  including  up  to  the 
hexamers  [34].  We  have  nevertheless  not  found  any  publications  devoted  to  the 
systematic  theoretical  calculation  of  anionic  clusters.  The  only  exception,  to  our 
best  knowledge,  is  the  investigations  of  Akeby  et  al.  [35]  on  Cu;;  (n  :£  10).  In  their 
article,  however,  the  one-electron  model  approach  was  used,  which  lacks  the  ac¬ 
curacy  of  most  of  the  other  studies  [24-34]. 

For  smaller  coinage  metal  anion  clusters  (dimers  and  trimers),  very  precise  cal¬ 
culations  were  attempted  by  Bauschlicher  et  al.  [26].  Tetramer  anions  were  further 
studied  rigorously  by  this  group  [27],  but  for  anionic  pentamers  the  vdes  were 
calculated  using  the  optimal  geometry  of  the  neutral  clusters  [27].  A  similar  ap¬ 
proach  was  followed  for  tetramer  vdes  in  [33].  We  should  point  out  that  in  all  of 
these  calculations  [26,27,33],  the  vdes  were  obtained  only  after  introducing  scaling 
factors  (1.3-1.35),  without  which  the  agreement  with  the  experimental  data  was 
rather  poor.  Even  with  the  scaling  factor,  good  agreement  was  not  possible  in  all 
cases  [33]. 

The  goal  of  the  present  article  is  to  obtain  for  anionic  silver  clusters  up  to  the 
hexamers,  the  stable  geometries  and  the  binding  energies  using  a  sufficiently  precise 
quantum  chemical  method.  In  particular,  we  aim  to  obtain  the  vde  of  each  silver 
anionic  cluster  without  using  any  scaling  factors.  The  main  reason  for  introducing 
a  scaling  factor  is  to  correct  for  an  insufficient  account  of  atomic  correlation  effects 
[33].  To  avoid  this  we  have  chosen  to  use  the  all-electron  spin  density  (lsd)  ap¬ 
proach  with  nonlocal  corrections  included  [36,37].  With  this  approach  we  expect 
the  atoms  to  lie  treated  at  similar  level  of  accuracy  as  the  molecules. 

All  calculations  were  performed  on  a  Cray  Y-MP4/432  computer  using  the 
IXJauss  program.  As  it  has  been  shown  by  Fantucci  et  al.  [10]  for  Li„  and  Na„ 
clusters,  the  nonlocal  version  of  the  electron  density-functional  method  yields  results 
close  to  those  obtained  by  hf-ci  calculations. 

Computational  Method 

The  systematic  study  of  size  effects  on  anionic  clusters  implies  the  calculations 
of  increasingly  larger  oligomers.  In  the  particular  and  quite  interesting  case  of  tran- 
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sition  metal  clusters  the  all-electron  ab  initio  calculations  become  prohibitly  time- 
consuming  because  of  the  enormous  quantities  of  inner-shell  electrons  that  these 
systems  imply. 

The  neglect  of  interactions  between  distant  atoms  and  the  effective  core  potentials 
(ECP)  are  among  the  most  common  cut-offs  to  reduce  time  and  memory  in  a 
computer.  ECPs  render  in  many  cases  results  of  accuracy  comparable  to  all-electron 
ab  initio  calculations.  Unfortunately,  for  metallic  clusters  it  is  necessary  to  include 
not  only  a  large  number  of  configuration  states  to  obtain  significant  core-core, 
core-valence,  and  valence-valence  correlation  effects  but  also,  scaling  factors  in 
order  to  account  for  the  deficiency  when  computing  the  atomic  electron  affinities 
and  ionization  potentials  [26,27,33].  Because  of  this  fact  and  the  great  number  of 
calculations  required  to  find  out  the  stable  geometries  for  anionic  silver  clusters  we 
resort  to  a  less  computationally  demanding  method,  namely,  the  local  spin  density 
(LSD)  approximation  with  nonlocal  corrections. 

The  LSD  approximation  has  been  developed  within  the  Kohn-Sham  (ks)  scheme 
where  the  electron  density  represents  the  main  variable  of  the  theory  and  the  ex¬ 
change-correlation  effects  are  taken  into  consideration  in  an  approximative  way.. 
It  is  generally  agreed  that  the  Vosko.  Wilk,  and  Nusair  (  vwn  )  approximate  expres¬ 
sion  [38]  for  the  e.xchange-correlation  potential  should  be  introduced  in  the  total 
energy  functional  in  order  to  constitute  the  l.SD  level  of  theory.  Nevertheless,  because 
of  the  high  inhomogeneities  and  strong  correlation  produced  by  the  interacting 
electron  clouds  of  a  metallic  cluster,  it  is  necessary  for  the  higher  precision  to  consider 
a  nonlocal  correction  to  the  vwn  potential.  Among  several  nonlocal  exchange- 
correlation  functionals  the  one  proposed  by  Peeke  and  Perdew  [39-41]  (bp)  is  the 
most  attractive  to  work  with  because  of  its  good  results  and  computational  efficiency 
(see  the  last  review  by  Ziegler  [42]). 

Here,  we  shall  use  the  DGauss  computational  approach  [36]  already  implemented 
m  the  UNICHEM  package  to  solve  the  LSD-KS  equations.  This  approach,  in  contrast 
to  the  Hartree-Fock  (HK)  method,  demands  not  only  basis  sets  for  the  molecular 
orbitals  but  also  auxiliary  gaussian  basis  sets  to  expand  the  electron  density,  ex¬ 
change-correlation  potential,  and  exchange-correlation  energy.  The  molecular  or¬ 
bital  and  auxiliary  basis  functions  to  use  in  the  computation  will  be  those  specifically 
optimized  within  the  LSD  approximation  [  43,44  ]  for  silver  and  denoted  by  ( 63332 1  / 
5321 1/531  )/[6/5/3)  and  ( 10/5/5).  respectively.  The  first  set  is  equivalent  to  a 
double-zeta  valence  plus  polarization  basis  set  and,  the  second — without  equivalence 
inside  the  HF  scheme — is  known  as  the  auxiliary-1  basis  set  (Al).  Both  of 
them  have  been  carefully  checked  to  minimize  basis  set  superposition  errors 
(BSSE)  [43,44]. 

As  it  has  been  shown  by  Andzelm  and  Wimmer  [36]  the  inclusion  of  nonlocal 
corrections  m  this  computation  scheme  allows  to  predict  bond  dissociation  energies, 
within  0. 1  cV.  This  indicates  that  the  B.SSE  is  small. 

The  nonlocal  bp  corrections  can  be  calculated  by  two  ways;  in  a  nonself-consistent 
and  in  a  self-consistent  manner.  We  shall  use  both  approaches. 

The  DGauss  approach  like  any  other  computational  implementation  has  some 
shortcomings  too.  One  of  the  most  important  is,  up  to  the  moment,  the  lack  of 
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inclusion  of  relativistic  corrections  in  the  one-electron  lsd-ks  equations.  Even 
when  these  effects  are  not  included  at  this  stage,  there  exists  evidence  [  37  ]  that  lsd 
provides  good  results  for  bond  lengths  and  angles  of  molecules  containing  metallic 
atoms  such  as  silver.  Balasubramanian  [45]  has  analyzed  the  influence  of  relativistic 
effects  on  transition  metal  cluster  reaching  the  conclusion  that  such  effects  become 
quite  important  for  heavier  elements  like  Au,  Pt,  etc.,  see  also  [46],  but  do  not 
lead  to  drastic  changes  for  silver.  We  also  believe  that  these  kind  of  contributions 
should  be  further  reduced  by  cancellation  when  energy  differences  are  taken  into 
account. 

In  order  to  find  out  the  most  stable  geometries,  the  method  already  described 
above  was  applied  to  a  number  of  different  molecular  conformations.  Only  sym¬ 
metrical  conformations  were  taken  into  account  for  the  anionic  clusters  Ag;^  (n  ^ 
6).  The  calculations  were  performed  on  the  CRAY  YMP4/432  with  the  DGSCA 
supercomputing  facility. 


Results  and  Discussion 

As  mentioned  in  the  Introduction,  we  must  optimize  the  geometry  of  the  Agii 
stable  clusters  before  calculating  the  vdes  and  other  energetic  characteristics.  We 
have  done  this  systematically  for  anionic  silver  dimers,  trimers ...  up  to  hexamers 
in  order  to  study  quantum  size  effects.  Dimers,  trimers,  and  tetramers,  have  been 


Table  I  Companson  of  calculated  geomctnes  and  some  energetic  charactenstics  of  silver  anions 

Ag;. 


a)  atom  and  dimer 


n 

r,(A) 

Calculation 

O.(eV) 

EA  (eV) 

Calculation 

Experiment 

Calculation 

Experiment 

Our  (261 

Our 

[26] 

Our 

I26p 

1 

1.183 

1.26 

1.302  +  0.007  [47] 

2 

2.850  2.813 

1.307 

1.12 

1.39  +  0.02’ 

1.013 

0.97 

1.028  ±001’ 

b)  tnmer  and  tetramer 

ra(A) 

« (deg.) 

Calculation 

Calculation 

n 

Geometry 

Our 

Our  [27] 

3 

linear 

2.760 

2.754  [261 

4 

rhombic' 

2.849 

2  852  [27] 

61.2  63.7 

•  See  Ref.  [4]  in  [26]. 
Scaled  by  a  factor  1 .3. 
‘  See  Figure  1 . 
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already  studied  by  Bauschlicher  et  al.  [26,27  ],  so  in  Table  I  the  detailed  comparison 
of  their  results  with  ours  are  given.  We  see  from  Table  1  that  the  agreement  in  the 
stable  geometries  is  rather  good.  Our  unsealed  energies  for  the  dimer  are  in  letter 
agreement  with  the  experimental  data  than  the  results  in  [26].  For  trimer  and 
tetramer  anions  we  compare  only  geometrical  parameters  for  stable  geometries,  we 
can  not  compare  atomization  energies  because  they  have  been  calculated  in  [27] 
only  for  neutral  clusters. 

In  Table  II  and  Figure  1  we  present  the  results  of  our  calculations  obtained  by 
including  nonlocal  corrections  in  a  self-consistent  manner.  The  difference  in  the 
geometry  and  the  energy  as  compared  to  the  calculations  with  nonself-consistent 
inclusion  of  the  nonlocal  corrections  is  rather  small;  for  the  energy  it  is  in  the 
interval  0.02  to  0.09  eV.  The  notation  for  the  cluster  Ag^  (in  Table  II )  represents 
the  neutral  cluster  whose  geometry  is  fixed  to  be  the  same  as  that  of  the  stable  anion 
Ag^.  Its  energy  E„(,Ag'„)  is  necessarily  higher  than  the  ground  state  energy  of  the 
optimized  geometry  of  the  neutral  cluster  £„(  Ag„).  This  is  the  reason  why  the  VDE 
calculated  as  the  difference: 


Table  II.  Ground-slate  energies  and  vdes  for  the  anionic  isomers  Ag„.  n  =  2-6. 


VDE.  eV 

n 

GeometO'* 

£■»  (Ag;).  a.u. 

li„  (Ag'„)^  a.u. 

Our  calc. 

Experimental 

1 

Linear.  D,(, 

-10399.989439 

-10399.949723 

1  08 

1.06  +002  [19] 

3 

Linear.  D,h 

-15600.006445 

-15599.926529 

2.18 

2.43  ±0.01  (19) 

2.43(18] 

4 

Rhombic.  Dji, 

-20799.997814 
(AT-  =  0) 

-20799.935275 

1.70 

Linear.  0.,^ 

-20799.996715 

-20799.918354 

2.13 

1.65  +  0  05  (19] 

(Mi  =  0.03eV) 

1.71  118] 

Planar. 

-20799.994353 
(Mi  =  0.09eV) 

-20799.927857 

1.81 

5 

Trapezoidal.  C:, 

-26000.010334 
(Mi  =  0) 

-25999.935604 

2.03 

2.11  ±0.05(19] 

Linear.  D,h 

-25999.999273 
(Mi  =  0  30cV) 

-25999.898926 

2.73 

2.1  [18] 

6 

Tripyramidal.  C;, 

-31200.012922 
(Mi  -  0) 

-31199.936451 

2  08 

Trigonal.  Djh  -*  Djh 

-31200  010615 
(A/i'  =  006cV) 

-31199.957999 

1.43 

2.06  ±0  05  [19] 

Quadratic  bipyramid.  O*  -*  Dj* 

-31200.008.368 
(A/;  =  0.12eV) 

-31 199.916271 

2.51 

2.05  [18] 

Pentagonal  pyramid.  Cj, 

-31200  007207 
(A/f  =  O.I6cV) 

-31 199.9491 12 

1  58 

“  Sec  Figure  1  and  Table  1 . 

The  Ag„  designates  the  neutral  cluster  whose  geometry  is  the  same  as  the  stable  geometry  of  the 
anion  Ag„. 
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o)  n  =4 
Rhombic  ,  Djh 


QiSI.Z* 


b)  n  =  5 

Tropezoidol  ,  Czv 


c)  n  =  6 

Tripyromidol  ,  C2V 


284 


Lmeor  ,  0„s  Plonof »  Dad 


Linear  ,  0*11 


Trigonol  ,  Djh— 


2n  iti 


Quadratic 

bipyramid  ,  0(,-*-D4(,  Penfagonol  pyromid  ,  Cjv 


Figure  I .  The  most  stable  geometries  for  the  anionic  isomers  Ag; .  n  =  4-6;  all  distances 

in  angstroms. 


VDE  =  -[£„(  Ag;)-£„(Ag;,)]  (1) 

is  always  larger  than  the  adiabatic  electron  affinity  (ea). 

The  stable  geometry  for  AgJ  is  linear,  as  follows  from  Table  III,  and  this  agrees 
with  the  results  of  Bauschlicher  et  al.  [26J  as  well  as  with  the  data  obtained  for 
NaJ  by  Bonacic-Koutecky  et  al.  [1 1].  According  to  our  results  the  linear  trimer  is 
1.31  eV  more  stable  than  the  equilateral  triangle  one. 

For  the  tetramer  anionic  clusters  three  isomers  have  almost  equal  energies,  and 
we  therefore  need  to  analyze  other  properties  to  determine  the  most  stable  geometry. 
Among  such  properties,  we  choose  the  vde,  which  is  highly  dependent  on  the 
cluster  geometry  [12].  According  to  Table  II  the  coincidence  between  the  calculated 
value  of  the  tetramer  vde  and  the  experimental  is  much  better  for  the  rhombic 
geometry.  But  the  value  of  the  vde  for  the  D2d  planar  geometry  is  also  not  very  far 
from  the  experimental  one.  The  energy  of  the  D2d  isomer  is  only  0.09  eV  higher 
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I  Mil  I  III.  IX’pcinlencc  of  the  grnunil-&tatc  cnciKy  and  the  inlcralomic  distances  from  Ihc  angle  for 

silver  anion  AgJ. 


t>c(imcir>' 

It  (cleg.) 

/„(A) 

Relative 
energy.  cV 

,0 

IHO 

2,76 

om 

150 

2.78 

0.052 

0  0 

120 

2.80 

0.278 

90 

2.81 

0.634 

60 

2.88 

1.313 

than  the  energy  of  the  D2h  isomer.  The  experiments  with  carbon  anionic  clusters 
[48,49]  indicated  the  existence  of  more  than  one  isomer  with  the  same  number  of 
atoms.  Different  isomers  are  not  necessarily  in  the  thermodynamic  equilibrium. 
So  for  «  =  4  the  both  planar  geometries  Djh  and  D2d  can  be  present  in  the  exper¬ 
iments  [18,19];  the  probability  for  the  formation  of  linear  isomers  AgJ  is 
much  lower. 

Nothing  has  apparently  been  published  about  the  geometry  of  the  anion  clusters 
AgJ  with  «  >  4.  As  a  first  approach  we  may  expect  that  for  heavy  atom  clusters 
with  n  >  4  the  anion  geometry  must  not  differ  considerably  from  the  neutral  one 
[27,33].  We  have  studied  here  several  geometries  for  pentamer  and  hexamer  anions. 
Some  of  them  were  suggested  by  the  results  of  Bonacic-Koutccky  et  al.  on  sodium 
clusters  (1 1-13]  and  by  those  of  Refs.  [27]  and  [34]  on  neutral  silver  clusters.  The 
most  stable  isomers  in  our  calculations  on  each  investigated  anion  cluster  for  n  - 
4-6  are  presented  in  Figure  1 .  For  n  =  6  we  have  presented  four  isomers  as  their 
energies  are  relatively  close  to  each  other. 

For  /I  =  5  the  trapezoidal  planar  geometry  is  more  stable  than  the  linear  by  0.3 
eV.  The  poor  agreement  of  the  VDE  calculated  for  the  linear  geometry  with  the 
exfieri  mental  value  is  an  additional  reason  to  assume  the  trapezoidal  planar  isomers 
of  AgJ  are  produced  with  much  hi^er  probability  than  the  linear  isomers. 

Notice  that  the  stable  geometry  for  the  anionic  pentamer  found  in  our  study  has 
the  same  symmetry  as  that  found  in  Ref.  [27]  for  the  neutral  pentamer.  A  similar 
situation  takes  place  for  tetramer  [27]. 

It  is  important  to  point  out  that  for  sodium  anionic  clusters  it  has  been  proposed 
[11]  that  the  Na  J  linear  geometry  is  responsible  for  the  photoelectron  detachment 
spectra  and  that  for  Na  J  both  the  planar  and  linear  anionic  isomers  may  contribute 
to  the  recorded  spectrum.  The  reason  for  this  structural  difference  between  AgJ 
and  Ag  J ,  on  the  one  hand,  and  Na  J  and  Na  J,  on  the  other,  is  probably  due  to  the 
interaction  between  the  d-subshell  and  the  valence  shell.  Furthermore,  the  relaxation 
of  the  closed  d-subshells  in  the  silver  metal  clusters  during  their  formation  could 
also  be  partially  responsible  for  these  differences.  This  has  been  remarked  before 
[SO]  in  the  case  of  copper.  This  problem  shall  be  a  subject  of  future  investigation. 

According  to  Table  II  four  structures  have  close  energies  for  «  =  6.  The  most 
stable  structure  has  a  tripyramidal  geometry  with  symmetry  €2*.  This  kind  of  ge- 
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ometry  was  found  to  be  the  most  probable  for  the  neutral  lithium  hexamer  [7], 
On  the  contrary,  for  the  neutral  silver  hexamer  the  most  stable  isomer  according 
to  Ref  [34]  has  the  geometry  of  a  pentagonal  pyramid.  In  our  case  of  anionic  silver 
hexamers  the  pentagonal  pyramid  geometry  is  0.16  eV  less  stable  than  the  tripyr- 
amidal  geometry  (see  Table  II).  For  the  most  stable  tripyramidal  structure,  there 
is  very  good  agreement  with  the  experimental  vde.  This  indicates  that  the  tripyr¬ 
amidal  structure  is  the  most  favorable  structure  for  contributing  to  the  measured 
PES  [18,19]  among  those  considered  in  our  study.  But  we  can  not  exclude  the 
possibility  that  the  other  three  isomers  also  contribute  to  the  measured  spectrum, 
because  the  higher  value  of  the  vde  for  quadratic  bipyramid  can  be  compensated 
by  the  lower  values  of  the  vde  for  the  two  others. 

We  also  calculated  the  hexagonal  ring  geometry  which  was  found  the  most  stable 
for  the  anionic  gold  hexamer  [  5 1  ] .  In  the  case  of  the  silver  it  is  much  less  stable  in 
comparison  with  the  other  four  structures  (AE  =  0.9  eV)  and  has  not  such  good 
agreement  with  the  experimental  value  of  the  vde  as  the  tripyramidal  structure. 

We  conclude  from  the  results  of  Table  II  that  there  is  very  good  agreement 
between  the  experimental  values  of  the  vdes  and  our  calculations  (we  have  not 
such  good  agreement  only  for  the  trimer).  We  remark  that  this  agreement  is  achieved 
without  any  resource  to  scaling  procedures.  This  implies  that  errors  due  to  the 
approximate  account  of  the  electron  correlation  effects  as  well  as  to  the  relativistic 
corrections,  which  are  not  included  in  our  calculations,  are  canceled  out  to  a  great 
degree  when  the  difference  of  formula  ( 1 )  is  computed. 

The  important  aspect  of  our  vdes  is  that  they  reproduce  the  even-odd  oscillating 
behavior  (local  minima  for  even  «,  local  maxima  for  odd  n)  observed  in  the  ex¬ 
periments  [18,19]. 

In  Figure  2  we  present  the  visual  characteristics  of  the  anion  cluster  stabilities 
Cain)  and  ei(«)  defined  as  follows.  The  binding  energy  per  atom  £„  is  defined  by 
the  expression 

ca(«)  =  -  ^[^^(Ag')  -  («  -  !)£■<,( Ag)  -  ^^(Ag”)]  (2) 

the  fragmentation  energy  ei(/i)  related  to  the  reaction: 

Ag;;  ->  Ag;.,  +  Ag  (3) 

is  equal  to: 

e,(«)  =  -[£„(Ag;)  -  £„(Ag;_,)  -  £<,(Ag)]  (4) 

For  Ca(n)  Figure  2  shows  a  sharp  increase  at  the  transition  from  AgJ  to  AgJ  and 
a  slow  increase  with  cluster  size  henceforth.  Such  behavior  is  consistent  with  the 
dependence  of  e^in)  obtained  in  Ref.  [35]  for  Cu^  and  in  Ref.  [8]  for  Li^.  For 
neutral  clusters  in  contrast  with  the  ionic  behavior,  the  Ca(n)  does  not  increase 
when  it  passes  from  the  dimer  to  the  trimer  [27]  and  it  begins  to  increase  slowly 
beyond  the  latter.  For  the  neutral  clusters  of  the  lighter  metals  ( lithium  and  sodium) 
there  even  appears  a  decreasing  at  the  dimer-trimer  transition  [10].  On  the  other 
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E.eV 


Figure  2.  The  dependence  of  the  binding  energy  per  atom  ( c^)  and  the  atomic  fragmen¬ 
tation  energy  (ci)  on  the  number  of  atoms  in  silver  anionic  clusters. 


hand  the  pronounced  even-odd  alternation  obtained  for  the  fragmentation  energy 
of  silver  anions  (see  the  behavior  of  ci  in  Fig.  2)  has  also  been  revealed  for  cations 
such  as  Na^  [52]  and  Li^  [8]. 


Conclusions 

From  the  result  obtained  we  can  conclude  that  the  lsd  method  with  nonlocal 
corrections  included  (in  the  EXjauss  program  realization)  is  quite  appropriate  for 
the  calculation  of  d-subshell  metal  anionic  clusters.  It  is  advantageous  in  scanning 
the  stable  geometry  and  in  computing  the  pes.  In  particular,  the  application  of  this 
method  allows  to  obtain  the  experimental  vdes  without  any  scaling  factors. 

The  reason  why  the  stable  geometry  for  the  anionic  hexamer  Age  has  different 
symmetry  than  that  for  the  neutral  hexamer  Ag^,  although  for  tetramers  and  pen- 
tamers  the  anionic  and  neutral  clusters  have  the  same  symmetry,  requires  special 
investigation. 
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Abstract 

A  linear  combination  of  atomic  orbitals  (lcao)  of  the  Slater  type  is  used  in  a  variational  treatment 
of  the  HeH^*  ion  to  achieve  excellent  results  for  the  ground  state  energy  of  this  heteronuclear  diatomic 
system.  As  in  our  recent  treatment  of  H2 ,  we  use  orbitals  with  identical  screening  constants  but  with 
increasing  principal  quantum  numbers  and  angular  momentum.  This  strategy  was  feasible  because  of 
our  ability  to  accurately  evaluate  all  overlap  integrals.  Unlike  even  tempered  Gaussian-type  lcao,  our 
results  become  more  accurate  at  large  interatomic  separations.  Using  two  different  screening  constants 
(one  type  associated  with  each  atom)  proved  to  be  unnecessary,  £>  1993  John  Wiley  &  Sons.  Inc 


Introduction 

Basis  set  methods  in  electronic  structure  calculations  have  been  shown  by  Wells 
and  Wilson  [1]  to  give  excellent  results  for  simple  molecules  such  as  Hj  and  HeH^^. 
They  employed  an  even-tempered  basis  set  of  Gaussian-type  orbitals  (GTOs).  Our 
interest  is  in  using  the  more  physical  Slater-type  orbitals  (stos)  so  as  to  develop 
strategies  for  nonlinear  molecules.  Steinborn  and  Weniger  [2]  used  their  "B"  func¬ 
tions  of  spherical  symmetry  to  approximate  the  ground  state  energy  of  HeH^^  (the 
“5”  functions  span  the  same  space  as  stos).  They  suggested  that  better  results 
could  be  achieved  if  orbitals  of  higher  angular  momentum  and  various  screening 
constants  were  used.  We  found  that  excellent  results  are  indeed  achieved  by  using 
orbitals  of  higher  angular  momentum,  but  identical  screening  constants  for  all 
orbitals  are  sufficient.  For  nuclear  separation  of  1  a.u.  we  get  8-digit  accuracy:  The 
accuracy  increases  with  increasing  nuclear  separation,  unlike  for  gtos. 

LCAO  Method  for  HeH^^ 

We  extend  our  previous  work  on  the  homonuclear  ion  H  2  [  3  ]  to  the  heteronuclear 
ion  HeH  .  In  the  electronic  hamiltonian  we  set  Z|  =  1  and  Z2  =  2: 

H  =  +  —  +  —  (1) 

2  fa  rt 
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Tabi.e  I.  Screening  constant  fof  the  lowest  energy  for  various 
values  of  the  intemuclear  distance  R.  and  maximum  principal 
quantum  number 


R 

^hijh  ^ 

II 

< 

1.0 

2.2 

2.7 

33 

2.0 

2.0 

25 

2.9 

4.0 

2.0 

2.0 

2.5 

8.0 

2.0 

2.0 

2.0 

20.0 

2.0 

2.0 

2.0 

60.0 

2.0 

20 

2.0 

(the  intemuclear  energy  term  ZiZi/R,  where  R  is  the  internudear  separation,  has 
been  dropped).  For  our  lcao  (linear  combination  of  atomic  orbitals)  ansatz,  we 
use  a  systematic  sequence  of  STOs  on  each  nucleus: 

=  A'  2  Z  C'slAst r  i( 0, .  0, ) .  ( 2 ) 

A’- 1  /.=0 

where  /  designates  an  orbital  at  position  a  or  an  orbital  at  position  b.  K  is  the 
normalization  constant  chosen  such  that 

I' =  1,  ^  =  +  (3) 

To  apply  the  variational  principle,  we  write 

Hil/  =  Etp,  (4) 

and  arrive  at  a  secular  equation  for  the  energy.  This  eigenvalue  problem  was  solved 
on  a  CRAY  Y-MP  using  the  IMSL  library  [4].  The  screening  parameter  f  was  varied 
to  minimize  the  energy  for  different  nuclear  separations  and  different  number  of 
orbitals,  fVh.gh  ( see  Table  I ) .  For  A'h.gh  greater  than  3,  the  energy  is  insensitive  to  the 
variations  of  f.  In  Table  II  the  corresponding  energies  are  recorded.  The  exact 
values  are  taken  from  [5J.  For  the  equilibrium  position,  i.e..  R  =  2  a.u.,  we  get 


Table  II  The  eletronic  energy  of  HeH‘*  for  vanous  intemuclear  distances  R  and  for  lcao  with  highest 

quantum  number 


R 

A'tuth  -  3 

Exact 

1.0 

-3.0129511 

-3.0332651 

-3.033352043 

-3.033352518075 

2.0 

-2.5043488 

-2.5121563 

-2.512192935 

-2.512193016590 

4.0 

-2  2500067 

-2  2505992 

-2.2506053801 

-2.250605387829 

8.0 

-2.1250000 

-2.12503483 

-2.125034906460 

-2.125034906488 

20.0 

-2.0500000 

-2.0500008807 

-2.050000880914 

-2.050000880914 

60.0 

-2.016666666 

-2.01666667750 

-2.016666677520 

-2.016666677520 

(C) 


«J) 


Figure  1.  (a),  (b),  (c).  and  (d)  show  the  wave  function  of  HeH^*  along  the  H-He  axis 
(H  IS  at  -RI2  and  He  is  at  +R/2)  for  separations  i?  =  1,  2.  4.  and  8  a.u..  respectively. 


8-digit  accuracy  using  only  2 1  orbitals  on  each  nucleus,  whereas  Wells  and  Wilson 
[  1  ]  require  42  gtos  for  7-digit  accuracy.  In  addition,  our  method  gives  even  better 
results  for  larger  separations  The  normalized  wave  function  along  the  molecular 
axis  is  plotted  in  Figure  1  for  /?  =  1, 2, 4,  and  8  a.u.  It  is  shown  that  the  charge  is 
practically  all  about  the  He  nucleus  especially  when  R  is  larger  than  4  a.u. 

Conclusion 

It  appears  that  STOs  when  processed  by  the  Lowdin  a-function  method  [6]  can 
form  basis  sets  that  give  excellent  results  for  one-electron  systems.  Strategies  de¬ 
veloped  here  should  prove  useful  in  the  study  of  more  complicated  systems. 
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Abstract 

Calculations  have  been  performed  on  low-lying  states  of  SF^  and  SF^  at  the  complete  active  space 
multiconhguration  self-consistent  field  (CAS-MCSCr )  level.  These  calculations  involved  optimization  of 
basis  function  exponents,  particularly  the/functions  on  sulfur  and  their  effect  on  the  potential  energies 
and  the  SF^  electron  affinity.  The  cas-mcscf  calculations  for  the  anion  and  neutral  included  4,863  and 
2.1 15  configuration  functions,  respectively.  Octahedral  and  selected  distorted  geometries  were  considered 
The  ground-state  equilibnum  geometries  correspond  to  the  octahedral  configuration  of  SF^  ('/li,),  R(S — 
F)  =  1.56  A.  ui,  =  776  cm“',  and  of  the  anion  SFi(^/li,).  R(S — F)  =  1.70  A,  ui,  =  558  cm"'.  The 
adiabatic  and  vertical  electron  affinities  are  found  to  be  -0.63  and  I  8  eV,  respectively  Electron  auto¬ 
detachment  phenomena  associated  with  the  anion  are  also  discussed  and  a  ixissible  mechanism  for  this 
process  is  presented,  c  1993  John  Wilcv  &  Sons.  Inc 


Introduction 

The  SF(,  molecule  has  been  the  object  of  many  studies,  both  experimental  and 
theoretical  during  the  last  several  years.  This  interest  is  oriven  largely  by  the  fact 
that  SFo  has  a  very  high  cross  section  for  electron  capture,  forming  the  SFo  anion. 
Experiments  on  the  ground-state  anion  indicate  that  it  has  multiple  autoneutrali¬ 
zation  modes.  Lifetimes  for  the  electron  autodetachment  process  have  been  mea¬ 
sured  in  drift  tube  experiments,  and  it  appears  that  the  autoneutralization  process 
is  relatively  facile,  with  lifetimes  between  2-40  microseconds  [1,2].  The  ease  with 
which  the  electrons  autodetach  suggest  that  the  detachment  process  is  not  dipole 
or  photo-induced,  but  probably  arises  through  intra-  or  intermolecular  energy 
transfer  and  surface  crossings. 

The  observed  properties  of  the  SF^/SF^  system  has  prompted  numerous  studies 
of  the  adiabatic  electron  affinity  (aEA)  of  the  neutral  molecule  While  the  experi¬ 
mental  value  has  been  reported  to  be  as  low  as  0.4  eV  and  as  high  as  1.4  eV  [3- 
9],  the  currently  accepted  value  is  very  near  1.0  eV,  as  obtained  by  Streit  using  a 
bracketing  method,  measuring  the  rates  of  lon-molecule  reactions  involving  electron 
transfer  [7],  and  later  by  Grimsrud  et  al.,  with  thermodynamic  measurements 
obtained  using  a  pulsed-elcctron  high-pressure  mass  spectrometer  [8], 

From  a  theoretical  standpoint,  the  SF^/SF^  system  poses  a  challenge  to  research¬ 
ers  interested  in  calculating  the  correct  value  for  the  neutral  aEA,  and  for  describing 
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the  modes  for  electron  autodetachmenl.  With  70  and  7 1  electrons  for  the  neutral 
and  anion,  respectively,  severe  restrictions  are  placed  on  the  basis  set  size  for  practical 
calculations.  This  problem  is  aggiavated  by  the  large  basis  set  requirements  for 
calculating  the  electron  affinities  of  even  simple  atomic  species,  In  addition,  the 
level  of  electron  correlation  required  to  adequately  describe  the  relative  differences 
between  the  neutral  and  anion  systems  is  unknown. 

Undaunted  by  the  difficulties  expected  in  these  calculations,  several  researchers 
have  attempted  the  theoretical  determination  of  the  aEA  of  SFb,  with  various  degrees 
of  success.  As  a  prelude  to  describing  the  work  performed  in  this  effort,  it  will  be 
instructive  to  briefly  review  some  pertinent  theoretical  results  obtained  to  date.  All 
reported  studies  have  assumed  an  octahedral  geometry  for  both  the  neutral  and 
anion.  Hay  calculated  the  self-consistent  field  (SCF)  value  of  aEA  to  be  1.03  eV 
without  zero-point  correction  [  10] .  However,  the  basis  sets  used  by  him  have  since 
proved  to  be  inadequate  for  correctly  determining  the  relative  electronic  energies 
of  both  the  neutral  molecule  and  anion.  Thus  the  agreement  between  his  value  and 
the  experimental  value  appears  to  be  fortuitous. 

Klobukowski  et  al.  performed  a  careful  study  of  the  basis  set  requirements  for 
studying  the  neutral-anion  system  at  the  SCF  level  and  have  determined  the  following 
[11]: 

( 1 )  The  use  of  a  larger  basis  set  gives  a  much  different  value  for  the  aEA  of  SFe, 
but  does  not  change  the  calculated  geometries  of  the  neutral  and  anion 
significantly  from  those  obtained  by  Hay. 

(2)  A  3 orbital  on  sulfur  has  little  effect  on  the  calculated  aEA. 

( 3 )  The  aEA  decreases  by  about  1  eV  by  simply  adding  a  d  polarization  function 
to  the  fluorine  basis.  This  basis  set,  similar  to  the  one  used  by  Hay,  produces 
an  electron  affinity  of  1 .40  eV. 

(4)  Adding  an  /type  function  to  the  sulfur  basis  lowers  th^  electron  affinity  by 
over  0.5  eV. 

The  final  value  for  the  aEA  of  SF*  obtained  by  Klobukowski,  using  their  largest 
basis,  is  -0.34  eV  at  the  SCF  level.  Thus,  providing  this  basis  is  adequate,  the 
Hartree-Fock  value  for  the  electron  affinity  of  SF(,  is  negative. 

In  an  effort  to  determine  the  effects  of  electron  correlation  on  the  electron  affinity 
of  SFft,  Miyoshi  et  al.  performed  both  SCF  and  configuration  interaction  (ci)  cal¬ 
culations  on  the  neutral  and  anion  [12].  Using  a  relatively  large  basis  set,  with  d 
polarization  functions  on  both  sulfur  and  fluorine,  these  workers  obtained  an  aEA 
of  -0.30  eV,  in  agreement  with  the  best  value  obtained  by  Klobukowski.  This 
agreement  is  interesting  since  Miyoshi  did  not  use  an /function  on  sulfur.  Thus, 
it  is  likely  that  adding  an  /function  to  their  sulfur  basis  will  lower  their  Hartree- 
Fock  value  for  the  aEA  of  SF6  even  further. 

Miyoshi’s  Cl  calculation  on  both  the  anion  and  neutral  was  performed  using  a 
much  smaller  basis  set  than  they  used  for  their  sCF  study.  The  Cl  energy  obtained 
from  this  basis  combined  with  their  SCF  energy  u  'ng  the  larger  basis  set  gave  a 
value  for  the  aEA  of  SF6  of  1 .06  eV,  in  excellent  agreement  with  the  currently 
accepted  experimental  value.  At  first  glance  this  monumental  effort  seems  to  com¬ 
plete  the  picture  concerning  the  relative  potential  energies  of  SF(,  and  SF6  .  However, 
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there  remain  some  nagging  questions  which  have  not  adequately  been  answered  in 
the  analysis  of  Miyoshi.  The  first  is  the  effect  of  the  inclusion  of  an  y^function  on 
sulfur,  which  was  left  out  of  the  Miyoshi  basis  set,  but  as  Klobukowski  clearly 
showed  lowers  the  neutral  potential  energy  curve  by  half  an  electron  volt  relative 
to  the  anion  curve,  and  thus  lowers  the  neutral  aEA.  The  second  is  the  consideration 
of  nonoctahedral  geometries  for  the  anion. 

In  this  study  we  have  attempted  to  determine  the  effect  of  the  sulfur /function 
and  nonoctahedral  anion  geometries  on  the  aCA  of  SF*,  using  both  SCF  and  CAS- 
MCSCF  calculations.  We  have  also  explored  low-lying  excited  states  of  both  the 
anion  and  neutral  at  the  sCF  level  with  relatively  large  basis  sets,  complementing 
earlier  work  on  the  neutral  molecule  by  Hay  [13].  Vibrational  states  of  both  species 
were  also  determined  in  an  attempt  to  rationalize  the  electon  attachment-detach¬ 
ment  properties  of  the  neutral  molecule. 

Theoretical  Procedure  and  Results 

The  basis  sets  employed  for  the  SCF  calculations  in  this  study  were  based  on 
Huzinaga’s  well-tempered  sets  [14].  In  the  largest  basis  sets  for  sulfur,  the  tightest 
primitives  for  both  the  s  and  p  sets  were  removed.  A  diffuse  ^-function  (zeta  = 
0.0023)  and  two  diffuse  p-functions  (zeta  =  0.041  and  0.002)  were  added.  The 
sulfur  basis  set  was  augmented  with  two  z(-functions  (zeta  =  0.659  and  0. 1 83 )  and 
one  /function.  To  determine  the  proper  choice  of/functions  for  the  sulfur  basis, 
the  exponent  was  varied  from  0.22  to  1 .02  for  both  the  anion  and  neutral.  For  the 
anion  'A  if,  state,  the  minimum  energy  was  obtained  for  an /function  with  f  =  0.64 
while  for  the  neutral  'T|^  state  the  best  exponent  was  0.68.  It  is  fortunate  that  the 
optimum  /function  for  both  the  anion  and  neutral  is  similar.  The  value  used  by 
Klobukowski  was  0.62  determined  from  the  overlap  criterion  [  14] .  For  consistency 
we  used  the  value  of  =  0.62  for  both  the  anion  and  neutral.  To  determine  whether 
one  /function  is  indeed  adequate,  an  additional  calculation  was  performed  using 
two /functions  (f  =  0.41  and  0.93).  The  calculated  aEA  at  the  SCF  level  with  two 
/functions  was  only  0.02  eV  smaller  than  that  calculated  using  one  /function 
(f  =  0.62).  Thus  it  appears  that  one  /function  on  sulfur  is  adequate.  The  sulfur 
basis  was  contracted  to  (12,12,11,1,1/8,8,1,1,1/1.1/1),  that  is,  the  sulfur  basis 
consisted  of  5s,  5p,  2d,  and  1  /function.  The  55  functions  contained  12,  12,  11, 
1,  and  1  primitive  gaussian  functions,  respectively,  while  the  5p  functions  contained 
8.  8,  1,  1,  and  1  primitive  gaussian  functions. 

For  the  largest  basis  for  fluorine,  again  the  tightest  primitives  for  both  the  5-  and 
p-functions  of  Huzinaga’s  well-tempered  set  were  removed.  The  set  was  augmented 
by  a  diffuse  5-function  (zeta  =  0.07392)  and  a  (/-function  (zeta  =  1.496).  The 
fluorine  basis  set  was  contracted  to  ( 10,1 1,1  /8,1,1,1  / 1 ). 

For  the  MCSCF  calculations,  two  additional  diffuse  5-functions  on  sulfur  were 
added  (zeta  =  0.0002  and  0.00002)  and  the  final  set  was  contracted  to 
( 1 2, 1 2, 1 1 , 1 , 1 , 1 , 1  /8,8. 1 , 1 , 1  / 1 , 1  / 1 ).  Thus,  for  the  sulfur  atom  and  six  fluorine  atoms 
189  primitive  gaussian  functions  were  used  for  the  MCSCF  calculations. 

The  MCSCF  calculation  was  performed  by  defining  an  active  space  common  to 
both  the  anion  and  neutral  molecule  and  allowing  all  single  and  double  excitations 
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Table  I.  Active  space  for  the  MCSCF 
calculation. 


Orbital 

Anion  orbital  energy 
(Hartrees) 

0.31846 

0.30446 

*b.. 

0.28418 

0.23683 

*(•- 

0.18869 

0.16323 

6a, g 

-0.14944 

III, 

-0.43445 

^hu 

-0.45389 

51, u 

-0.46512 

3c, 

-0.45452 

from  the  SCF  configuration  within  this  active  space.  The  active  molecular  orbitals, 
along  with  their  orbital  energies  are  presented  in  Table  1.  For  the  anion,  this  active 
space  generated  4,863  csf’s,  while  2,1 15  CSF’s  were  generated  for  the  neutral.  The 
CAS-MCSCF  calculations  in  this  study  required  nearly  300  h  using  a  CRay-ymp 
computer. 

The  calculated  total  energies  at  the  SCF  level  along  the  totally  symmetric  stretching 
mode  are  presented  in  Table  ii  and  the  potential  energy  curves  are  presented  in 


Table  II.  scf  and  mcscf  total  encr^es  (Hartrees)  for  SF^  and  SFj . 


R  (au) 

SCF 

CAS-MCSCF 

Neutral  ('A],) 

Anion  (^A|,) 

Neutral  ('Ai,) 

Anion  (^A],) 

2.75 

-994.235392 

-994.230650 

2.80 

-994.581848 

-994.576528 

2  85 

-994  282693 

-994.277987 

2.90 

-994.289813 

-994.285143 

-994.610003 

2.914 

-994.290161 

2.95 

-994.288100 

-994.283491 

-994  617690 

3.00 

-994.278938 

-994.274437 

-994.612580 

-994.60^661 

3.05 

-994.263536 

-994.259264 

3.10 

-994.242951 

-994.239444 

-994.585061 

-994.581183 

3.125 

-994.257088 

3.15 

-994.218107 

-994.259593 

-994.590706 

3.182 

-994.260823 

3.20 

-994  189809 

-994.260377 

-994.594521 

3.225 

-994.258832 

3.25 

-994.158760 

-994.256113 

-994.513949 

-994.593266 

3.35 

-994.090751 

-994.235175 

-994.567369 
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Figure  1.  A  striking  feature  of  the  anion  curve  is  the  local  minimum  at  the  neutral 
equilibrium  geometry.  This  feature  was  not  mentioned  in  the  work  by  Miyoshi, 
but  it  was  found  by  us  and  by  Klobukowski  that  the  composition  of  the  anion  6ai^ 
orbital  is  dominated  by  the  most  diffuse  functions  in  the  region  of  the  neutral  R^,, 
thus  the  extra  electron  is  not  truly  bound  to  the  SFf,  molecule  at  this  geometry. 
Instead,  this  region  of  the  anion  curve  represents  the  neutral  molecule,  with  the 
free  electron  occupying  the  most  diffuse  orbital  available  within  the  basis  set.  In 
the  current  basis,  this  orbital  is  seen  to  reside  almost  entirely  on  the  sulfur  atom 
and  is  dominated  by  contributions  from  the  very  diffuse  i  gaussian  functions  added 
to  the  Huzinaga  basis.  Therefore,  in  the  limit  of  an  infinitely  diffuse  basis  set,  the 
electron  simply  detaches  at  this  geometry,  leaving  the  neutral  SF,,  molecule  and  a 
free  electron. 

The  total  energies  and  potential  energy  curves  at  the  mcscf  level  are  also  shown 
in  Table  II  and  Figure  1.  Spectroscopic  properties  from  these  curves  were  obtained 
bv  passing  a  rounded  ramp  interpolating  function  through  the  calculated  data  [15]. 

Interpolated  data  generated  in  this  way  were  then  fit  to  a  Hulburt-Hirschfelder 
potential  function  [16].  The  spectroscopic  properties  and  aEA  obtained  from  these 
potential  energy  curves  are  presented  in  Table  III.  The  known  experimental  spec¬ 
troscopic  values  are  in  excellent  agreement  with  those  calculated  at  the  mcscf  level. 
However,  the  calculated  Ea’s  at  the  scf  level  arc  in  very  poor  agreement  with 
experiment  and  in  fact,  have  the  wrong  sign.  Our  scF  value  of  -9.68  eV  is  lower 
than  that  obtained  in  previous  studies.  The  inclusion  of  electron  correlation  at  the 
CAS-MCSCF  level  presented  here  does  little  to  improve  the  situation.  It  should  be 
noted  that  the  correlation  energy  recovered  in  this  study  is  only  about  one-third 
that  obtained  by  Miyoshi.  By  simply  adding  their  correlation  energies  to  our  SCF 
energies,  a  value  of  +0.68  eV  for  the  aEA  cf  SFf,  is  obtained. 


16 

14 


E 


4 
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0 _ 

1.4  1.5  1.6  1.7  1.8 
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Figure  1  M 1  (solid  lines)  .ind  (  as-mcs(1  (dashed  lines)  polcntial  energy  eurves  for  the 
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Table  III.  Spectroscopic  properties  for  SFj  and  SF^. 


SCF 

CAS-MCSCF 

Expenment 

R,(A) 

neutral 

1.54 

1.56 

1.565  [17] 

anion 

1.68 

1.70 

0),  (cm"') 

neutral 

863 

776 

769  [17] 

anion 

680 

558 

neutral 

23 

19 

anion 

14 

10 

SF^  adiabatic  electron 

-0.68 

-0.63 

1.0  [7.9] 

affinity  (eV) 

To  complete  the  characterization  of  the  potential  energy  curves  of  SFe  and 
SFa ,  the  Hulburt  -Hirschfelder  potentials  were  used  to  generate  the  first  few  vibra¬ 
tional  levels  for  the  molecules.  These  levels,  for  the  neutral  and  anion  at  both  the 
SCF  and  MCSCF  levels  are  presented  in  Table  IV.  These  data  show  at  most  one 
bound  vibrational  level  for  the  anion.  Above  this  energy,  the  electron  detaches. 
Note  that  the  first  vibrational  level  of  the  anion  is  very  close  in  energy  to  the  second 
vibrational  level  of  the  neutral. 

To  account  for  possible  Jahn-Teller  effects,  selected  nonoctahedral  geometries 
were  also  considered  in  this  study.  These  calculations,  as  well  as  calculations  of 
neutral  and  anion  excited  states  were  performed  using  the  scf  basis  set  described 
above,  without  the /function  and  diffuse  5-functions  on  sulfur. 

The  potential  energy  curves  for  the  '  T and  '  r,,,  states  of  the  neutral  as  well  as 
the  and  ^Tu,  states  of  the  anion  are  shown  in  Figure  2,  along  with  the  'A,,  and 
'Aig  states  of  the  neutral  and  anion,  respectively.  For  the  ground-state  neutral  mol¬ 
ecule,  the  outer  electron  configuration  in  octahedral  symmetry  is  ( lf2«)*(5fiu)* 
( l/i^.)'’,  while  for  the  anion,  the  ground  state  is  ( lf2H)*’(5fi„)^(  l/i^)®(6aig)'.  The 
neutral  and  anion  T states  are  created  from  the  excitation  \Ug — while  the 
T Ui  states  are  obtained  from  the  excitation  5f|„ — ba^g.  The  very  large  electronic 
energies  separating  these  excited  states  from  the  ground  states  make  it  unlikely  that 
they  are  involved  in  any  experimental  determinations  of  the  aEA  of  SF6,  nor  is  it 
likely  that  they  play  a  role  in  any  autodetachment  processes  for  the  anion. 


Tabi.e  IV  Calculated  vibrational  levels  (in  eV)  relative  to  the 
CAS-MCSCF  energy  of  SF6  at  R,. 


Level 

SCF 

CAS-MCSCF 

Anion 

Neutral 

Anion 

Neutral 

1 

0.93 

0.31 

0.83 

0.28 

2 

1.41 

0.92 

1.64 

0.83 

3 

1.87 

1.49 

1.62 

1.35 

4 

2.31 

2.03 

1.99 

1.85 

5 

2.73 

2.53 

2.35 

2.31 
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Figure  2.  SCF  potential  energy  curves  for  the  ground  and  first  two  excited  states  of  SFj 
(solid  lines)  and  SF6  (dashed  lines). 


While  the  ground  state  of  SFs  has  long  been  known  to  possess  octahedral  symmetry 
[15],  no  experimental  evidence  that  this  is  the  case  for  the  anion  has  been  presented. 
Breaking  octahedral  symmetry  increases  the  number  of  symmetry  adapted  basis 
functions  used  in  theoretical  calculations.  This  added  complexity  makes  it  clear 
why  octahedral  symmetry  has  been  assumed  for  the  anion  in  all  calculations  to 
date.  In  this  study  we  have  calculated  the  potential  energy  surface  for  selected 
distortions  from  octahedral  symmetry.  For  example,  the  molecule  was  allowed  to 
distort  along  the  Eg  vibrational  mode.  That  is,  the  axial  fluorine  atoms  along  the  z 
axis  were  moved  together,  and  the  equatorial  fluorine  atoms  in  the  xy  plane  were 
moved  together.  The  potential  surface  for  this  distortion,  which  maintains  D2A 
symmetry  is  shown  in  Figure  3.  It  seems  clear  that  this  state  of  the  anion  does 
indeed  prefer  an  octahedral  geometry  and  all  distortions  from  octahedral  symmetry 
increase  the  energy.  There  are  15  vibrational  modes  for  the  anion  and  neutral 
molecule.  These  correspond  to  the  symmetric  A  and  double  degenerate  Eg  modes 
discussed  above,  as  well  as  the  triply  degenerate  T i„  (2),  T2g,  and  T2U  vibrational 
modes.  The  /l|g  and  Eg  modes  involve  only  bond  stretching,  while  the  T2g  and  T2U 
modes  involve  only  angle  deformation  and  the  two  T\u  modes  involve  both  bond 
stretching  and  angle  deformation.  All  of  these  fundamental  modes  have  been  ob¬ 
served  in  the  neutral  molecule,  but  no  such  experimental  data  exists  for  the  anion. 

In  conclusion,  the  results  of  this  research  indicate  that  the  formidable  problem 
of  calculating  the  aEA  of  SFe  remains  unsolved.  While  the  currently  accepted  ex¬ 
perimental  value  has  been  obtained  by  others,  our  contention  is  that  these  results 
are  fortuitous  and  further  work  in  both  the  experimental  and  theoretical  detem'i- 
nations  is  needed.  For  theoretical  studies,  a  systematic  basis  set  evaluation,  in  which 
basis  functions  are  added  and  varied  until  the  value  of  the  electron  affinity  stabilizes 
is  needed.  Klobukowski’s  work  is  a  good  beginning  for  this  effort.  Experimentally, 
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Figure  3.  SCF  potential  energy  surface  of  the  ground  state  of  SF?  near  R,  along  the  Eg 

vibrational  mode. 

it  is  difficult  in  the  reported  collision  studies  to  identify  precisely  the  vibrational 
temperature  of  the  anions  produced.  Anions  formed  in  vibrationally  excited  states 
could  produce  erroneous  values  for  the  aEA. 

Distortions  conserving  D2/,  symmetry  show  clearly  that  the  molecule  prefers  oc¬ 
tahedral  symmetry.  Other  distortions  are  certainly  possible  and  should  be  explored. 
We  have  also  explored  some  low-lying  excited  states  and  shown  that  they  are  much 
too  high  in  energy  to  contribute  to  the  experimental  determination  of  the  aEA. 

In  regard  to  the  electron  autodetachment  process  for  the  anion,  the  potential 
energy  curves  presented  here  are  consistent  with  the  experimental  results.  In  order 
to  create  the  anion  in  the  first  place,  at  least  one  vibrational  state  should  be  available. 
Our  results  show  that  there  is  exactly  one.  The  ease  of  autodetachment  would 
require  this  state  to  easily  crossover  to  the  neutral  molecule /free  electron  state.  As 
has  been  shown  by  others  [18]  for  anion  and  neutral  vibrational  states  and 
x2',  respectively,  with  energies  Cy  and  c?-,  the  transition  x;;  -►  x?-  is  efficient  if: 

( 1 )  the  anion  and  neutral  potential  energy  surfaces  intersect  or  approach  one 
another  in  regions  of  the  coordinate  space  where  Xy-j  and  x2',  are  nonvan¬ 
ishing  and  the  highest  occupied  molecular  orbital  of  the  anion  becomes  very 
diffuse, 

(2)  the  change  in  slope  in  going  from  one  surface  to  the  other  is  small,  and 

(3)  Cy  -  Cy'  is  small. 
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These  criteria  are  met  for  the  A  if,  states  of  the  anion  and  neutral.  If  this  process  is 
indeed  involved  in  the  autoneutralization  of  SF^.  then  we  would  predict  a  strong 
positive  temperature  dependence  on  the  rate  of  electron  detachment. 

Based  on  these  findings,  the  multiple  electron  autodetachment  rates  observed 
for  SFfc  arise  from  initially  creating  anions  in  various  different  states,  which  each 
relax  to  a  vibrationally  hot  ground-state  molecule  at  different  rates.  The  electron 
in  the  vibrationally  excited  ground-state  anions  then  detach  according  to  the  mech¬ 
anism  described  above. 
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Abstract 

A  review  of  the  various  problems  facing  any  software  developer  targeting  massively  parallel  processing 
( MPP)  systems  is  presented  Issues  specific  to  computational  chemistry  application  software  will  also  be 
outlined.  Computational  chemistry  software  ported  to  and  designed  for  the  Intel  Touchstone  Delta 
Supercomputer  will  be  discussed.  Recommendations  for  future  directions  will  also  be  made  e  1993  John 
Wiley  &  Sons.  Inc. 


Introduction 

The  advent  of  massively  parallel  processing  (mpp)  supercomputers  has  been  an 
exciting  and  challenging  benefit  to  computational  science.  Many  of  the  algorithms 
and  theoretical  models  used  by  the  computational  chemistry  field  are  very  compute¬ 
intensive  and  the  computational  chemistry  market  is  an  obvious  target  of  many 
MPP  vendors.  Computational  chemists  have  long  been  at  the  forefront  of  utilizing 
and  developing  software  on  the  leading  edge  of  computational  technology.  The 
delivery  of  first-  and  second-generation  usable  mpp  hardware  has  enticed  many 
computational  chemistry  groups  to  begin  focusing  efforts  on  the  development  of 
chemistry  software  for  parallel  computing  systems.  To  date,  efforts  on  modern  mpp 
systems  are  distributed  among  primarily  academic  and  national  laboratory  envi¬ 
ronments  with  little  effort  from  the  vendor  community.  These  efforts  also  span  the 
entire  spectrum  of  computational  chemistry  methodologies  and  algorithms,  e.g., 
from  molecular  modeling  and  dynamics  to  full  configuration-interaction  calcula¬ 
tions,  and  have  been  conducted  on  the  gamut  of  available  mpp  hardware.  Further¬ 
more,  these  efforts  have  made  significant  progress  but  the  use  of  high  performance 
computing  systems,  specifically  massively  parallel  computers,  is  far  fronp  routine. 

In  this  article  1  outline  issues  that  challenge  software  development  with  respect 
to  a  very  technologically  volatile  hardware  industry  as  a  whole  with  a  focus  on  MPP 
systems  of  today  and  tomorrow.  The  perspective  is  that  of  a  computational  chemistry 
application  developer  and  what  1  see  available  now  and  in  the  near  future.  No  one 
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can  predict  long-term  trends  in  the  extremely  volatile  computer  industry,  so  I  will 
not  try,  except  to  state  that  software  development  will  be  different  than  the  way 
applications  are  developed  today.  A  discussion  of  the  general  issues  with  respect  to 
MPP  technology  and  issues  around  computational  chemistry  applications  is  pre¬ 
sented,  as  well  as  results  of  chemistry  applications  on  the  Intel  Touchstone  Delta. 
Conclusions  and  recommendations  arc  made  in  the  final  section. 


Software  Development  Issues  for  mpp  Computational  Chemistry  Software 

Software  Engineering.  Hardware,  and  Software  Life  Cycle 

Any  software  engineering  (SE)  text  warns  that  the  lack  of  effort  in  the  design  of 
modular  and  reusable  software  will  eventually  cause  a  complex  software  system  to 
collapse  under  its  own  weight  [  1  ] .  The  concepts  of  modularity  and  reusability  are 
without  regard  to  the  particular  programming  language  used  for  a  given  software 
system.  The  staffing  requirements  for  maintaining  a  required  but  poorly  designed 
application  are  substantial.  Computational  chemistry  software  applications  are  not 
immune  from  this  process;  in  fact,  many  of  the  software  developers  in  computational 
chemistry  are  graduate  students  rarely  trained  in  any  aspects  of  modem  computer 
science  or  SE.  The  programming  effort  is  usually  the  last  part  of  the  theoretical  or 
model  development  and  is  sometimes  less  interesting  to  a  student  trying  to  finish  a 
thesis  effort. 

Computing  hardware  has  undergone  a  tremendous  series  of  advances  over  the 
last  two  decades.  The  Supercomputing  industry  was  born  in  the  1960s,  flourished 
in  the  70s  and  80s,  and  has  grown  drastically  in  the  last  few  years.  In  the  1970s 
and  early  1980s  the  hardware  designs  lasted  on  the  order  of  5  to  7  years.  The 
doubling  of  computer  power  with  advances  in  hardware  technology,  and  the  interface 
to  that  technology,  operating  systems,  optimizing  compiler  design,  etc.,  progressed 
at  a  rate  where  software  developers  had  the  time  to  adapt  and  make  algorithmic 
modifications  to  make  optimal  use  of  the  computing  resources  available.  In  the 
last  5  years  the  hardware  technology  growth  curve  has  taken  a  drastic  change.  With 
the  advent  of  reduced  instruction  set  chip  ( RISC)  technology  the  period  for  doubling 
of  raw  compute  power  is  now  somewhere  between  12  and  18  months  (see  Fig.  1 
[2]).  By  the  time  a  computer  system  is  procured  and  delivered  it  is  most  likely  out 
of  date  (although  not  obsolete).  This  rapid  development  has  even  caused  the  com¬ 
puting  industry  to  implement  leap  frog  hardware  development  efforts  to  keep  pace 
with  the  demands  of  the  computing  user  base. 

The  hardware  used  in  the  past  and  today  has  a  standard  life  cycle  [1].  When  the 
hardware  is  first  delivered  or  developed  it  usually  has  a  high  failure  rate  or  is  not 
as  useful  as  might  be  expected.  This  is  sometimes  due  to  the  overall  software  interface 
to  that  hardware.  The  bugs  in  the  system  get  worked  out  and  the  system  becomes 
useful.  Then  the  us^r  community,  computational  chemists  included,  saturate  the 
resource  to  do  scientific  development  and  applications.  Over  time,  the  system  be¬ 
comes  obsolete  due  either  to  the  inevitable  hardware  failures  or,  as  is  more  likely. 
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Figure  I.  Tne  rate  of  change  of  raw  compute  power  available  from  workstations  [2] 


to  the  fact  that  the  resource  can  no  longer  meet  the  computational  requirements 
of  the  user  community  in  a  cost  effective  manner  (see  Fig.  2). 

This  simplistic  “usefulness”  model  also  applies  to  software  as  well  [1],  The  initial 
software  application  usually  has  a  very  limited  set  of  functionality.  As  more  robust 
algorithms  and  additional  functionality  are  implemented  the  software  becomes  more 
“useful”  to  the  user  community.  Without  further  algorithmic  developments  the  “use¬ 
fulness”  of  a  software  system  will  asymptotically  approach  some  steady-state  level  and 
not  deviate  from  that  (see  Fig.  3).  What  more  realistically  happens  is  that  as  new 
functionality  is  added  more  bugs  and  design  flaws  are  uncovered  and  the  failure  rate 
of  the  application  grows,  with  the  software  becoming  less  “useful”.  In  time,  maintenance 
and  development  efforts  usually  reduce  the  failure  rate  and  make  the  code  “useful” 
again.  It  has  been  said  that  the  process  of  porting  an  application  to  an  MPP  environment 
is  a  process  of  “rebugging  software”  and  the  experience  of  the  efforts  in  the  compu¬ 
tational  chemistry  community  have  shown  this  to  be  true. 


Programming  Models 

There  are  a  wide  variety  of  specific  programming  models  and  tools  that  can  be 
used  to  develop  a  working  programming  system  on  a  parallel  computer  f  3-6] ,  The 
obvious  and  key  point  is  that  there  must  be  a  parallel  algorithm  for  the  requisite 
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Figure  2.  Hardware  life  cycle  (Usefulness  overtime). 


computational  task(s).  The  various  programming  models  fall  into  the  traditional 
classes  of  data  parallel  (  dp),  shared  memory  (SM),  and  distributed  data/task  models. 
The  latter  is  the  common  multiple  instruction  multiple  data  (mimd)  programming 
model  with  message  passing  between  processors.  In  practice  most  developers  use  a 
single  program  MIMD  model  and  assign  data  and  tasks  based  upon  the  identity  of 


Figure  3.  Software  life  cycle  (Usefulness  overtime). 
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a  given  processing  node.  The  basic  problem  with  all  programming  models  is  that 
on  any  given  hardware  there  exists  a  locality  of  data  problem.  The  program  becomes 
a  bookkeeping  algorithm  that  finds  the  data  for  the  computational  task  at  hand. 

In  traditional  SM  environments  there  is  a  flat  memory  so  access  to  any  segment 
of  memory  is  uniform.  In  dp  and  MIMD  you  have  an  obvious  data  locality  problem 
when  sharing  data  among  processes  and  tasks.  Simple  replicated  data  algorithms 
circumvent  this  by  holding  a  copy  of  the  requisite  vectors/ matrices  on  each  node. 
With  the  current  vendor  offerings  of  16  to  64  Mbytes  per  physical  processing  node, 
this  is  a  clear  limitation.  Moreover,  the  replicated  data  algorithms  do  not  scale  to 
hundreds  or  thousands  of  processors.  These  obvious  problems  have  caused  many 
vendors  to  look  at  globally  addressed  memory  that  is  physically  distributed.  This 
gives  the  programmer  the  look  and  feel  ofaSM  programming  environment,  a  major 
benefit,  but  there  are  potentially  drastic  performance  penalties  for  accessing  nonlocal 
memory.  This  brings  the  data  locality  problem  back  to  the  programmer  and  the 
software  tools  available  on  the  system.  Now  that  several  general  aspects  of  various 
programming  models  have  been  outlined,  a  discussion  of  each  in  more  detail  follows. 

The  Shared  Memory  programming  model  is  probably  the  most  widely  used  and 
best  understood  model  simply  because  of  the  amount  of  time  that  the  computational 
community  has  used  this  technique,  but  few  quantum  chemistry  codes  make  use 
of  this  programming  model  even  today.  There  are  several  specific  approaches  that 
have  been  implemented  on  UNIX  Woikstations  and  on  various  low  and  high  end 
supercomputers,  ( Silicon  Graphics.  Stardent.  Alliant.  Convex.  Cray.  etc.).  The  gen¬ 
eral  scheme  used  to  parallelize  applications  is  to  identify  shared  and  priv  ate  segments 
of  memory  and  have  the  owner  compute  a  specific  portion  of  the  shared  data 
structures  and  all  of  the  private  data/tasks  available  to  the  specific  process.  There 
has  been  much  compiler  work  done  on  this  programming  model  and  it  is  reasonably 
well  understood  by  the  computer  science  field.  Programmers  can  tune  and  optimize 
their  code  using  compiler  directives  that  help  the  compiler  understand  the  appli¬ 
cation  layout.  This  is  in  my  estimation  the  most  efficient  parallel  programming 
model  for  computational  chemistry  applications  in  wide  use  today.  Unfortunately, 
the  underlying  hardware  is  not  scalable;  it  is  very  expensive  to  realistically  extend 
to  large  numbers  of  processors  ( e.g.,  greater  than  100)  and  very  large  memory  sizes 
(e.g.,  greater  than  10  Gwords).  This  has  forced  the  development  of  the  above- 
mentioned  global  accessible  memory  that  is  physically  distributed  (e.g.,  Kendall 
Square  Research.  Cray  Research  Inc.  T3D,  etc.).  The  overall  programming  model 
stays  the  same  but  there  is  an  added  task  of  making  sure  the  data  locality  is  preserved 
to  avoid  thrashing  of  pages  from  nonlocal  processors. 

The  data  parallel  programming  model  is  one  that  has  been  used  primarily  on 
the  SlMD  architectures  by  design  but  is  not  limited  to  these  machines.  This  is  also 
the  underlying  principle  behind  the  High  Performance  FORTRAN  ( HPF)  language 
specification  [7],  which  augments  the  recently  standardized  FORTRAN90.  HPF 
offers  the  ability  to  distribute  vectors  and  matrices  across  the  processes  on  a  machine 
via  compiler  directives  and  declaration  statements.  The  DP  rule  of  thumb  is  that 
the  owner  computes  the  portion  of  the  matrix /vector  that  it  controls.  The  DP 
programming  model  works  extremely  well  if  there  is  no  load  balancing  or  data 
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dependencies  across  processes  (Finite  Element  or  grid  calculations.  Fourier  trans¬ 
forms,  etc.).  In  a  pure  simd  program,  load  imbalance  causes  manv  processors  to 
be  idle  at  either  the  beginning  or  end  of  a  series  of  parallel  tasks.  HPF  offers  extrinsic 
procedures  or  routines  to  handle  access  to  message-passing  facilities  which  can 
handle  data  dependency  aspects.  Most  computational  chemistry  algorithms  have 
either  very  irregular  data  structures  or  access  a  regular  data  structure  in  an  irregular 
fashion  and  are  thus  not  suited  for  pure  DP  programming  models.  For  example,  in 
the  formation  of  the  closed-shell  Fock  matrix  a  two-electron  integral  contributes 
to  six  independent  Fock  matrix  elements  all  of  which  may  not  be  accessi'  e  in  a 
given  distribution  of  the  Fock  matrix.  The  HPF  draft  standard  acknowledges  the 
difficulties  of  irregular  data  structures  or  access  to  data  structures,  and  there  are 
plans  for  a  follow  on  language  specification  that  will  address  irregularity  issues. 
There  is  great  potential  for  such  standard  languages  or  extensions  to  existing  lan¬ 
guages  once  this  issue  is  addressed. 

The  message  passing  programming  model  is  probably  the  most  widely  used  pro¬ 
gramming  model  on  distributed  memory  architectures.  The  basic  principles  are 
that  every  process  has  a  local  memory  addressable  only  by  that  specific  processor 
and  it  can  only  access  nonlocal  information  by  passing  a  message  to  another  process 
that  has  other  needed  data.  Tasks  and  the  required  data  structures  can  be  farmed 
out  to  various  processors  via  these  messages  and  the  oarallel  calculation  performed. 
Messages  can  usually  be  sent  synchronously  or  asynchronously,  but  this  is  system 
dependent.  Synchronous  messages  require  the  cooperation  of  both  processes.  Asyn¬ 
chronous  message  passing  can  be  useful  but  requires  buffer  space  for  messages  to 
make  them  effectixe.  The  ability  to  pass  asynchronous  mes-sages  offers  a  more  robust 
programming  environment  because  it  does  not  require  a  sequential  bottleneck  ^^f 
the  sending  and  receiving  processors.  A  message-passing  program  makes  the  speed 
and  latency  of  the  underlying  hardware  an  important  aspect  of  the  algorithm  and 
software  design.  If  the  application  requires  only  short  messages  then  the  latency  or 
overhead  for  sending  each  message  is  important  (c.g..  molecular  dynamics  simu¬ 
lations).  If  the  application  requires  a  few  messages  that  are  relatively  large  (ca.  1 
Mbyte)  then  the  speed  or  bandwidth  of  the  underlying  communication  network  is 
important  (c.g.,  a  replicated  data  Hartree-Fock  code).  Many  computational  chem¬ 
istry  applications  contain  both  aspects.  The  general  course  grained  aspect  of  integral 
generation  and  Hartree-Fock  procedures  are  ideal  foi  this  programming  model 
and  the  currently  available  vendor  offerings.  More  complicated  and  useful  chemistry 
applications  have  finer  granularity  and  arc  thus  limited  by  the  underlying  com¬ 
munication  network  (e.g..  Multiconfiguration  Self-Consistent  Field  energy  and  en¬ 
ergy  derivative  methods). 

The  final  programming  model  that  1  would  like  to  discuss  is  that  of  distributed 
data.  This  programming  model  borrows  strongly  from  that  of  the  Linda  language 
(8).  The  concept  is  straightforward  and  is  really  an  extension  to  any  of  the  above 
programming  models.  The  concept  is  similar  to  that  of  elaborate  memory  paging 
algorithms  used  in  many  time-sharing  computers  today.  In  Linda,  the  user  has  the 
concept  of  a  tuple  as  an  abstract  data  object  that  can  be  stored  and  retrieved  to 
and  from  "tuple  space"  as  well  as  user-defined  functions  that  can  be  used  to  trans- 
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form  the  data  via  an  "eval”  call.  The  advantage  of  having  this  “secondary”  memory 
storage  is  that  the  location  of  the  data  and  the  mechanisms  for  moving  or  “paging” 
the  data  are  removed  from  the  programmer.  This  gives  the  look  and  feel  of  a  segment 
of  "shared”  memory  that  is  accessible  by  all  processes.  There  are  potentially  performance 
problems  with  the  way  the  distributed  data  is  accessed,  stored,  and  transformed  but 
the  distributed  data  programming  model  lessens  the  impact  on  computational  pro¬ 
grammers  using  distributed  computing  models.  The  details  of  the  storing  and  retrieving 
data  from  the  distributed  data  space  can  be  implemented  in  shared  memory  or  message 
passing  allowing  applications  to  be  more  portable.  Harrison  has  developed  and  suc¬ 
cessfully  demonstrated  a  distributed  data  model  for  the  Intel  Touchstone  Delta  super¬ 
computer  that  uses  the  interrupt  driven  mechanisms  available  on  that  machine  [9]. 
Similar  work  has  been  done  by  Rendell  et  al.  specifically  for  tb.e  closed-shell  coupled 
cluster  algorithm  ( 10].  In  general,  computational  chemistry  applications  will  not  need 
the  full  functionality  of  a  Linda-type  implementation,  but  only  well-defined  data  types, 
le.g..  for  FORTRAN  integer,  double  precision,  character,  etc.)  [9].  This  functionality 
is  projected  to  be  the  first  useful  programming  model  for  the  next  generation  of  scalable 
computational  chemistry  applications. 

Portability  and  Resource  Utilization 

Portability  is  an  issue  that  has  plagued  FORTRAN  computational  chemistry 
applications  for  many  years.  Unlike  many  modem  languages,  there  is  no  language- 
specific  mechanism  for  isolating  machine-dependent  code  in  FORTRAN.  There 
are  several  different  approaches  that  can  be  used  to  get  around  this  problem.  Many 
free  and  some  commercial  systems  for  FORTRAN  code  maintenance  exist  and  are 
widely  used  by  the  computational  chemistry  community.  Writing  only 
FORTRAN77  or  FORTRAN90  is  not  a  viable  option  because  many  chemistry 
applications  interface  with  the  system  environment  to  get  timing  information,  use 
special  disk  I/O  routines,  check  system  runtime  characteristics,  etc.  The  various 
complexities  of  the  above-mentioned  programming  models  will  also  add  to  the 
overall  complexity  of  a  “portable”  application.  A  review  of  the  computational 
chemistry  literature  over  the  last  few  years  will  show  that  chemists  will  use  as  many 
of  the  theoretical  models  as  is  feasible  for  the  solution  of  a  given  chemistry  problem. 
This  aspect  alone  will  compel  the  integration  of  computational  chemistry  appli¬ 
cations  into  a  suite  of  functionality  with  a  common  “user  interface.”  The  devel¬ 
opment  of  this  interface  is  a  research  topic  in  its  own  right  and  beyond  the  scope 
of  this  article.  It  is  imperative  that  the  core  functionality  application  suite  use  mod¬ 
ularity  and  more  commercial  style  software  practices  ( e.g..  long-term  use  or  reuse 
of  software)  to  maintain  the  integrity  of  applications  across  the  various  platforms, 
from  the  workstation  to  the  high  performance  computing  supercomputer. 

Resource  utilization  is  an  issue  that  will  have  to  be  addressed  by  both  the  user 
and  vendor  communities.  Users  are  accustomed  to  sharing  workstations  and  tra¬ 
ditional  supercomputers  based  on  a  round  robin  or  time  slice  multi-user  scheduling 
system.  On  current  MPP  offerings  this  is  simply  not  feasible.  The  scheduling  of 
resource  utilization  is  a  research  topic  in  the  computer  science  field.  Users  will  have 
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to  become  accustomed  to  more  batch  utilization  and  space  sharing  of  the  resources. 
The  disk  I/O  capacity  is  usually  the  limiting  faaor.  For  e.\ample.  on  the  Intel  Touch¬ 
stone  Delta  supercomputer  there  are  512  compute  nodes  with  16  Mb\les  of  memory 
each,  for  a  total  of  8  Gbvles  of  memory.  The  aggregate  1  /O  rate  on  that  machine  under 
the  normal  operating  system  is  less  than  12  Mbytes/sec.  This  means  to  roll  out  a  job 
using  all  nodes  would  require  at  least  1 1  min.  The  next  generation  mpps  will  not  be 
much  better  because  the  I/O  subsystem  is  the  least  improved  component.  Again  this 
is  an  active  computer  science  research  topic.  The  computational  chemistry  community 
cannot  and  will  not  wait  for  these  problems  to  be  solved.  This  means  that  the  application 
developers  must  be  more  aggressive  in  checkpointing  their  own  algorithms  with  only 
the  requisite  restart  data  being  written  to  disk. 

Review  of  Chemistry  Applications 

The  use  of  parallel  applications  in  chemistry  is  not  a  new  idea.  Reports  of  using 
available  parallelism  on  minicomputers  date  back  to  the  early  and  mid  1980s  [1 1- 
15 j.  Tiiere  are  a  .'ew  current  research  efforts  around  the  world  targeted  at  the  de¬ 
velopment  of  software  on  current  and  future  generation  MPPs.  These  include  mostly 
academic  and  national  laboratory  efforts,  although  a  few  vendors  have  openly  stated 
that  they  have  started  a  port  of  the  Gaussian  software  in  collaboration  with  Gaussian. 
Inc.  The  academic  research  efforts  include  GAMESS-USA  from  Mark  Gordon's 
group  at  Iowa  State  University,  DISCO  from  Jan  Almlof  and  coworkers  at  University 
of  Minnesota,  Columbus  from  Hans  Lischka  at  the  University  of  Vienna  (in  col¬ 
laboration  with  the  Ohio  State  University  and  Argonne  National  Laboratory ).  var¬ 
ious  applications  from  Bill  Goddard  at  Caltech,  and  a  new  initiative  at  the  San 
Diego  Supercomputer  Center  run  by  Peter  Taylor.  The  national  laboratory  efforts 
include  various  chemistry'  applications  at  Ai^onne  National  Laboratory,  Pacific 
Northwest  Laboratory.  Sandia  Laboratories,  the  National  Institutes  of  Health,  and 
the  SERC  Daresbury  Laboratory  in  England.  This  list  does  not  encompass  all  re¬ 
searchers  working  on  or  planning  mpp  application  development  but  does  include 
the  research  groups  that  have  significant  resources  for  their  effort.  1  have  also  limited 
the  list  to  efforts  1  have  some  direct  knowledge  of  the  software  being  developed. 

Molecular  dynamics  (md)  applications  have  been  using  MPP  systems  from  the 
beginning  of  the  development  of  these  parallel  computing  systems.  Various  algo¬ 
rithms  have  been  developed,  systolic  loop,  linked  cells,  and  replicated  data  systems 
[16,17].  Due  to  the  relatively  small  memory  requirements  of  most  md  applications 
the  replicated  data  algorithms  have  been  most  w.dely  used  (c.f  Ref  [17]).  The 
advent  of  larger  memory  capacities  and  faster  communication  subsystems  with 
decreased  latency  characteristics  on  next  generation  MPPs  will  keep  the  replicated 
data  algorithms  in  wide  use  over  the  next  few  years.  The  same  technological  advances 
will  also  allow  the  refinement  and  improved  performance  of  the  other  parallel 
algorithms  that  have  been  implemented.  The  replicated  data  algorithms  do  not 
scale  to  thousands  of  nodes  and  thus  md  applications  will  need  further  development 
once  the  scientific  demands  of  the  applications  increase.  The  md  applications  are 
approaching  routine  utilization  of  current  MPP  systems  and  are  generating  results 
that  require  the  increased  computational  resources  available  at  various  MPP  sites. 
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Traditional  ab  initio  software  applications  have  had  much  less  routine  devel¬ 
opment  on  MPP  machines.  Simple  replicated  data  algoiithms  are  useful  but  the 
memorv  size  on  most  mpp  machines  ranges  from  1 6  to  32  Mbytes  per  node  which 
is  a  severe  limitation.  For  e.xample,  in  a  replicated  data  Hartree-Fock  code,  the 
entire  density  matrix  and  Fock  matrix  would  be  stored  on  each  node.  This  would 
allow  each  integral  to  be  partially  summed  with  the  appropriate  density  matrix 
element  in  to  the  requisite  six  Fock  matrix  elements.  Once  all  integrals  are  computed 
the  “partial"  Fock  matrices  would  then  be  globally  combined  into  the  full  Fock 
.Matrix  and  diagonalized.  After  a  new  density  matrix  was  formed  and  broadcast  to 
each  process  the  iterative  process  would  continue.  This  scheme  replicates  two 
matrices  and  is  thus  parallel  over  the  A’VS  integral  generation  work.  However,  on 
a  machine  that  has  16  Mbytes  of  memory  you  could  have  no  more  than  two 
1000  X  1000  square  matrices  in  core  memory.  In  reality  the  operating  system  takes 
up  some  memory,  the  software  uses  memory,  and  thus  there  is  room  for  much 
fewer  matrix  elements.  The  thrust  of  most  efforts  is  to  go  beyond  this  limitation 
by  dis,  ibuting  the  data  structures  of  a  given  calculation  to  allow  the  problem  to 
scale  to  the  full  memory  of  the  machine  and  not  the  limitations  imposed  by  the 
per  node  memory.  From  this  simple  example  it  is  hopefully  clear  that  the  transition 
to  MPP  software  development  also  requires  more  up-front  analysis  of  the  algorithmic 
designs. 

In  the  above  Hartree  Fock  application  domain  the  type  of  distribution  has  different 
computation,  communication,  and  memory  tradeoffs.  Colvin  and  coworkers  [18] 
have  implemented  an  ar.ilication  that  distributes  both  the  Fock  and  density  matrices 
that  requires  A"'/2  instead  of  the  typical  A’V8  integral  generation  work.  This  dis¬ 
tribution  scheme  also  can  suffer  from  load  imbalance  in  the  parallel  integral  com¬ 
pulation.  In  this  distribution  scheme  a  lack  of  memory  has  been  traded  for  more 
integral  computation.  The  most  promising  technique  to  date,  is  that  of  Foster  [19] 
who  has  developed  a  distribution  scheme  that  stores  0(A)  Fock  and  density  matrix 
elements  on  each  processor  but  it  is  based  on  the  following  assumptions.  The  in¬ 
tegrals  all  cost  the  same  number  of  flops  and  are  computed  one  at  a  time  (i.e.,  not 
in  groups  over  shells).  Unfortunately,  these  assumptions  are  not  in  line  with  modem 
efficient  and  parallel  integral  algorithms.  In  this  distribution  scheme  communication 
is  increased  to  allow  for  a  fixed  computational  cost  of  the  integral  evaluation.  However, 
Foster's  efforts  do  show  promise  once  the  shell  grouping  of  integrals  is  addressed. 

Even  with  the  above-mentioned  problems  there  are  several  useful  ab  initio  ap¬ 
plications  in  use  on  MPr  systems.  GAMES-USA.  GAMES-UK,  and  Columbus 
arc  now  in  production  use  by  their  respective  groups  on  In.tl  mimd  machines  with 
specific  functionality  { i.e.,  not  all )  parallelized  [20-23  ] .  Harrison  and  Stahlberg 
have  implemented  an  object-oriented  style  full  ci  code  on  the  Intel  Touchstone 
Delta  supercomputer  that  sustains  4  GFLOPS  (20%  of  peak  performance)  [24]. 
The  distributed  data  models  used  by  Harrison  and  Stahlberg  were  implemented  by 
Rendell  and  coworkers  in  his  coupled-cluster  singles  and  doubles  code  [10].  Fey- 
creisen  and  coworkers  [25]  have  implemented  a  master/slave  repiicated  data  version 
of  Almlof 's  DISCO  SCF/MP2  program  on  the  Intel  Touchstone  Delta  supercomputer 
using  a  message  passing  library.  TCGMSG.  written  by  Harrison  [26].  DISCO 
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has  also  been  ported  to  workstation  clusters  using  TCGMSG,  LINDA.  PVh''.  and 
EXPRESS  [27],  For  all  these  codes  to  become  production  quality  the  above-mentioned 
scaling  issues  need  to  be  addressed.  These  developments  show  great  promise  and  are 
likely  to  be  the  foundation  of  ah  initio  applications  developed  in  the  future.  I  would 
recommend  the  reading  some  of  the  specific  references  for  more  details  of  the  imple¬ 
mentation  and  performance  parameters  of  each  of  these  codes  on  current  mpp  systems. 

Conclusions 

What  is  needed  to  bring  MPP  into  routine  computational  chemistry  production? 
This  can  be  summed  up  in  one  word;  software.  This  means  both  the  software  to 
facilitate  application  developments  and  the  application  software  itself.  It  is  unlikely 
that  the  large  industrial  user  community  will  convert  to  mpp  utilization  until  the 
commercial  software  base  or  high  quality,  high  performance  software  from  academic 
and  national  laboratory  efforts  is  available  for  use  on  these  high  performance  su¬ 
percomputers  or  MPPs.  The  commercial  software  developers  probably  won’t  make 
the  effort  until  a  more  significant  market  exists.  This  will  require  the  computing 
environments  on  the  mpp  machines  to  be  much  more  robust  and  a  pool  experienced 
development  personnel  must  become  available  to  commercial  software  companies 
from  the  academic  areas. 

The  development  of  new  algorithms  and  associated  software  must  keep  abreast 
of  changes  in  the  computing  environments  available.  In  this  article  I  have  aitempted 
to  point  out  a  subset  of  the  issues  that  need  to  be  considered  and  addressed  by  both 
the  computational  science  and  computer  science  communities.  MPP  development 
has  coerced  a  coupling  of  these  disciplines  and  this  coupling  provides  a  new  op¬ 
portunity  to  guide  the  development  to  the  solutions  of  some  of  these  issues.  No 
one  discipline  can  solve  all  the  problems  that  exist  or  that  will  be  uncovered  over 
the  next  decade.  I  have  also  tried  to  point  out  some  of  the  problems  associated  with 
computational  chemistrv'  applications  with  respect  to  mpp  development  and  pro¬ 
duction  use.  The  important  factor  here  is  doing  the  chemistry  required  to  solve 
problems  posed  to  us  either  in  basic  or  applied  research,  mpps  are  an  obvious  tool 
to  use  due  to  the  computational  requirements  of  the  theoretical  models  used  today 
and  that  will  be  used  in  the  future.  I  have  also  outlined  a  subset  of  the  research 
efforts  currently  in  place.  Beyond  the  icquired  computational  science  training  and 
education  it  is  important  that  development  feroups  get  acces*;  to  the  latest  technology 
to  develop  and  refine  research  ideas.  Moreover,  collaborative  efforts  with  computer 
science  and  other  computational  science  efforts  are  essential  for  the  development 
of  MPP  software.  In  regards  to  the  question  1  have  posed  in  the  title,  is  the  transition 
to  MPP  high  performance  computing  a  simple  one?  1  hope  so. 
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Abstract 

Algonthms  arc  designed  to  implement  molecular-dsnamies  simulations  on  emerging  concurrent  ar¬ 
chitectures  For  ssstems  with  finite-range  interacti''''s,  a  domain  decomposition  algonthm  is  used  to 
implement  the  multiple-time-step  (MTS)  approach  to  molecular-djnamics  (md)  simulations  on  distnbuted- 
memory  multiple  instructions  multiple  data  (mimd)  machines  This  approach  reduces  the  computation 
of  forces  significantly  by  exploiting  the  different  time  scales  for  short-range  and  intermediate-range  in¬ 
teractions.  Parallel  algonthms  are  also  designed  for  md  simulations  of  bulk  Coulombic  systems.  The 
performance  of  these  algonthms  is  tested  on  the  Intel  iPSC/860  system.  The  computational  complexity 
of  these  algonthms  is  0(.\)  and  parallel  efficiencies  close  to  0.9.  Molecular-dynamics  simulations  are 
earned  out  to  investigate  the  structural  and  dynamical  properties  of  highly  densified  and  also  porous 
silica  glasses  Changes  in  the  short-range  and  intermediate-range  order  in  amorphous  SiOj  are  determined 
at  different  densities  in  the  range  of  4.28-0.1  g/cm’.  Resul's  for  internal  surface  area  and  surface-to- 
volume  ratio  in  porous  SiO;  aie  also  discussed.  *  199.T  John  Wiley  &  Sons.  Inc 

Introduction 

Despite  significant  recent  developments  in  materials-simulation  techniques  [1- 
6].  the  goal  of  reliably  predicting  the  properties  of  new  materials  in  advance  of 
fabrication  and  measurement  has  not  yet  been  achieved.  The  primary  reason  for 
this  lack  of  success  is  the  inability  of  sequential  machines  to  handle  large-scale 
simulations.  For  example,  MD  simulations  for  long-range  interactions  scale  as  N^, 
where  N  is  the  number  of  particles  in  the  system.  In  many  physical  systems,  the 
desired  system  sizes  are  in  the  range  of  10*  particles.  These  are  beyond  the  compute 
power  of  most  sequential  machines.  However,  the  MD  technique  has  considerable 
inherent  parallelism.  By  exploiting  this  parallelism  on  emerging  parallel  architec¬ 
tures.  it  is  possible  to  perform  large-scale  simulations  for  complex  materials. 

In  the  past  two  years,  we  have  used  the  parallel  computer  architectures  in  our 
Concurrent  Computing  Laboratory  for  Materials  Simulations  (CCLMS)  to  carry  out 
MD  simulations  on  network  glasses.  The  CCLMS  has  the  following  computational 
facilities: 

•  MasPar  1208B — an  SIMD  (single  instruction  multiple  data)  machine  with  8192 
processors 


•  Also  at  Laboratory  for  Physical  Chemistry.  University  of  Amsterdam.  The  Netherlands. 


International  Journal  of  Quantum  Chemistry.  Quantum  Chemistry  Sympiostum  27.  781-792  (1993) 
c  l993Jobn  Wiley* Sons.  Inc  CCC {X)20-7608/93/0l078l-l2 
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•  Intel  iPSC/860 — an  eight-node  disfributed-memory  MIMI)  (multiple  instruc¬ 
tions  multiple  data)  machine 

•  Intel  iWARP — a  64-cell  MtMt)  machine  with  systolic  communication 

•  Silicon  Graphics  IRIS  4D/340VGX  Power  Center  for  real-time  visualization 

Plans  am  in  place  to  connect  these  machines  by  fddi  (fiber  distributed  data  interface) 
to  form  a  distributed  multipnrallcl  processing  network  so  that  diflerent  parts  of  a 
large-scale  simulation  can  run  concurrently  on  dificrent  parallel  architectures  along 
with  real-time  visualization. 

In  this  paper,  we  will  describe  our  recent  work  on  parallel  algorithms  for  the  md 
approach  and  simulation  results  for  structural  properties  of  network  glasses.  The 
outline  of  this  paper  is  as  follows;  the  next  section  deals  with  an  implementation 
of  the  MD  approach  on  the  iPSC/860  system,  and  the  performance  of  the  parallel 
MD  algorithm  is  discussed  in  the  third  section.  Simulation  results  for  structural 
properties  of  highly  densified  and  porous  SiOj  glasses  arc  discussed  in  the  fourth 
section  followed  by  concluding  remarks  in  the  fifth  section. 

Parallel  Algorithms  for  Molecular  Dynamics 

.Molecular-dynamics  approach  has  played  a  key  role  in  our  understanding  of 
classical  and  quantum  microscopic  processes  in  physical  systems  [1],  In  the  MD 
approach,  one  obtains  the  phase-space  trajectories  of  particles  from  the  numerical 
solution  of  Newton’s  equations.  Physical  properties  of  a  system  are  calculated  from 
phase-space  trajectories  of  the  constituent  particles.  The  interparticle  interaction 
energy  is  a  vital  input  to  MD  simulations.  This  N-body  term  is  commonly  expressed 
as  a  combination  of  one*,  two-,  three-body  potentials,  etc. 

For  systems  with  a  finite-range  intcrparticle  interaction,  the  total  number  of  pairs 
contributing  to  the  energy  and  forces  is  N*Nf,f2,  where  Nf,  is  the  average  number 
of  particles  within  the  range  of  the  interaction,  r^.  An  efficient  way  to  calculate 
interparticle  interactions  is  to  use  the  Hnked-list  method.  In  this  approach,  t||e 
simulation  cell  is  divided  into  smaller  cells,  each  with  an  edge  Lin  {L  is  the 
length  of  the  MD  cell)  which  is  slightly  larger  than  r^.  Using  two  integer  arrays,  a 
list  of  particles  in  each  cell  is  constructed.  The  first  array  identifies  the  particle  at 
the  top  of  the  list  in  each  cell  and  the  second  array  links  particles  belonging  to  the 
same  cell. 

A  major  advantage  of  the  linked-cell  list  technique  is  that  the  computation  time 
is  proportional  to  \4«N*{L/nf,  since  each  ccU  has  26  neighboring  cells  (Newton’s 
third  law  is  used  to  reduce  the  computation  by  a  factor  of  2).  Furthermore,  with 
the  linked-Iist  method  the  minimum-image  convention  can  be  implemented  effi¬ 
ciently  on  distributed-memory  MIMD  machines. 

T)  e  computation  of  forces  can  be  further  reduced  with  the  mulliple-time-step 
(MTS)  approach.  The  MTS  approach  exploits  the  fact  that  the  force  experienced  by 
a  particle  can  be  separated  into  a  rapidly  varying  primary  component  and  a  slowly 
varying  secondary  component.  The  primary  interaction  arises  from  nearest  neigh¬ 
bors  of  a  particle,  whereas  the  secondary  forces  are  due  to  other  particles  within  its 
range  of  interaction.  The  calculation  of  forces  thus  proceeds  in  two  steps: 
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a.  Using  the  linked-cell  list  method,  a  table  of  primary  neighbors  is  constructed 
for  each  particle  at  regular  time  intervals.  In  addition,  this  table  includes  neighbors 
within  a  spherical  shell  between  and  r,,  +  5,  where  is  the  nearest-neighbor 
separation  and  s  is  referred  to  as  the  skin.  The  skin  is  added  to  avoid  updating  the 
neighbor  list  at  each  time  step.  The  primary  contributions  to  potential  energy  and 
forces  are  computed  with  the  aid  of  the  neighbor  list. 

b.  The  secondary  forces  F, .,(/),  vary  slowly  in  time  and  therefore  are  calculated 
with  the  Taylor  series  expansion, 

F,,(/o  +  KAt)  =  2  f\%)  .  { 1 ) 

where  F^j  is  the  /th  time  derivative  of  the  secondary  force  at  time  to.  Similarly  the 
potential  energy  4>s  can  be  expanded  into  a  Taylor's  series: 

4., (to  +  KAt)  =  2  5-^  .  (2) 

1-0  ‘■ 

The  list  of  primary  neighbors,  F*/],  and  ¥p  are  updated  after  n  time  steps.  The 
choice  of  n  and  /max  is  dictated  by  the  energy-conservation  requirement. 

Domain  decomposition  is  the  most  appropriate  scheme  to  implement  md  sim¬ 
ulations  on  distributed-memory  mimd  machines.  In  this  case,  the  total  system  is 
divided  into  p  subsystems  of  equal  volume,  and  these  subsystems  are  geometrically 
mapped  onto  p  processors  [7-9].  For  a  cubic  system  of  volume  fi,  the  edge  of  each 
subsystem  is  /j  =  (n/p)''\  The  positions  and  time  derivatives  (up  to  fifth  order  in 
the  predictor-corrector  method  we  have  used)  of  all  the  particles  whose  coordinates 
fall  within  the  boundaries  of  a  given  subsystem  are  stored  on  the  processor  for  that 
subsystem.  The  forces  and  potential  energy  are  again  computed  with  the  MTS  ap¬ 
proach  and  the  linked-cell  list  scheme.  Domain  decomposition  is  updated  after  v 
time  steps,  just  before  constructing  neighbor  tables.  The  primary  forces  are  calculated 
with  neighbor  tables.  For  particles  at  the  boundary  of  a  subsystem,  it  is  necessary 
to  know  the  coordinates  of  particles  on  neighboring  nodes  which  are  within  r,,  +  5 
from  the  boundaries.  Relevant  data  for  particles  which  move  out  of  the  subsystem 
boundary  into  the  domain  of  a  neighboring  subsystem  are  sent  to  that  processor. 

Secondary  interactions  consist  of  intranode  and  internode  contributions.  The 
intranode  contributions  can  be  calculated  straightforwardly  with  the  linked-list 
scheme.  The  intemode  contributions  require  data  motion  (positions,  velocities, 
etc.)  for  particles  within  the  cutoff  radius  from  the  boundaries  of  subsystems  on 
different  nodes.  The  message-passing  strategy  we  have  used  is  shown  schematically 
in  Figure  1 .  First  the  data  from  node  0  is  sent  to  node  1 ,  data  from  node  1  to  node 

2 . and  data  from  node  p  -  1  to  node  0  synchronously.  Then  using  the  linked- 

cell  list  method,  the  contributions  to  secondary  forces,  potential  energy,  and  their 
time  derivatives  are  calculated.  Node  0  sends  back  the  calculated  contributions  to 
node  p  -  1  while  receiving  the  contributions  calculated  at  node  1 .  Similar  message 
passing  of  calculated  contributions  takes  place  synchronously  at  other  nodes  as 
well.  Next  node  0  receives  data  from  node  p  -  2,  node  1  from  node  p  -  1 , . . . , 
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Figure  l .  Intemode  communication  strategy  for  secondary  forces  with  eight  processors. 
Arrows  indicate  the  direction  of  data  motion. 


and  node  p  -  1  from  node  p  -  3.  The  contributions  to  forces,  potential  energy, 
and  their  time  derivatives  are  calculated  synchronously,  and  the  results  are  sent 
back  to  the  nodes  from  which  the  data  had  been  received.  This  procedure  is  con¬ 
tinued  until  all  the  necessary  interactions  have  been  computed. 

Performance  of  Parallel  Molecular  Dynamics  Algorithm 

The  performance  of  domain-decomposition  algorithm  for  MTS-MD  simulations 
was  evaluated  on  the  eight-node  Intel  iPSC/860  system  in  the  Concurrent  Com¬ 
puting  Laboratory  for  Materials  Simulations  at  Louisiana  State  University.,  For 
system  sizes  ranging  from  4000  to  108,000  Lennard-Jones  particles,  simulations 
were  performed  at  a  reduced  temperature  T*  (=kBT/c)  =  0.8  and  number  density 
p*  (=p«r^)  =  0.743  [7].  Equations  of  motion  were  integrated  with  a  time  step. 
A/=0.73  X  lO""'*  s.  The  primary  and  secondary  forces  were  cut  off  at  1.1  s  and  2.5 
s,  respectively,  and  the  skin  for  neighbor  tables  was  taken  to  be  0.4  s. 

In  Figure  2  we  show  the  system  size  dependence  of  the  total  execution  time  and 
communication  time.  The  computation  time  for  primary  forces  is  less  than  that 
for  secondary  forces.  The  parallel  efficiency,  j?  (=ti/{p*tp),  where  tp  is  the  execution 
time  on  p  processors  and  l\  is  the  execution  time  on  a  single  processor),  for  domain- 
decomposition  algorithm  is  estimated  to  be  0.9  and  the  algorithm  sustains  good 
load  balancing. 

Recently,  we  have  also  implemented  MD  simulations  involving  the  Ewald  sum¬ 
mation  for  Coulomb  interaction  on  distributed-memory  MIMD  machines  [8].  The 
Ewald  summation  [9]  is  the  most  widely  used  approach  for  the  long-range  Coulomb 
interaction  in  bulk  systems.  In  this  approach,  the  interaction  is  written  as  a  sum  of 
a  constant  term,  a  sum  in  the  Fourier  space,  and  a  sum  of  “short  range”  terms  in 
real  space.  The  parallel  algorithm  v/e  have  designed  for  the  Ewald  summation  reduces 
the  computational  complexity  from  to  0{N).  This  is  achieved  by  ensuring 
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N  (xIOOO) 

Figure  2  Execution  time  as  a  function  of  the  number  of  particles  in  a  Lennard-Jones 

system. 


that  both  the  real-space  and  Fourier-space  contributions  scale  linealy  with  the  size 
of  the  system.  In  real  space,  the  potential  energy  and  force  calculations  are  truncated 
at  r,  =  5r(i,  where  ro  is  the  ion-sphere  radius,  ro  =  (3/4;rr)''\  This  cutoff  maintains 
the  desired  level  of  precision — 0.01%  for  all  system  sizes.  The  real-space  contri¬ 
butions  arc  then  calculated  with  the  domain  decomposition  and  the  linked-list 
methods  which  scale  as  0{N).  The  computation  of  Fourier-space  contributions 
reveals  tha,  only  a  certain  number  of  wave  vectors  need  to  be  included.  We  per¬ 
formed  simulations  as  a  function  of  the  number  of  Fourier  components  and  system 
sizes.  It  is  found  that  an  increase  in  the  number  of  wave  vectors  from  309  to  2192 
produces  a  change  of  0.01%  in  the  total  potential  energy.  The  computation  time 
for  the  A-space  calculation  increases  linearly  as  the  number  of  wave  vectors  increases. 
Thus,  the  total  execution  time  for  the  Ewald  sum  scales  linearly  with  the  number 
of  particles  (see  Fig.  3)  and  is  inversely  proportional  to  the  number  of  processors. 

Structural  Correlations  in  Densified  and  Porous  Silica 

Silica  (SiO:)  is  one  of  the  most  interesting  materials  because  of  its  numerous 
polymorphs.  Over  the  years,  many  attempts  have  been  made  to  investigate  the 
structure  and  dynamics  of  crystalline  and  glassy  states  of  SiO:  at  high  pressures. 
Recent  in  stlu  high-pressure  X-ray  diffraction  experiments  reveal  significant  changes 
in  the  short-range  and  intermediate-range  order  in  SiO:  glass  [  1 0]. 

We  have  performed  MD  simulations  to  investigate  the  effect  of  pressure  on  struc¬ 
tural  correlations  in  silica  glass  [1 1].  An  effective  interatomic  potential  consisting 
of  two-  and  three-body  terms  has  been  used  in  these  simulations.  The  two-body 
potential  includes  long-range  Coulomb  interaction  due  to  charge-transfer  effects, 
charge-dipole  interaction  due  to  large  electronic  polarizability  of  O'  ions,  and 
steric  repulsion  between  ions.  The  three-body  covalent  potentials  for  Si — O — Si 
and  O — Si — O  interactions  include  the  effects  of  bond  bending  and  stretching  [  1 2). 
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Figure  3.  Execution  time  as  a  function  of  the  number  of  particles  in  a  bulk  Coulombic 

system. 


Our  high-pressure  simulations  cover  a  wide  range  of  mass  density,  from  2.20  g/ 
cm’  (normal  density)  to  4.28  g/cm’.  Figure  4  shows  partial  pair-distribution  func¬ 
tions,  g„ti  (a,  /3  =  Si,  O),  for  SiO:  glasses  at  the  normal  density,  2.2  g/cm’,  and  at 
high  densities.  3.53  and  4.28  g/cm’.  For  the  normal  density  glass,  the  first  peak  in 
^si-o(f)  gives  the  Si — O  bond  length  to  be  1.61  A.  In  addition,  there  is  a  small 
shoulder  around  3.80  A  and  a  broad  peak  at  4.16  A.  The  Si — O  nearest  neighbor 
(nn)  coordination  is  4.  The  position  of  the  first  peak  in  gs,-tAr)  and  the  corresponding 
nn  Si — O  coordination  remain  almost  unchanged  as  the  density  is  increased  to 
3.53  g/cm\  At  a  pressure  of  43  GPa.  where  the  glass  density  increases  to  4.28  g/ 
cm’,  the  first  peak  in  fe_o(r)  occurs  at  1.67  A  instead  of  1.61  A  and  the  Si — O 
coordination  increases  from  4  to  5.8.  In  stishovite  (a  crystalline  phase  with  nearly 
the  same  density),  the  Si — O  bond  lengths  are  1.76  and  1.81  A,  and  the  Si — O 
coordination  is  6.  In  the  glass  at  4.28  g/cm’,  the  second  peak  in  gsi—o{f)  is  at  3. 1 5 
A,  close  to  the  next-nearest-neighbor  (nnn)  Si — O  distance  (3.20  A)  in  stishovite. 
When  the  glass  density  reaches  4.28  g/cm’,  the  first  peak  in  gsisM  splits  into  two 
peaks  located  at  2.59  A,  close  to  the  nn  Si — Si  distance  (2.67  A)  in  stishovite,  and 
at  3.07  A  which  is  close  to  the  nnn  Si — Si  distance  (3.24  A)  in  stishovite.  The  area 
under  the  first  peak  gives  a  coordination  of  2  while  the  area  under  the  first  two 
peaks  is  10.  At  normal  density,  the  nn  O — O  coordination  is  6.  It  increases  to  10 
at  3.53  g/cm’  and  to  12  at  4.28  g/cm’.  In  stishovite,  the  O — O  coordination  is  12. 

In  Figure  5.  we  present  MD  results  for  the  density  dependence  of  the  static  structure 
factor,  S(q).  In  the  normal  density  glass,  the  first  sharp  diffraction  peak  (FSDP)  is 
located  at  1 .6  A  With  an  increase  in  the  density,  the  height  of  the  FSDP  decreases, 
its  width  increases,  and  its  position  shifts  to  higher  values  of  q.  (Elastic  compression 
cannot  account  for  the  observed  shift  in  the  position  of  the  FSDP.)  In  addition,  a 
new  peak  appears  when  the  density  is  increased  by  20%.  Located  at  2.85  A*',  the 
peak  grows  under  pressure  with  only  a  sli^t  shift  in  its  position.  The  simulation 


Figure  4.  Partial  pair-distribution  functions  (solid  lines)  and  coordination  numbers  (dotted  lines)  for  SiOz  glasses  at  normal  and  high  densities  at 
300  K.  Sharp  peaks  in  the  figure  at  4.28  g/cm^  correspond  to  pair-distnbution  functions  for  crystalline  slishovite. 


788 


KALIA  ET  AL. 


q  (A-‘) 

Figure  5.  Static  structure  factor.  S{q),  of  normal  and  high  density  SiOj  glasses  at  300  K 


results  for  the  effect  of  pressure  on  S{q)  are  in  good  agreement  with  high-pressure 
X-ray  measurements  by  Meade  et  al.  [10]. 

Figure  6  displays  the  MD  results  for  O — Si — O  and  Si — O — Si  bond-angle  dis¬ 
tributions  in  SiOj  glasses  at  normal  and  high  densities.  As  the  density  increases, 
the  peaks  in  these  distributions  broaden  and  also  shift  to  lower  angles  because  of 
increased  distortions  of  Si(Oi/2)4  tetrahedra.  At  normal  density,  the  O — Si — O  dis¬ 
tribution  has  a  peak  at  109°.  When  the  glass  density  reaches  4.28  g/cm^,  the  O — 
Si — O  distribution  develops  broad  peaks  at  90°  and  171°.  In  the  normal  density 
Si02  glass,  the  Si — O — Si  bond-angle  distribution  has  a  peak  at  142°  with  an  fwhm 
of  26°.  (Both  of  these  results  are  in  excellent  agreement  with  nmr  measurements 
[12].)  With  densification,  this  peak  shifts  to  lower  values.  The  Si — O — Si  bond- 
angle  has  broad  peaks  around  95°  and  128°  in  the  glass  at  density  4.28  g/cm^. 
These  values  are  close  to  the  Si — O — Si  angles,  98.65°  and  130.67°,  in  stishovite. 
Thus,  the  results  for  pair-distribution  functions  and  bond-angle  distributions  in 
Si02  glass  at  4.28  g/cm^  provide  strong  evidence  for  comer-sharing  and  edge-sharing 
Si(Oi/3)6  octahedra. 

Recently,  we  have  also  investigated  structural  correlations  in  porous  silica.  This 
material  is  environmentally  safe,  has  a  large  thermal  resistance,  and  high  optical 
transmission.  It  is  an  excellent  alternative  toCFC  foamed  plastic  in  thermal  insulation 
of  refrigerators  and  also  a  desirable  material  for  passive  solar  energy  collection 
devices.  Efforts  are  also  being  made  to  produce  optical  switching  devices  by  embed¬ 
ding  semiconducting  microclusters  in  the  pores  of  aerogel  silica. 
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Figure  6.  O — Si — O  and  Si — O — Si  bond-angle  distributions  for  normal  and  high  densitv 

SiO:  glasses. 


Our  MD  simulations  for  porous  silica  cover  a  wide  range  of  densities,  from  2.2 
to  0.1  g/cm^  [13].  The  simulated  systems  are  large  enough  (41.472  particles;  the 
lowest  density  system  is  a  cube  with  an  edge  of  240  A)  to  cover  the  entire  range  of 
structural  correlations.  Computer  generated  porous  glasses  v  ere  prepared  as  follows: 
The  starting  configuration  was  (3-cristobalite  at  room  temperature.  This  system  was 
heated  to  5000  K  over  a  period  of  72,000  time  steps  (1  time  step  =  0.5  X  10  s) 
and  then  equilibrated  for  30,000  time  steps.  Subsequently,  we  cooled  the  system 
to  4000  K  over  12,000  time  steps  and  again  thermalized  for  30,000  time  steps. 
Following  this  cooling  and  thermalization  process,  glasses  were  generated  at  3000, 
2000.  1500.  600,  and  300  K.  At  each  temperature.  MD  simulations  were  performed 
for  an  additional  30,000-60,000  time  steps.  Porous  glasses  were  produced  by  uni¬ 
formly  scaling  the  coordinates  of  all  the  particles  in  the  room-temperature  glass  by 
a  factor  of  1 .02- 1 .02o.  After  scaling,  the  system  was  thermalized  for  30,000  time 
steps.  Subsequently,  the  conjugate  gradient  scheme  was  used  to  bring  the  system 
to  the  lowest  local  energy  configuration.  In  such  a  relaxed  system,  particles  were 
given  random  velocities  according  to  Maxwell-Boltzmann  distribution  and  statistical 
averages  were  accumulated  over  9000  time  steps. 

In  Figure  7(a).  we  show  pair-distribution  functions  on  a  logarithmic  scale  at 
several  densities.  At  lower  densities,  one  finds  not  only  the  peak  at  1.61  A  but  an 
additional  peak  at  1 .58  A.  The  latter  is  due  to  Si — O  bond  in  triangular  units  whose 
number  grows  as  the  density  is  lowered.  Superimposed  on  the  peaks  in  g(r)  is  a 
powei-law  decay  from  which  one  can  determine  the  fractal  dimension  d,  of  the 
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p  (g/cm>) 

Figure  7  (a)  Log-!og  plot  of  pair  distribution  functions.  g(r).  of  silica  g'asscs  at  densities 

at  0.1.  0.4.  0.8.  and  1.5  g/cni’.  (b)  Fractal  dimension  d/a$  a  function  of  density. 


silica  network:  dj  =  3  +  d  Iog[5<r)]/6^  log(/-).  Figure  7(b)  shows  the  decrease  in  the 
fractal  dimension  as  the  density  decreases. 

Two  of  the  most  important  parameters  for  porous  glasses  are  the  internal  surface 
area  of  pores  and  the  pore  surface  area-to-volume  ratio.  These  characteristics  of 
pores  are  determined  as  follows:  The  system  is  divided  into  cubic  cells  of  volume 
(4  A)\  Counting  the  number  of  empty  cells  and  multiplying  it  by  (4  A)’  gives  the 
pore  volume.  The  pore  surface  area  is  determined  by  the  number  of  interfaces 
separating  empty  cells  adjacent  to  occupied  cells  and  multiplying  that  number  by 
(4  A)‘.  Figure  8(a)  shows  the  increase  in  the  internal  surface  area  and  Figure  8(b) 
the  decrease  in  the  pore  surface  area-to-volume  ratio  as  the  density  decreases.  The 
latter  is  a  measure  of  the  average  pore  size.  The  density  dependence  of  the  internal 
surface  area  of  pores  is  in  good  agreement  with  experiment  [14]. 


Conclusion 

Molecular-dynamics  simulations  have  been  efficiently  implemented  on  distrib¬ 
uted-memory  MIMD  machines  with  the  aid  of  multiple-time-step  approach  and  the 
linked-cell  list  scheme.  The  complexity  of  this  parallel  algorithm  is  0{N)  and  it 
scales  inversely  with  the  number  of  processors.  We  have  applied  this  algorithm  to 
investigate  structural  correlations  in  SiO’  glasses  at  various  densities.  Currently,  we 
are  implementing  this  approach  on  the  Delta  machine  at  Caltech  so  that  we  can 
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Figure  8  (a)  Internal  pore  surface  S  versus  density  calculated  from  the  configuration  of 

the  MD  porous  glasses,  (b)  Pore  surface-to-volume  ratio.  S/1 .  vs  density  calculated  from 
the  configuration  of  the  MD  porous  glasses. 


simulate  sufficiently  large  systems  (~  10*  particles)  to  investigate  atomistically  the 
phenomenon  of  fracture  in  ceramics  and  glasses. 
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Abstract 

In  this  article  we  descnbe  the  implementation  of  a  scheme  based  upon  the  McMurchie-Davidson 
recursion  relationships  to  calculate  two-electron  integrals  over  Gaussian-type  functions  in  extended  systems. 
Features  illustrating  the  computational  efficiency  of  the  approach  are  highlighted  together  with  additional 
aspects  of  our  program  to  compute  the  ah  initio  band  structures  of  polymers.  The  algorithm  is  applied 
in  calculation  of  the  band  structure  and  Sum  Over  States  polanzabilitv  of  polyethylene  and  polysilane 
using  standard  basis  sets  augmented  with  polanzation  functions.  The  role  of  polanzation  functions  is 
seen  to  be  small  in  polyethylene,  but  in  polysilane  the  silicon  d-orbitals  make  significant  contributions 
to  the  bonding  along  the  backbone  of  the  chain.  This  increase  in  bonding  character  results  in  a  reduction 
in  the  calculated  polanzabil'iv  m  going  from  the  unpolanzed  to  polanzed  basis  sets.  €  1993  John  Wiiey 
&  Sons.  Inc 


Introduction 

There  has  been  much  development  in  recent  years  in  the  theory  and  application 
of  the  quantum  theory  of  polymers  to  the  point  where  ab  initio  self-consistent  field 
(SCF)  calculations  can  be  performed  routinely  on  systems  with  small  and  moderately 
sized  unit  cells  [  1  ] .  Studies  on  simple  polymers  ( for  example;  polyethylene,  poly¬ 
acetylene,  polysilane,  •  •  • )  have  provided  valuable  insight  into  the  electronic  struc¬ 
tures  of  polymers  and  an  understanding  of  their  properties.  However,  almost  ail 
these  studies  have  been  carried  out  using  either  minimal  or  double-zeta  basis  sets, 
with  tb  '.  use  of  extended  or  polarized  basis  sets  still  being  exceptional.  In  molecular 
studies,  the  use  of  extended  and  polarized  basis  sets  is  i  iw  commonplace  and  is 
seen  to  be  important,  particularly  in  the  reliable  prediction  of  molecular  geometries 
and  properties.  The  development  of  efficient  algorithms  to  enable  functions  of 
higher  angular  momenta  to  be  included  in  calculations  upon  infinite  systems  is 
now  desirable  in  order  to  investigate  the  contributions  such  functions  make  to  the 
computed  energies,  band  structures,  and  properties.  The  importance,  or  otherwise, 
of  d-orbitals  in  the  bonding  of  polymeric  systems  has  recently  been  the  source  of 
some  debate  in  the  literature  [2.3]. 


•  Research  Assistant  of  the  National  Fund  For  Saentific  Research  ( Belgium).  Present  address  Quantum 
Theory  Projeci,  University  of  Rorida.  Williamson  Hall.  362.  Gainesville.  FL  3261  l-2i3S. 


International  Journal  of  Quantum  Chemistry.  Quantum  Chemistry  Symposium  27.  793-806  (1993) 
c  1993  John  Wiley  &  Sons.  Inc.  CCC  0020-7608/9.3/010793-14 


794 


MOSLEY  ET  AL 


Organic  polymers  are  the  subject  of  major  interest  due  to  their  potential  both  as 
conductors  and  as  important  materials  in  opto-electronics  [4,5].  The  accurate 
theoretical  prediction  of  the  electronic  structure  and  properties  has  an  important 
role  to  play  in  providing  insight  into  the  mechanisms  and  structure-property  re¬ 
lationships  involved  and.  in  order  to  fulfil  this,  the  development  of  efficient  and 
powerful  algorithms  is  essential. 

Generally  speaking,  the  evaluation  of  the  two-electron  integrals  is  the  most  time- 
consuming  step  in  polymeric  calculations.  Several  schemes  to  compute  these  in¬ 
tegrals  over  Gaussian  basis  functions  have  been  developed  and  implemented  for 
use  in  molecular  calculations  [6- 10] .  In  extended  systems  the  basic  problem  remains 
the  same  as  in  the  molecular  case,  though  efficient  algorithms  also  depend  on  specific 
use  being  made  of  features  inherent  in  polymers,  such  as  the  translational  and 
helical  symmetry. 

The  theory  of  lcaO  band  structure  calculations  has  been  covered  in  detail  in 
recent  reviews  [1,1 1].  To  avoid  repetition,  we  restrict  our  theoretical  discussion  as 
far  as  possible  to  aspects  directly  related  to  the  two-electron  integrals.  We  then 
present  a  description  of  the  implementation  of  an  algorithm  based  upon  the 
McMurchie-Davidson  scheme  [  7  ]  to  the  calculation  of  two-elcctron  integrals  in 
infinite  systems.  A  brief  description  of  the  use  of  permutational  symmetry  in  the 
integral  indices  and  of  the  exploitation  of  symmetry  in  the  construction  of  the  Fock 
matrices  is  also  given.  For  the  sake  of  completeness,  we  also  include  a  brief  de¬ 
scription  of  the  computation  of  the  asymptotic  polanzability  per  unit  cell  of  infinite 
systems  using  the  Sum  Over  States  method  [12], 

Using  the  integral  algorithm  outlined  in  the  next  sections,  we  present  the  results 
of  a  study  into  the  effect  of  the  inclusion  of  d-orbitals  in  the  basis  sets  for  polyethylene 
and  polysilane.  The  first  of  these  systems  is  one  of  the  classic  systems  to  study  in 
polymer  quantum  chemistry  (reviewed  recently  in  [13,14]),  and  has  been  studied 
previously  in  the  ab  initio  crystal  orbital  approach  with  polarization  functions  added 
to  the  basis  set  [15]  Polysilane  was  chosen  because  of  its  structural  similarity  to 
polyethylene  and  the  fact  that  one  can  reasonably  expect  the  inclusion  of  polarization 
functions  to  have  a  more  pronounced  effect  than  in  a  system  containing  first-row 
atoms  on’  Polysilane  is  the  source  of  considerable  interest  due  to  the  sigma  con- 
jugatioi  .ong  the  polymer  backbone  [16].  Finally,  we  present  some  conclusions 
which  can  be  drawn  from  the  preliminary  results  and  an  indication  of  potential 
avenues  for  both  short-  and  long-term  investigation. 


Aspects  of  Computation 

Two-Electron  Integrals  in  lcao  Calculations  of  Band  Structures 

The  standard  quantum  mechanical  treatment  of  polymers  is  based  on  the  periodic 
model  of  the  polymeric  chain.  Through  Bloch's  theorem,  the  one-electron  wave- 
functions  can  be  represented  as  Bloch  functions 


r)  =  exp[/A:r]M„(A:.  r) 


(1) 
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in  which  the  periodicity  axis  is  chosen  to  be  the  c-axis,  exp[/A:c]  is  a  non-periodic 
free-electron  like  contnbution  and  u„(k,  r)  is  a  periodic  contribution,  having  the 
same  periodicity  as  the  lattice  r)  =  r  +  jac:),  where  a  is  the  length  of 
the  unit  cell  and  j  is  an  integer).  The  LCAO  Bloch  form  of  a  single  atomic  orbital, 
X(r  -  P).  where  P  represents  the  co-ordinates  of  the  origin  of  the  atomic  orbital, 
is  written 


<t)(k,  r)  =  2  CjXfr  -  P  -jac:) 

=  Nc''^  2  exp[/A.7a]X(r  -  P  -yae.-) 

J 


(2) 


where  is  the  normalization  factor  for  a  polymer  of  Nc  cells,  and  the  sum 
over  J  represents  a  sum  over  N(_  cells.  The  LCAO  Bloch  form  describing  an  orbital 
over  the  whole  polymer  is  expressed  as 

r)  =  2  2  c°/„Xp(r  -  P  -jac:) 

J  P 

=  Nc'^‘  2  exp[/T7a]  2  Cp„{k)Xp{T  -  P  -jas;)  (3) 

J  p 

where  n  is  the  band  index,  k  the  position  vector  in  the  first  Brillouin  zone,  and  the 
superscripts  0  /  identifying  the  zeroth  and  7th  cells,  respectively.  The  Fock  matrix 
elements.  are  given  by 

fpi  =  J  Mr  “  P)S^X„(r  -  Q-jae:) 

=  J  xj!(r)7x^(r)c/r  (4) 

where  7  is  the  one-electron  Fock  operator.  Equation  (4)  can  be  expanded  into  its 
individual  terms  to  give 

-Z  I  +2222  p^![2(/?^'Ip"s')  -  (p.rVV)]  (5) 

/i  ^  h  I  r  s 


in  which  Tph  and  F  |  %re  the  kinetic  energy  and  potential  energy  matrix  elements. 
The  two-electron  integrals  appearing  in  Eq.  ( 5 )  are  defined  as 


(W' I 


/J 


x;;(r)Xi(r)X?(r')x'(r') 


dr  dr'. 


r  -  r- 


(6) 


The  electron-electron  Coulomb  contribution  to  the  Fock  matrices  can  be  simplified 
10(17.18] 

c:;'=  2  2  2/>ri2(/^VirV'^"')] 

m  r  ^  h 

=  2  2  2 . 1  (7) 


where  use  has  been  made  of  the  periodicity  of  the  electron  density.  Dri  =  ' 
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[17,18],  and  the  sums  over  /  and  h  in  Eq.  (5)  have  been  replaced  by  a  new  index 
m  =  i  -  h.  The  most  important  feature  to  note  here  is  that  the  integral  enclosed 
within  the  curly  braces  {p^q^\r*s'*^'")  =  which  we  describe  as 

the  “compacted  Coulomb  integral”  [18],  can  be  calculated  and  stored  once  in  the 
evaluation  of  the  two-electron  integrals  and  does  not  evolve  in  successive  iterations 
of  the  SCF  procedure.  This  results  in  a  reduction  in  the  number  of  integrals  stored, 
and  a  subsequent  reduction  in  the  number  of  I/O  operations  and  the  manipulations 
which  need  to  made  to  compute  the  Coulomb  contribution  to  the  Fock  matrices. 

In  the  program  developed  in  our  laboratory  [19,20]  two  different  ranges  are 
used  for  the  evaluation  of  the  Coulomb  and  exchange  contributions  to  the  Fock 
matrices.  Firstly,  there  is  a  short-range  region,/,  /  =  [-N,  +A^],  in  which  all  the 
electron-repulsion  integrals  are  computed  exactly.  Beyond  this,  we  define  an  in¬ 
termediate-range  region,  h  =  where  M  ^  2N,  in  which  the  Coulomb 

contributions  are  evaluated  exactly  and  summed  into  the  compacted  integral. 
Finally,  beyond  M  a  multipole  expansion  technique  is  used  to  treat  the  Coulomb 
interaction  to  infinity  [21]  taking  the  nuclear  attraction  Coulomb  term  to  give  a 
convergent  series.  For  the  exchange  contribution,  however,  the  evaluation  of  the 
terms  is  restricted  to  the  short-range  region  only  at  the  moment.  The  investigation 
into  methods  which  correctly  evaluate  the  exchange  contributions  to  infinity  is 
an  active  area  of  research  [22,23]. 


Integral  Evaluation  in  Extended  Systems 


Generally,  the  contracted  Gaussian  atomic  orbitals  used  in  molecular  calculations, 
defined  by  their  set  exponents  and  coefficients,  are  assigned  to  a  unique  center.  In 
symmetric  molecules,  the  symmetry  operations  relating  different  centers  (of  the 
same  atom  type)  can  be  used  to  reduce  the  computational  task  involved.  In  poly¬ 
meric  calculations,  the  same  atomic  orbital  exponents  and  coefficients  assigned  to 
an  atom  in  the  reference  unit  cell  are  used  in  the  translational  repeats  of  the  unit 
cell  along  the  polymer  chain.  Efficient  computation  relies,  in  part,  on  the  maximum 
reuse  of  information  and  avoiding  the  unnecessary  recalculation  of  factors.  With 
this  in  mind,  it  is  possible  to  simplify  the  calculation  of  integrals  in  extended  systems 
through  identifying  factors  which  can  be  said  to  be  “translationally  invariant,”  that 
is  terms  involving  the  orbital  exponents  and  coefficients,  prior  to  considering  the 
translational  repeat.  The  existence  of  point  group  symmetry  in  the  unit  cell  and 
the  adjacent  cells  being  related  by  a  helical  twist  [  24,25  ]  also  serve  to  reduce  the 
magnitude  of  the  computational  task,  though  these  are  outside  the  scope  of  the 
discussion  here. 

A  schematic  representation  of  an  algorithm  based  upon  the  McMurchie-Davidson 
( M-D )  scheme  [  7  ]  adapted  to  extended  systems  for  the  calculation  of  elertron- 
repulsion  integrals 


{pq'\r''s''^‘) 


X»(r)Xi(r)X?(r')X?-'(r') 


(8) 


extended  systems  is  given  in  Figure  1.  The  upper-case  indices,  P,  Q,  R,  S,  are  used 
to  identify  the  shells,  while  the  lower-case  indices  refer  to  atomic  orbitals.  The  M — 
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- loop  over  shell  indices  P  and  Q 

Compute  cell-independent  information 
Pre-factors  for  charge  distribution 

-  loop  overy-f.N 

Compute  cell-dependent  Information  and 
generate  charge  distributions  for  indices  p  and  g 
over  numcel  cells 


-  loop  over  shell  indices  R  and  S 

Compute  cell-independent  information 
Pre-factors  for  charge  distribution 

- loop  0  !-1,N 

Compute  cell-dependent  information  and 
generate  charge  distributions  for  indices  r  ands 
for  a  single  combination  of  cell  indices 

-  loop  over jml.N 

- loop  over  fj-f.M 

Compute  integrals  X(p,q,r,s,j,h,l) 


Process  integrals  over  M  cells 


Figure  1 .  Representation  of  McMurchie-Davidson-like  algorithm  for  the  computation 
of  two-electron  integrals  applied  to  extended  systems. 


D  scheme  is  based  upon  expressing  the  two-electron  integrals  in  terms  of  auxiliary 
functions  which  can  be  expressed  in  terms  of  a  set  of  recursion  relations. 

Looking  at  Figure  1  in  a  little  more  detail,  the  first  step  inside  the  outer  loop 
over  shells  P  and  Q  is  to  compute  the  factors  necessary  for  the  computation  of  the 
charge  distributions  which  are  independent  of  the  origins  of  the  shells,  that  is  factors 
which  only  depend  on  the  exponents  and  coefficients  of  the  Gaussian  primitives 
which  go  to  make  up  the  shells.  The  recursion  relations  necessary  to  compute  the 
overlap  distributions  are  then  generated  using  the  recursion  relations  for  the  indices 
p  and  q  over  all  cells  to  be  included  in  the  short-range  region.  This  information  is 
stored  in  core  memory  for  fast  access  in  the  integral  evaluation  steps  in  the  inner 
loops  for  all 7  =  -t-A^. 

Analogous  steps  are  then  carried  out  for  the  second  pair  of  indices  R  and  S  to 
generate  the  cell  independent  information,  followed  by  the  cell  dependent  factors 
inside  the  loop  over  /.  These  are  formed  for  a  single  value  of  /  at  a  time  and  make 
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use  of  the  fact  that  the  center  of  the  overlap  distribution  for  r  and  s  is  subject  to  a 
translation  of  ha  along  the  periodicity  axis  in  the  evaluation  of  the  integrals  in  the 
loop  over  h.  The  integral  evaluation,  which  is  performed  in  the  innermost  loop 
over  cells,  is  essentially  identical  to  the  molecular  case.  A  total  of  A/  X  num  integrals, 
where  num  is  the  number  of  integrals  resulting  from  the  quartet  of  shells,  P,Q,R, 
5,  is  calculated  in  a  single  step  through  the  loop  over  h.  These  integrals  are  stor"'^ 
in  core  memory  prior  to  being  processed  and  written  to  disk.  This  processing  consists 
of  a  final  test  to  ensure  that  the  value  of  the  integrals  exceed  a  certain  threshold 
and  that  the  triplet  of  cell  indices  are  within  the  defined  range  and  will  contribute 
to  the  Fock  matrices. 

The  algorithm  outlined  above  and  illustrated  in  Figure  1  is  included  in  the  latest 
release  of  the  PLH  program,  PLH-93  [19]. 

Use  of  Permutation  Symmetry 

Since  the  calculation  of  the  two-electron  integrals  is  so  computationally  intensive, 
it  is  beneficial  to  exploit  measures  which  can  result  in  economies.  The  permutational 
symmetry  of  the  indices  of  the  integrals  [26]  can  be  used  to  reduce  the  number  of 
integrals  which  need  to  be  calculated  and  stored.  This  is  a  well-known  feature  for 
molecular  calculations  [  27  ] ,  and  its  application  in  ab  initio  polymeric  calculations 
is  demonstrated  below. 

For  the  molecular  case,  a  two-electron  integral  {pg\rs)  is  equal  to  seven  other 
integrals  (assuming p,  q,  r,  s  are  not  equal)  obtained  through  permuting. the  indices 
as 

{pq\rs)  =  (pq\sr)  =  {qp\rs)  =  (qp\sr)  =  (rs\pq)  =  {rs\qp)  =  {sr\pq)  =  {sr\qp) 

which  means  that  only  one  of  these  integrals  has  to  be  calculated  and  stored. 

Adopting  the  shorthand  notation  for  the  compacted  Coulomb  integral  6pq,s{j, 
"?)  =  { p'^q’\r*s*^"' } ,  the  following  equalities  can  be  used  in  polymeric  calculations 

^pqnij^  rn)  —  Jy  ttl)  Jy  rn)  —  ^  rspq^tn  y  J') 

~  Uryqp(m,  ~j)  —  Usrpq(~m,j)  —  U  „qp(—m ,  ~  j) 

These  relations  are  only  formally  valid  for  the  summation  -  oo  <  <  oo .  Thus 

the  inclusion  of  long-range  effects  is  important  in  the  use  of  the  above  relations. 
An  equivalent  set  of  relations  holds  for  the  exchange  integrals 

{pq>\r''s'’*"’)  =  =  {qp-^\s''*'^-^r''-J) 

=  {rs"'\p-''q>->')  =  (rs"'W-‘'p-'’)  =  {sr-'”\p-''-”'q> 


Calculation  of  Sum  Over  States  Polarizabilities 

Recently,  a  scheme  has  been  proposed  for  the  rapid  computation  of  the  asymptotic 
longitudinal  frequency-independent  polarizability  per  unit  cell  of  infinite  systems 
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within  the  Sum  Over  States  ( SOS)  approximation  [12].  This  calculation  is  performed 
as  a  post-SCF  step,  and  through  providing  an  estimate  of  the  polarizability  can 
provide  at  leas;  a  qualitative  insight  into  the  nature  of  bonding  and  the  extent  of 
conjugation  !n  polymers. 

The  longitudinal  polarizability  can  be  expressed  as 


=  4  2  Z  2 


where  the  sums  over  i  and  a  represent  sums  over  occupied  and  unoccupied  bands 
respectively,  n,a(fc)are  the  dipole  transition  strengths  [28-30].  In  order  to  calculate 
the  polarizability  per  unit  cell,  the  sum  over  k  is  replaced  by  an  integration  over 
the  first  Brillouin  zone,  to  give 

^  —  y  y  r  , > 

Nr  ^  ,  J  c,{k)-c,(k) 

=  2  J  k)  dk 
=  2  «;.-(/)  . 


The  final  result  in  the  above  expression  illustrates  how  the  polarizability  can  be 
decomposed  into  the  contributions  made  from  each  of  the  occupied  bands.  This 
can  be  used  to  establish  relationships  between  the  topology  of  the  bands  and  the 
polarizability  to  provide  insight  into  the  nature  of  bonding  and  conjugation  in  the 
polymer  [12,31]. 

E\  idently,  the  SOS  scheme  has  its  limitations.  It  corresponds  to  an  uncoupled 
scheme  and  its  numerical  results  can  be  only  be  treated  as  very  rough  estimates. 
However,  it  is  able  to  predict  the  correct  trends  of  the  polarizabilities  in  groups  of 
compounds  and  can  thus  enable  qualitative  comparisons  to  be  made. 


Results 

The  two  systems  considered  here  are  polyethylene  and  polysilane.  Initially,  a 
series  of  “restricted”  geometry  optimizations  were  carried  out  on  oligomers  of  in¬ 
creasing  size,  H  —  (X2H4)„ — H,  where  X  -  C,  Si  and  n  =  1-6,  with  the  6-3 IG 
and  6-31G*  (a^  =  0.8)  basis  sets  [32]  for  the  alkane  chain  and  the  3-21G  and  3- 
2 1 G  *  ( =  0.45 )  basis  sets  [  33  ]  for  the  silane  chain,  using  Gaussian-90  [  34  ] .  The 
geometry  optimization  was  restricted  in  the  sense  that  all  atoms  forming  the  back¬ 
bone  of  the  oligomers  were  constrained  to  be  coplanar,  the  plane  containing  the 
backbone  designated  a  mirror-plane,  and  a  C2  rotation  axis  maintained  perpendic¬ 
ular  to  the  plane.  The  parameters  used  in  the  polymeric  calculations  were  deduced 
from  the  geometry  of  the  central — C2H4—  and  -  Si  2H4 — groups  obtained  in  the 
optimization  of  the  largest  oligomer.  These  geometries  are  summarized  in  Table  1. 

The  calculations  to  compute  the  total  energy,  band  structure,  and  sos  polariz¬ 
abilities  were  carried  out  using  PLH-93  [19].  In  all  cases  the  short-  and  intermediate- 
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Table  I.  Optimized  geometncal  parameters  and  unit  cell  lengths  used  in  polymenc  calculations. 
Parameters  deduced  from  central  — C2H4—  and  — S12H4 —  units  in  oligomenc  optimizations  on 
C12H24  and  S112H26  with  their  respective  basis  sets. 


Rx-x 

Rx-h 

<(X  — X  — X) 

<(H  — X  — H) 

a 

Polyethylene: 

6-3 1 G 

1.533 

1.088 

113.25 

106.35 

2  560380 

6-3  IG‘ 

1.530 

1.089 

113.32 

106.18 

2.556396 

Ref.  [14] 

1. 541 

1.087 

112.5 

107.2 

2.563 

Ref.  [35] 

1.534  ±0.006 

1.05 

1 12.0  ±  0.3 

106.5 

2.54 

Ref.  [36] 

1.533 

1.07 

112.0 

107.0 

2.46 

Ref.  [37] 

1.527  ±0.007 

1.091 

1 12.0  ±  0.8 

109.0 

2.53 

Ref.  [38] 

1.586 

1.08 

107.7  ±  0.3 

109.0 

2.55 

Polysilane; 

3-2  IG 

2.384 

1  489 

111.72 

108.29 

3.946324 

3-2  IG* 

2.346 

1  481 

112.29 

107.56 

3.897304 

All  bond  lengths  are  given  in  A  and  angles  in  degrees. 


ranges  of  interaction  were  defined  to  be  2Af+  1  =  13and2A/+  1  =  25.  respectively. 
The  thresholds  for  the  two-electron  integrals  was  set  to  10”’  a.u.  and  the  convergence 
criterion  for  the  density  matrix  elements  to  b?  10“-.  The  results  of  these  calculations 
are  shown  in  Table  II.  The  analysis  of  the  contributions  to  the  computed  polariz¬ 
abilities  in  terms  from  each  of  the  valence  bands  are  given  in  Tables  III  and  IV. 

Discussion 

For  polyethylene,  the  addition  of  polarization  functions  to  the  basis  set  has  only 
a  small  effect..  In  the  first  instance,  this  is  manifest  in  the  small  differences  between 


Table  II.  Total  energies  per  unit  cell  and  longitudinal  polarizabilities  per  unit  cell  and  per  unit  length 
for  infinite  polyethylene  and  polysilanc  chains. 


Total  energy 

per 

unit  cell 

per 

unit  length 

Polyethylene. 

6-31 G* 

-78  036867 

16.417 

3.392 

6-3  IG" 

-78.036857 

16.355 

3.385 

6-3 IG*  " 

-78.069384 

16.712 

3.459 

Polysilanc. 

3-21 G’ 

-577.116837 

63.778 

8.552 

.7-2  IG" 

-577.116213 

61.101 

8.296 

.3-2  IG*" 

-577  292676 

54.259 

7.367 

All  values  are  given  in  atomic  units. 

•Quantities  are  calculated  at  the  optimized  geometry  of  the  6-3 IG  or  3-21  G  basis  sets,  respectively. 
Geomctncs  obtained  with  the  polanzed  basis  sets  arc  used. 
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TABLb  III.  Valence  band  contnbutions  to  the  SOS  longitudinal 
polarizabilities  of  polyethylene. 


Band 

6-3  IG* 

6-3  IG" 

6-3 IG*  " 

3 

0.153 

0.153 

0.163 

4 

0.439 

0.441 

0.539 

5 

0.489 

0.491 

0.640 

6 

2.0.32 

2.050 

2.465 

7 

6.257 

6.237 

5.911 

8 

7.038 

6.976 

6.985 

16417 

16.355 

10.712 

“'■Results  were  obtained  using  the  6-.^!G  and  the  6-3IG*  optimized 
gcometnes.  rcspccti\cl>. 

All  \alucs  arc  in  atomic  units. 


the  geometries  obtained  with  the  6-3 IG  and  6-3 IG*  basis  sets.  This  is  not  surprising, 
as  one  would  expect  the  unpolarized  6-3  IG  basis  to  give  a  good  description  of  the 
bonding  in  ethylene  chain,  and  that  this  description  would  not  be  greatly  enhanced 
through  the  addition  of  polarizatton  functions  only.  The  two  basis  sets  used  here, 
however,  do  show  an  improved  agreement  with  experimentally  determined  geo¬ 
metrical  parameters  over  those  obtained  with  smaller  basis  sets  (see  [14]  for  a 
discussion).  The  small  lowering  in  energy  and  slight  increase  in  longitudinal  po¬ 
larizability  indicate  that  the  effect  of  the  presence  of  d-orbitals  is  minimal.  The 
valence  band  structure  of  polyethylene  with  the  6-3  IG*  basis  is  given  in  Figure  2. 
The  6-3  IG  band  structure  is  not  included  as  the  differences  between  it  and  Figure 
2  are  almost  imperceptible.  The  differences  between  the  band  structures  obtained 
with  the  6-3 IG  and  6-3 IG*  basis  sets  are  very  small  indeed,  the  discrepancies 
between  the  band  energies  all  being  <10  mHartree. 


Table  IV.  Valence  band  contributions  to  the  SOS  longitudinal 
polanzabilities  of  polysilanc. 


Band 

3-2  IG* 

3-2  IG" 

3-2 IG*  " 

II 

0  606 

0.557 

0.704 

12 

0.596 

0.602 

1.087 

13 

1  428 

1.476 

1.940 

14 

1.902 

2.006 

3.018 

15 

14.557 

14.261 

11.706 

16 

44  662 

42.172 

35.681 

„,,/A 

63.778 

61.101 

54.259 

•'■  Results  were  obtained  using  the  3-21G  and  the  3-21G*  optimized 
gcometnes.  respectively. 

All  values  are  in  atomic  units. 
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Figure  2.  Valence  Bands  of  polyethylene  calculated  with  the  6-3 1 G  *  basis  set. 


The  analysis  of  the  contributions  of  the  occupied  bands  to  the  computed  polar¬ 
izabilities  of  polyethylene  are  similar  in  both  cases,  which  in  turn  are  similar  to 
analyses  previously  published  [14,31]  using  smaller  basis  sets.  The  sizes  of  the  sos 
polarizabilities  obtained  here  with  the  6-3 IG  and  6-3 IG*  basis  sets  show  the  ex¬ 
pected  slight  increase  over  those  obtained  in  these  studies  with  more  limited  basis 
sets  («::  =  16.332  (3-21G)  and  a;;  =  8.080  (STO-3G)).  The  largest  contribution 
to  the  polarizability  is  due  to  the  highest  occupied  band.  At  the  point  k  =  0,  this 
band  is  highly  delocalized,  and  analysis  of  the  lcao  coefficients  shows  that  it  is 
predominantly  composed  of  contributions  from  carbon  P;  orbitals  [13].  On  the 
addition  of  d-orbitals,  this  band  is  slightly  stabilized  by  small  d-components  of  the 
LCAO  coefficients  along  the  polymer  backbone.  This  results  in  a  slight  increase  in 
the  gap  between  the  valence  and  conduction  bands  (8.6  -►  8.8eV). 

The  effect  of  the  inclusion  of  polarization  functions  is  much  more  striking  in  the 
case  of  polysilane.  The  optimized  geometries  of  the  oligomers  with  the  two  basis 
sets  are  noticibly  different,  with  a  shortening  of  the  Si — Si  bond,  a  slight  widening 
of  the  Si — Si — Si  angle  in  the  backbone  and  a  closing  of  the  H — Si — H  angle 
in  going  from  the  3-2  IG  to  3-2  IG*  basis  sets.  These  observations  are  in  agreement 
with  previous  studies  [  35-37  ] .  The  band  structures  and  density  of  states  of  polysilane 
obtained  with  the  two  basis  sets  are  represented  in  Figures  3  and  4.  The  most 
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Figure  3.  Valence  Bands  and  Density  of  States  of  polysilane  calculated  with  the  3-2 IG 

basis  set 


significant  change  is  apparent  in  the  values  of  the  polanzability,  which  show  roughly 
a  10%  decrease  on  the  addition  of  the  polarization  functions.  This  is  surprising,  as 
one  might  intuitively  expect  an  increase  in  the  polarizability  on  the  addition  of 
polarization  functions.  This  effect  has  been  noted  previously  [38]  in  coupled  Har- 
tree-Fock  calculations  of  the  polarizability  of  silane  and  disilane  with  more  extensive 
basis  sets.  Thus,  it  seems  that  the  observed  decrease  in  the  polarizability  here  is  not 
solely  due  the  deficiencies  of  the  sos  method  or  small  size  of  the  basis  set,  but 
instead  point  to  a  role  for  d-orbitals  in  the  description  in  the  bonding  in  polysilant. 
This  is  supported  by  the  observed  shortening  in  the  Si  —  Si  bond  and  not  insignif¬ 
icant  contributions  from  the  d-orbitals  to  the  lcao  coefficients.  Further  calculations 
[38-40]  indicate  that  the  choice  of  exponents  of  the  polarization  functions  is  crucial 
and  that,  ideally,  more  than  one  set  is  desirable  to  describe  the  bonding  and  po¬ 
larizability. 

The  major  contributions  to  the  computed  polarizabilities  are  due  to  the  highest 
occupied  valence  band,  as  was  the  case  with  polyethylene.  At  A:  =  0,  this  band  is 
predominantly  composed  of  silicon  p.  orbitals,  with  smaller,  though  significant, 
contributions  from  the  d-orbitals  with  lobes  pointing  along  the  polymer  backbone 
for  the  polarized  basis  set.  The  marked  reduction  in  the  contribution  to  the  polar- 
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Figure  4.  Valence  Bands  and  Density  of  States  of  polysilane  calculated  with  the  3-2 IG* 

basis  set. 


izability  from  this  band  on  the  addition  of  the  polarization  functions  indicate  in¬ 
creased  localization  of  the  charge  and  a  reduction  in  the  o-conjugation  along  the 
polymer  backbone.  We  are  not  in  a  position  to  say  that  the  a-conjugation  was 
previously  oversuted,  but  rather  that  the  3-2 IG*  basis  should  give  an  improved 
description  of  the  bonding  in  polysilane  and  that  more  accurate  studies  will  rely 
on  more  extended  basis  sets,  perhaps  with  multiple  sets  of  polarization  functions 
with  carefully  chosen  exponents. 


Conclusion 

In  this  article  we  have  given  a  brief  description  of  an  algorithm  particularly  suited 
to  the  computation  of  two-electron  integrals  in  extended  systems,  together  with  the 
application  of  the  algorithm  to  the  calculation  of  the  band  structure  and  sos  po¬ 
larizabilities  of  polyethylene  and  polysilane  using  standard  basis  sets  augmented 
with  piolarization  functions.  In  the  case  of  polysilane,  the  polarization  functions 
added  to  the  silicon  atoms  play  a  role  in  increasing  the  bonding  character  along 
the  polymer  backbone.  This  improvement  on  the  description  of  the  bonding  in 
polysilane  is  at  the  expense  of  the  calculated  polarizability,  suggesting  that  more 
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than  one  set  of  d-exponents  are  necessary  to  describe  the  bonding  and  response 
properties  correctly.  Further  investigation  is  also  necessary  into  the  question  of 
whether  it  is  appropriate  to  use  the  same  exponents  in  molecular  and  polymeric 
calculations. 

There  is  much  scope  for  further  work  into  the  effect  of  the  addition  of  polarization 
functions  into  the  computed  energies  and  properties  of  polymeric  systems. 
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Abstract 

Laser  illumination  of  the  junction  of  a  scanning  tunneling  microscope  (STM)  modulates  the  barrier 
height  by  adding  the  electric  field  of  light.  Numerical  simulations  suggest  that  the  dependence  of  tunneling 
current  on  the  magnitude  and  frequency  of  the  modulation  can  serve  as  a  basis  for  determining  the 
duration  of  barrier  traversal.  Modulation  increases  the  mean  value  of  the  current  when  the  frequency  is 
much  less  than  the  reciprocal  of  the  traversal  time.  Our  simulations  also  suggest  that  a  670  nm  laser 
diode  at  a  power  density  of  100  W/cm^  will  reduce  the  tunneling  current.  The  latter  effect  is  contrary 
to  most  competing  phenomena  such  as  current  rectification,  photo-assisted  tunneling,  thermal-assisted 
tunneling,  and  thermal  expansion.  We  are  compleung  a  novel  stm  which  has  decreased  noise  and  increased 
stability  in  order  to  make  the  laser/STM  measurements.  ®  1993  John  Wiley  &  Sons.  Inc. 

Introduction 

The  duration  of  quantum  tunneling  is  a  topic  that  has  generated  much  interest 
and  controversy  ( for  reviews,  see  Refs.  [1]).  Experiments  with  heterostructures  [2] 
and  Josephson  junctions  [3]  suggest  that  a  specific  time  is  associated  with  barrier 
traversal.  Quantum  mechanics  does  not  describe  the  motion  of  particles  within  the 
classically  forbidden  region,  so  that  a  variety  of  procedures  have  been  used  to  cal¬ 
culate  the  duration  of  quantum  tunneling.  However,  the  results  do  not  agree  [  1  ] . 
Most  methods  give  a  definite  value  of  traversal  time  for  a  specific  problem,  which 
appears  inconsistent  with  the  statistical  nature  of  quantum  phenomena.  Distribu¬ 
tions  of  tunneling  times  have  been  predicted  using  Feynman  path  integrals  [4], 
Bohm’s  causal  interpretation  of  quantum  mechanics  [  5  ] ,  and  a  hybrid  procedure 
[6].  Again  the  results  do  not  agree. 

It  has  been  shown  [7  ]  that  tunneling  through  the  reaction  barrier  must  be  added 
to  the  models  in  transition  state  theory  in  order  to  obtain  accurate  quantal  micro- 
canonical  rate  constants  at  high  energies.  Thus,  accurate  modeling  of  chemical 
reactivity  requires  correcting  for  time  delays  due  to  tunneling,  in  addition  to  the 
well-known  delays  associated  with  passage  over  the  barrier  [8].  Tunneling  times 
must  also  be  considered  for  accurately  modeling  solid  state  devices.  Measured  tunnel 
conductance  in  heterostructures  differs  from  theory  by  as  much  as  2  orders  of 
magnitude  unless  static  image  corrections  are  used  for  large  traversal  times  and 
dynamic  effects  for  short  traversal  times  [2],  Tunneling  occurs  in  most  solid  state 
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devices,  at  least  in  the  ohmic  contacts  [9].  The  barrier  traversal  time  in  ohmic 
contacts  is  probably  on  the  order  of  10  fs,  so  that  it  may  generally  be  ignored;  but 
it  should  be  considered  in  picosecond  switching  and  terahertz  technology.  We  an¬ 
ticipate  that  the  effects  of  tunneling  times  will  also  be  encountered  as  progress 
continues  on  the  generation  of  femtosecond  optical  pulses  [10]. 

Several  experimental  methods  have  been  used  to  examine  tunneling  times,  but 
the  results  are  inconclusive.  ( 1 )  Analyses  of  measurements  of  tunnel  conductance 
in  heterostructures  [2]  suggest  that  the  response  of  image  charges  is  dependent  on 
the  traversal  time.  There  is  much  scatter  in  the  data,  but  the  results  appear  to  be 
consistent  with  the  semiclassical  traversal  time.  (2)  Measurements  of  the  effect  of 
a  static  magnetic  field  on  the  tunneling  conductance  of  heterostructures  have  been 
used  to  determine  the  traversal  time  [11],  but  the  data  may  also  be  explained 
without  reference  to  tunneling  times  [12].  (3)  Tunneling  dynamics  was  studied  in 
a  shunted  Josephson  junction  [3],  but  this  involves  tunneling  between  the  states 
of  a  device  and  does  not  directly  relate  to  tunneling  by  particles.  (4)  An  operational 
tunneling  time  was  determined  from  measurements  of  current  rectification  in  a 
laser-illuminated  scanning  tunneling  microscope  ( STM )  [  1 3  ] .  However,  the  observed 
decrease  in  current  with  increasing  barrier  length  is  not  significantly  different  from 
the  exponential  dependence  of  the  transmission  coefficient  on  the  barrier  length, 
so  that  the  data  may  be  explained  without  reference  to  tunneling  times. 

Measurements  of  tunneling  times  are  needed  because  of  the  significance  of  this 
topic  and  the  present  lack  of  agreement  in  theoretical  analyses.  We  will  make  laser/ 
STM  experiments  that  are  based  on  modulation  of  the  barrier  height  by  the  electric 
field  of  light,  rather  than  current  rectification  used  in  the  earlier  studies  by  Nguyen 
et  al.  [13].  Theory  suggests  [14]  that  tunneling  has  two  distinct  regimes  in  respect 
to  the  modulation  frequency  with  a  crossover  where  the  angular  frequency  equals 
the  reciprocal  of  the  traversal  time. 

Method  of  Analysis 

We  model  tunneling  [  1 5  ]  as  a  “climb  over  the  barrier”  during  energy  fluctuations 
consistent  with  the  uncertainty  principle.  Cohen  postulated  [16]  that  the  probability 
of  a  fluctuation  decreases  exponentially  with  the  product  of  the  magnitude  and 
duration,  which  product  we  refer  to  as  the  action  of  a  fluctuation.  Cohen  did  not 
treat  tunneling  times,  and  considered  only  the  most  probable  fluctuations  (minimum 
action  permitting  tunneling),  thus  deriving  the  \v'kb  solution  for  an  opaque  barrier. 
We  have  shown  [15]  that  this  procedure  also  results  in  the  semiclassical  traversal 
time,  given  by  =  dVm/{2(Vo  -  E)) ,  where  m  and  E are  the  mass  and  energy 
of  the  particle  and  Vq  and  d  are  the  height  and  length  of  a  rectangular  barrier.  We 
follow  the  convention  of  calling  this  time  “semiclassical”  because  it  is  the  classical 
time  for  traversing  the  inverted  barrier  [14], 

While  the  result  is  by  no  means  unanimous,  a  number  of  other  procedures  in 
addition  to  considerations  of  energy  fluctuations  [15]  also  result  in  the  semiclassical 
traversal  time.  These  methods  include  { 1 ) barrier  modulation  [14],  (2)  modulating 
the  incident  wave  [17],  (3)  solutions  fora  sharp  wave  packet  [18],  (4)  flux-flux 
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correlation  factors  [19].  (5)  dissipation  by  a  string  [20],  (6)  dynamic  image  po¬ 
tentials  [ 2 1  ] ,  ( 7 )  temperature  effects  in  macroscopic  tunneling  [22  ] ,  ( 8 )  “bounce” 
time  for  macroscopic  tunneling  [23],  (9)  probability  amplitudes  for  momentum 
[24],  (10)  duality  with  ray  optics  [25],  and  (11)  calculations  with  path  in¬ 
tegrals  [26], 

We  make  the  approximation  of  using  a  rectangular  barrier  in  modeling  the  effect 
of  a  laser  on  the  current  in  an  stm  .  Others  have  suggested  several  more  elaborate 
expressions  for  the  potential  [  27  ] .  The  accuracy  of  these  expressions  may  be  ques¬ 
tioned  because  they  contain  appreciable  image  corrections,  and  it  appears  that  the 
use  of  static  or  dynamic  images  is  itself  dependent  upon  the  traversal  time  [2],  It 
is  common  to  approximate  an  arbitrary  barrier  by  an  “equivalent”  rectangular 
barrier  having  the  same  average  height  in  calculations  of  tunneling  current  [28], 
so  that  this  approximation  is  made  for  the  present. 

The  barrier  length  d  is  divided  into  A'  segments  of  length  dJN.  such  that  each 
part  is  small  enough  that  the  potential  is  approximately  a  constant  I  ' during  transit 
by  an  electron.  We  assume  that  an  energy  fluctuation  causes  the  particle  to  traverse 
each  segment,  and  set  the  magnitude  A£  =  T  -  £  +  mv~/2.  where  the  particle 
has  mass  i>t.  velocity  v.  and  unperturbed  energy  £.  Within  each  segment  we  consider 
only  the  most  probable  fluctuations,  those  with  the  least  action  permitting 
tunneling.  Thus,  within  each  segment  the  velocity  t'  equals  \2(V  -  E)/m.  the 
action  of  the  fluctuation  equals  -  £)/A'.  and  the  traversal  time  is  given 

by  the  semiclassical  value  T  =  d^m/(2{l'(,  -  E))/A’. 

For  each  chosen  set  of  the  parameters — wavelength,  barrier  length  and  height, 
and  level  of  modulation — a  numerical  simulation  is  made  for  M  different  values 
of  the  modulation  phase  at  which  the  electron  enters  the  barrier.  In  each  simulation 
the  values  of  T  and  A  are  calculated  within  each  segment  using  the  instantaneous 
value  of  the  modulated  potential  for  1 and  summed  to  deterrnine  T,  and  which 
are  the  traversal  time  and  action  for  traversing  the  entire  barrier.  The  relative  value 
of  the  transmission  probability  is  determined  in  each  of  the  M  calculations  by 
assuming  that  the  probability  of  the  fluctuation  is  proportional  to  expl-.-lv/ft) 
[15],  where  h  is  Planck's  constant.  The  relative  value  of  the  current  with  modulation 
is  then  found  by  dividing  the  mean  value  of  the  exponential  for  each  of  the  M 
calculations  by  the  value  of  the  exponential  for  no  modulation.  The  calculations 
are  repeated  with  larger  values  of  A'  and  A/  until  these  two  parameters  are  found 
to  have  a  negligible  effect  on  the  output.  The  results  of  several  simulations  are 
presented  in  the  following  section  of  this  paper. 

Results  of  Analysis 

Figure  1  shows  the  relative  current  as  a  function  of  the  wavelength  of  modulation 
for  4.0  eV  electrons  with  a  rectangular  barrier  having  a  length  of  6.0  A  and  a  height 
of  5.0 F.  The  three  levels  of  modulation  correspond  to  illumination  at  different 
power  densities.  For  modulation  at  long  wavelengths,  the  electron  traverses  an 
approximately  static  barrier  in  which  the  height  is  either  increased  or  decreased, 
depending  on  the  phase  at  the  time  of  entry.  Due  to  the  exponential  dependence 
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Figure  I  Relative  current  vs.  modulating  wavelength  for  4.0  eV  electrons  with  barrier 
length  =  6.0  A  and  hei^t  =  5.0  F. 


of  current  on  barrier  height,  these  effects  do  not  cancel.  Instead,  the  modulation 
increases  the  mean  value  of  the  current  at  long  wavelengths. 

For  modulation  at  short  wavelengths  the  height  of  the  barrier  is  alternately  in¬ 
creased  and  decreased  during  barrier  traversal.  Due  to  the  nonlinear  dependence 
of  current  on  barrier  height,  the  effects  of  the  intervals  with  increased  barrier  height 
dominate,  and  modulation  tends  to  decrease  the  mean  value  of  the  current  at  short 
wavelengths.  Since  there  is  a  distribution  of  traversal  times,  instead  of  the  definite 


Figure  2.  Relative  current  vs.  level  of  modulation  for  4.0  eV  electrons  with  barrier  height 
=  S.OFand  modulating  wavelength  =  670  nm. 
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values  of  time  implicit  in  other  analyses  [14]  the  transition  between  the  regimes 
for  long  and  short  wavelengths  in  Figure  1  is  quite  broad. 

Figure  2  shows  the  relative  current  as  a  function  of  the  level  of  modulation  for 
4.0  eV  electrons  with  a  barrier  height  of  5.0  F.  In  Figure  2  the  wavelength  of  mod¬ 
ulation  is  670  nm,  which  is  a  typical  value  for  visible  laser  diodes.  A  power  density 
of  100  W/cm',  providing  adequate  modulation,  may  be  obtained  with  a  20  mW 
laser  diode  focused  to  a  minimal  spot  size.  The  data  in  Figure  2  suggest  that,  for 
barrier  lengths  between  6  and  1 0  A,  the  current  decreases  as  the  power  density  of 
the  modulating  signal  is  increased. 

Description  of  Apparatus 

We  will  make  laser/STM  experiments  pursuant  to  determining  the  barrier  traversal 
time.  Initially  we  will  determine  the  dependence  of  current  on  power  density  when 
the  STM  junction  is  illuminated  with  a  670  nm  laser  diode,  and  the  results  will  be 
compared  with  simulations  such  as  that  shown  in  Figure  2.  At  a  later  point,  we 
will  measure  the  current  when  two  or  more  lasers  sequentially  illumine  the  junction 
of  an  STM,  and  look  for  the  transition  shown  in  Figure  1.  We  are  completing  a 
novel  STM  with  decreased  noise  and  increased  stability  in  order  to  perform  these 
experiments.  The  feedback  loop  used  to  maintain  a  constant  tunneling  current  will 
be  interrupted  for  a  brief  period  (typically  less  than  100  ms)  while  the  current  is 
recorded  and  the  laser  is  pulsed. 

We  use  a  lead  zirconate  titanate  piezoelectnc  ceramic  tube  as  an  electromechanical 
transducer  for  .v-v-r  fine  positioning  of  the  tip  and/or  sample,  as  is  generally  done 
in  an  STM  (for  reviews  of  STM,  see  Refs.  [29]).  The  piezoelectric  tubes  (ebl-3 
piezoelectric  actuators  [30])  have  a  length  of  1.5  cm,  diameter  of  0.64  cm,  and 
wall  thickness  of  0.056  cm.  The  maximum  recommended  dnve  is  530  V,  and  the 
computed  displacements  [3 1  ]  are  72  A/ V  in  z  and  84  A/ V  in  .v and  v.  The  quadrant 
electrodes  on  the  piezoelectric  tube  are  fed  with  balanced  x  and  supplies,  and  the 
inner  electrode  is  fed  from  an  unbalanced  r  supply  to  provide  orthogonal  fine 
positioning.  Images  will  be  taken  immediately  before  and  aftei  the  lasei  is  pulsed 
in  order  to  detect  possible  changes  in  position  or  structure. 

The  circuit  which  drives  the  piezoelectric  transducer  has  already  been  completed. 
This  circuit  changes  the  voltage  on  the  transducer  in  a  series  of  small  steps  that  are 
determined  by  the  sampled  tunneling  current  through  feedback.  The  potential  sup¬ 
plied  to  the  inner  electrode  of  the  transducer  must  have  a  range  of  +500  to  -500 
V  in  order  to  cause  displacements  that  exceed  the  resolution  of  the  associated 
mechanical  system  used  for  coarse  positioning.  The  driver  circuit  has  unusually 
stringent  requirements  regarding  precision,  regulation,  and  ripple.  Fluctuations  due 
to  ripple  or  instability  must  be  less  than  1  mV  to  limit  aberrant  motion.  The  driver 
circuit  contains  a  stable  high  voltage  amplifier  similar  to  that  described  by  Park 
and  Quate  [32].  which  has  two  stages  of  regulation. 

Others  have  used  a  variety  of  different  devices  [29]  for  coarse  positioning  for  the 
tip  and/or  sample.  We  use  a  differential  micrometer  with  a  crossed-roller  bearing 
translation  stage  (DM- 13  and  425-OMA  [33]).  The  differential  micrometer  has  a 
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fine  scale  with  graduations  of  1  fim, «  resolution  of  0.07  ^m,  and  a  backlash  of  0. 1 
;itm.  Thus,  off-the-shelf  instruments  are  used  to  obtain  a  resolution  much  finer  than 
the  limit  for  displacements  of  the  piezoelectric  transducer. 

A  low-noise  chopper  stabilized  operational  amplifier  [  34  ]  preamplifies  the  tun¬ 
neling  current  in  our  STM .  The  chopper  circuitry  repeatedly  nulls  the  input  offset 
voltage  to  correct  for  variations  due  to  time,  temperature,  common-mode  voltage, 
and  the  power  supply,  thus  providing  high  dc  precision  and  reducing  the  low- 
frequency  (1/f)  noise.  The  operational  amplifier  we  use  [34]  has  a  current  offset 
of  2.0  pA  and  an  equivalent  input  noise  current  of  4.0  fA/Hz.  The  tunneling  signal 
is  typically  on  the  order  of  1  nA.  with  an  impedance  of  a  few  MQ  [  32  ] .  Due  to  the 
high  source  impedance,  we  locate  the  preamplifier  adjacent  to  the  junction  of  the 
STM  to  reduce  the  capacitance  shunting  the  input  to  ground.  The  preamplifier  is 
followed  by  an  instrumentation  amplifier  located  inside  the  main  electronics  box. 
The  instrumentation  amplifier  provides  further  gain  with  a  very  high  common¬ 
mode  rejection  ratio. 

A  multifunction  I/O  board  [35]  is  installed  in  the  486  PC  dedicated  to  this 
project.  Sou  ware  for  data  acquisition,  feedback  control,  image  proces.sing.  and  gating 
of  the  laser  is  being  written  in  Lab  Windows.  The  output  of  the  instrumentation 
amplifier  is  connected  directly  to  the  I/O  board.  After  simulating  various  control 
algorithms,  we  have  chosen  to  use  adaptive  digital  incremental  feedback  control 
with  a  predictive  error  signal.  Incremental  algorithms  provide  finer  control  than 
direct  algorithms  and  also  prevent  integral  windup  [36].  The  predictive  error  is 
calculated  from  a  linear  regression  with  a  set  of  values  proportional  to  the  logarithm 
of  the  ratios  of  the  sampled  tunneling  currents  to  the  set  point  current.  Our  sim¬ 
ulations  suggest  that  the  use  of  logarithms  in  the  error  signal  causes  a  substantial 
increase  in  the  stability  of  feedback  control.  This  improvement  may  be  understood 
because  the  logarithm  of  the  tunneling  current  varies  linearly  with  the  length  of 
the  junction,  which  varies  linearly  with  the  voltage  to  the  piezoelectric  transducer, 
which  is  controlled  by  the  error  signal. 

Discussion  and  Conclusions 

Many  different  phenomena  occur  in  a  laser-illuminated  STM  [37]  in  addition  to 
the  effect  which  we  have  modeled.  These  effects  include  current  rectification,  photo- 
assisted  tunneling,  thermal-assisted  tunneling,  thermal  expansion,  and  the  direct 
interaction  of  tunneling  particles  with  modulation  quanta  [14].  In  our  initial  ex¬ 
periments  following  Figure  2,  the  use  of  a  visible  laser  will  deciease  the  effects  of 
current  rectification  [38].  The  relatively  low  power  densities  will  reduce  thermal 
effects  as  well  as  current  rectification,  but  all  of  these  phenomena  must  be  considered. 
Fortunately,  the  predicted  decrease  in  current  with  illumination  of  the  junction  is 
contrary  to  most  of  the  competing  phenomena. 

Experiments  made  later  to  look  for  the  transition  shown  in  Figure  1  would  be 
more  difficult  to  interpret  because  the  competing  phenomena  have  different  fre¬ 
quency  dependence.  However,  since  decreased  current  is  predicted  for  modulation 
at  shorter  wavelengths  (increased  quanta  size),  the  predicted  effect  differs  from 
most  of  the  competing  phenomena. 
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Abstract 

Manv  xcnobiotics  containing  N  or  S  heteroatoms  arc  metabolized  by  the  cytochrome  P450s.  leading 
to  a  variety  of  products  which  can  be  toxic,  carcinogenic,  or  detoxifying.  Thus,  it  is  important  to  try  to 
establish  molecular  cntena  that  modulate  competitive  product  formation  for  these  types  of  compounds 
In  the  absence  of  3D  structures  for  the  P450  isozymes  that  are  responsible  for  the  oxidations  of  N-  and 
S-containing  compounds,  we  have  focused  here  on  the  charactenzalion  and  identification  of  possible 
electronic  and  thermodynamic  factors  that  could  be  modulators  of  dillercnt  types  of  product  formation. 
Specifically,  the  competition  among  N-oxidation.  N-hydroxylation  and  C„-hydrovylation  for  three  amines 
were  examined  Similarly,  three  thioethers  were  studied  for  their  internal  competition  between  S-oxidation 
and  C„-hydro.xylation  The  results  obtained  indicate  that  the  stability  of  the  cation  radical  intermediate 
formed  by  the  one  electron  transfer  mechanism  is  not  a  determinant  of  differences  in  product  distnbution 
between  the  two  types  of  compounds.  Rather,  relative  product  stability  appears  to  be  a  significant  modulator 
of  product  distribution  explaining  why  S-oxide  formation  is  favored  over  N-oxidc  and  why  C„-hydrox- 
ylation  is  usually  favored  over  N-oxide  formation  in  amines  <  IW.t  John  Wiley  cS.  Sons.  Inc 

Introduction 

Many  nitrogen-  and  sulfur-containing  compounds  are  used  as  therapeutic  agents, 
insecticides,  or  industrial  substances.  When  metabolized  by  the  cytochrome  P450 
enzymes  [1],  responsible  for  the  initial  metabolism  of  many  xenobiotics,  these  N- 
and  S-containing  compounds  are  transformed  into  species  that  can  be  either  toxic, 
carcinogenic,  or  detoxifying  [2],  It  is  therefore  important  to  try  to  identify  molecular 
criteria  that  modulate  competitive  product  formation  for  these  types  of  compounds 
to  be  used  for  risk  assessment.  Ideally,  two  types  of  properties  should  be  considered; 
( 1 )  the  steric  interaction  between  the  substrate  and  the  enzyme  and  (2)  the  intrinsic 
electronic  properties  of  the  parent  substrate  compound  and  the  thermodynamics 
of  the  reaction  pathways  leading  to  its  products.  However,  in  the  absence  of  3D 
structures  of  the  P450  isozymes  that  are  responsible  for  the  oxidations  of  these 
compounds,  we  have  focused  here  on  the  second  type  only,  namely,  the  charac¬ 
terization  and  identification  of  possible  electronic  and  thermodynamic  factors  that 
could  be  modulators  of  different  types  of  product  formation. 

For  amines  and  thioethers.  two  qualitatively  different  P450-mediated  oxidations 
were  observed.  The  first  involves  an  oxidation  at  the  hetcroatom  site  which  converts 
the  substrate  into  its  corresponding  heteroatom  oxide  or  heteroatom  hydroxide. 
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The  second  one  is  C„-hydroxylation  which  is  usually  followed  by  dealkylation.  The 
reaction  mechanisms  and  factors  that  lead  to  these  competing  products  are  stilt  not 
very  well  characterized. 

Recently,  Guengerich  and  coworkers  proposed  a  unified  mechanism  of  many 
types  of  reaction  catalyzed  by  the  cytochrome  P450s  based  on  a  common  first  step 
of  a  one-electron  transfer  from  the  substrate  to  the  ferry  1  [FeO]  species  of  the  heme 
protein  [  3  ] .  This  mechanism  has  been  observed  in  studies  using  metalloporphyrin- 
catalyzed  model  systems  for  cytochrome  P450  monooxygeneses  [4].  For  N-  and 
S-containing  compounds,  it  leads  to  the  formation  of  a  heteroatom  cation  radical 
and  compound  II  of  P450  as  shown  below: 

R_X~R'+  [Fe  =  0)''  -*  [R— X— R’)^-+  [Fe=0)'''  (I) 

Compound  1  Compound  II 

Guengerich  et  al.  [  3  ]  have  suggested  that  this  cation  radical  may  be  the  common 
intermediate  for  both  heteroatom  oxygenation  and  C„-hydroxyIation. 

Another  frequently  proposed  pathway  for  C„-hydroxylation  is  an  initial  hydrogen 
abstraction  by  the  ferryl  oxygen  followed  by  fast  radical  recombination  [  5  ] .  For 
primary  and  secondary  amines,  hydrogen  abstraction  at  the  nitrogen  atom  site  is 
also  possible,  leading  to  N-hydroxide  products  [6]; 

H  OH 

I  F«!:0  Fe=OH  I 

R— X— C  R'  — *  R— X— C„— R' - -  R— X— C„— R" 

H  OH 

I  F«=0  F«=OH  I 

R— N— R'  —  R— N— R’ - ►  R— N— R' 

Determination  of  which  of  these  proposed  mechanisms  is  preferred  by  purely  ex¬ 
perimental  methods  is  very  difficult  due  to  the  transient  nature  of  the  different 
proposed  intermediates.  On  the  other  hand,  theoretical  methods  can  be  very  useful 
in  characterizing  and  comparing  these  different  reaction  pathways  since  they  can 
include  descriptions  of  these  transient  intermediates. 

From  earlier  studies  [  7  ]  of  the  electronic  structure  of  the  [  Fe — O  ]  prosthetic 
moiety  of  P450  in  its  active  state,  the  ground  state  of  compound  I  was  found  to  be 
a  quartet  with  two  unpaired  electrons  localizing  partly  on  the  2P^  and  2P,  orbitals 
of  the  oxygen  atom.  A  triplet  oxene  species,  0(^P),  is  the  simplest  model  capable 
of  representing  the  triplet  character  of  the  [Fe — O)  moiety,  and  was  found  to  cor¬ 
rectly  describe  many  reactions  catalyzed  by  P450s  [8].  Therefore,  in  this  study, 
0(  V)  was  used  in  model  oxidation  reaction  of  three  amines  and  three  thioethers 
that  weie  characterized  by  ab  initio  quantum  chemistry  calculations  of  the  reactants, 
the  radical  intermediates  and  the  oxidation  products.  Figure  1  shows  the  three 
amines  (methyl  amine,  methyl  ethyl  amine,  and  A'-methyl  aniline)  and  the  three 
thioethers  ( methanethiol,  methyl  ethyl  thioether,  and  thioanisole)  chosen  in  this 
investigation. 
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Figure  I .  Structures  of  the  three  amiries  and  the  three  thioethcrs  chosen  for  this  study. 


The  goals  of  this  study  are:  (I)  to  identi*’^  i:  e  origin  of  the  observed  pref¬ 
erence  for  heteroatorn  oxide  formation  in  S-containing  compounds  compared 
to  N-containing  compounds,  (II)  to  characterize  the  internal  competition  of 
different  oxidation  inactions  in  amines  to  explain  why  C„>hydroxylation-KleaI- 
kyiation  is  usually  preferred,  and  (III)  to  characterize  the  internal  competition 
of  reactions  in  thioethers  to  explain  why  sulfoxide  formation  is  usually 
preferred. 
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Methods 

The  Gaussian  92  program  [9]  was  used  for  all  calculations.  For  each  species,  the 
fully  optimized  geometry  in  its  ground  electronic  state  was  obtained  using  standard 
energy  gradient  techniques.  A  3-21G*  basis  set  with  an  extra  d  function  for  the  N 
atom  (a  =  0.8)  and  a  6-31G*  basis  set  were  used  in  these  calculations.  Restricted 
and  unrestricted  Hartree-Fock  methods  were  used  for  closed  shell  and  open  shell 
species,  respectively.  If  nonnegligible  spin  contamination  was  found  for  any  open 
shell  species,  a  calculation  using  restricted  open  shell  Hartree-Fock  method  (ROHF) 
and  the  6-3 IG*  basis  set  was  performed.  The  computations  for  all  compounds 
were  performed  at  the  SCF  levels  of  theory,  i.e.,  SCF/3-21G*  (Nd  =  0.8)  and  SCF/ 
6-3 IG*.  For  methyl  ethyl  amine  and  methyl  ethyl  thioether,  both  the  trans  and 
gauche  C — X — C — X  ( X  =  N  or  S )  conformations  were  minimized  at  the  SCF/  3- 
21G*  levels  and  the  more  stable  conformation  was  then  chosen  for  subsequent 
studies.  For  A-methyl  aniline  and  thioanisole.  the  rotational  barrier  about  the  Q — 
N  and  the  C* — S  ( 0  represents  the  phenyl  ring)  bonds  were  examined  by  studying 
the  energies  of  the  perpendicular  and  the  planar  side  chain  conformations  using 
both  3-21G*  +  Nd  and  the  6-31G*  basis  sets.  The  conformation  with  the  lower 
energy  in  each  molecule  was  used  as  the  parent  compound  in  this  study. 

For  each  of  the  three  amines,  the  three  types  of  radical  intermediates,  i.e.,  the 
N-cation  radical,  C„-radical,  and  N-radical.  and  the  three  reaction  products,  N- 
oxide,  Ca-hydroxide,  and  N-hydroxide,  were  calculated.  For  methyl  ethyl  amine, 
there  are  two  possible  Ca-hydroxylation  sites;  the  primary  carbon  of  the  methyl 
group  and  the  secondary  carbon  of  the  ethyl  group.  Therefore,  both  of  them  were 
considered  in  this  investigation.  For  the  three  thioethers,  the  S-cation  radical  and 
the  C„-radical  along  with  their  oxidation  products,  i.e.,  sulfoxide  and  Cc,-hydroxide 
were  characterized.  Again,  both  C„-hydroxylation  sites  in  methyl  ethyl  thioether 
were  considered. 


Results  and  Discussion 

Tables  I  and  11  list  the  calculated  energies  of  all  N-containing  and  S-containing 
species,  respectively.  Comparing  the  energies  of  the  two  conformations  studied  for 
methyl  ethyl  amine,  the  trans  conformer  (as  shown  in  Fig.  1 )  was  found  to  be 
slightly  lower  by  1.0  kcal/mol  than  the  gauche  conformer  from  the  3-21G*  -t-  Nd 
calculation.  Similarly,  for  methyl  ethyl  thioether,  the  energy  of  the  trans  conformer 
is  lower  than  the  gauche  conformer  by  0.2  kcal/mol.  Therefore,  the  trans  confor¬ 
mation  was  used  for  these  two  parent  compounds,  and  all  intermediates  and  oxi¬ 
dation  products  derived  from  the  trans  conformer  were  calculated. 

For  A-methyl  aniline,  both  3-2 IG*  +  Nd  and  6-3 IG*  predict  a  planar  — 

— N — Cm  (where  C«  indicates  a  C  atom  in  the  phenyl  ring,  while  Cm  is  the 
methyl  carbon  atom)  conformation  as  the  lowest  energy  conformer.  The  rotational 
barrier  from  planar  to  perpendicular  conformation  is  found  to  be  0.3  kcal/mol 
from  3-2 IG*  +  Nd  and  1.6  kcal/mol  from  6-3 IG*  calculation,  respectively.  On 
the  contrary,  for  thioanisole,  the  perpendicular  conformation  was  found  to  be  en¬ 
ergetically  more  favorable  by  1.7  and  1.2  kcal/mol  using  the  two  levels  of  calcu- 
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Table  I  Absolute  energies  of  molecular  species  involving  in  the 
oxidation  of  N-containing  compounds.* 


Molecule 

3-2 IG*  +  Nd(0.8) 

6-3 IG* 

0('P) 

-74.393657 

-74.783934 

-74.778966" 

o- 

-74.271936 

-74.718701 

OH 

-74.970228 

-75.382275 

-75.378369" 

CHjNHj 

-94.720975 

-95.209829 

CH,N’H2 

-94.452862 

-94.932226 

CH2NH2 

-94.101057 

-94.586733 

CHjNH 

-94.106132 

-94.591912 

CH3N(0)H2 

-169.078832 

-169.973532 

CHjNH(OH) 

-169.119729 

-170.011578 

CH2(0H)NH2 

-169.164438 

-170.068356 

CH3NHCH2CH3  (gauche) 

-172.356145 

CH3NHCH2CH3  (irons) 

-172.357785 

-173.276489 

CH3N"HCH2CHj 

-172.117563 

-173.027538 

CH3NHCHCH3 

-171  738334 

-172.655294 

CH2NHCH2CHJ 

-171.737029 

-172.652886 

CH3NCH2CH3 

-171.748831 

-172  665531 

CH3NH(0)CH2CH} 

-246.729561 

-248.050747 

CH3N(0H)CH2CH3 

-246.755815 

-248.0768.32 

CHjNHCHfOHlCHj 

-246.805451 

-248.136431 

CHjfOHlNHCHjCHj 

-246.799194 

-248.133485 

Phe-NH-CH3  (90°) 

-322.988352 

-324.753486 

Phe-NH  CHjflSO") 

-322.988887 

-324.756110 

Phe-N^H-CHj 

-324.522313" 

Phe-NH-CH2 

-324.129758" 

Phe-N-CHj 

-324.140495" 

Phe-NH(0)-CH3 

-397  352822 

-399.519511 

Phe-N(OH)-CH, 

-397  390592 

-399.555674 

Phe-NH-CH2(OH) 

-397.432723 

-399.613990 

*  If  not  noted,  rhf  and  uhf  were  used  for  closed  shell  and  open 
shell  species. 

*’  From  an  ROHF  calculation. 


lations.  Therefore,  the  parent  compound  of  a  planar  conformation  was  used  for  N- 
methyl  aniline  while  a  perpendicular  conformation  was  used  for  thioanisole. 

The  calculated  expectation  values  of  S'  are  between  0.75  and  0.77  using  the  uhf 
method  for  all  radical  intermediates  except  those  derived  from  A'^-methyl  aniline. 
Large  spin  contaminations  ((5^>  >  0.8)  were  found  for  its  N-cation  radical.  Co- 
radical  and  N-radical.  Therefore,  rohf  and  6-3 1 G  *  basis  set  were  used  to  reoptimize 
these  three  radical  species.  Their  minimized  energy  are  listed  in  Table  I. 
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Table  II.  Absolute  energies  of  molecular  species  involving  in  the 
oxidation  of  S-containing  compounds  * 


Molecule 

3-2 1G° 

6-3 1G» 

CHjSH 

-435.637275 

-437.700320 

CHjS'H 

-435.327695 

-437.393159 

CHjSH 

-435.01 1752 

-437.072347 

CHjStOIH 

-510.010483 

-512.488249 

CHitOHjSH 

-510.065842 

-512.548412 

CHjSCH-CHj  (gauche) 

-513.276171 

CH3SCH2CH3  (irons) 

-513.276465 

-515.770527 

CH3STH2CH3 

-512.999688 

-515.495419 

CH,SCHCH3 

-512.654411 

-515.147053 

CH2SCH2CH, 

-512.546506 

-515  143483 

CH3S(0)CH2CH3 

-587.668361 

-590.573572 

CH3SCH(0H)CH3 

-587.713438 

-590.623479 

CH2(0H)SCH:CH3 

-587.701380 

-590.614460 

Phe-S-CH3  (90°) 

-663.904681 

-667.245282 

Phe-S-CHidSO") 

-663.902025 

-667.243374 

Phe-S'-CH.IOO”) 

-663.625604 

-666.968958 

Phe-S-CHj  (90°) 

-663.279328 

-666.617558 

Phe-S(0)-CH3 

-738.299270 

-742.047557 

Phe-S-CH2(OH)  (~90°) 

-738.331089 

-742.090428 

•  RHF  and  UHF  methods  were  used  for  closed  shell  and  open  shell 
species,  respectively.  Energies  are  in  hartrees. 


Origin  of  Observed  Preference  for  Heteroalom  Oxide  Formation:  S  >  N 

As  proposed  by  Guengerich  et  al.  [3],  heteroatom  oxide  is  formed  via  an  initial 
electron  transfer  from  the  substrate  to  the  [FeO]  moiety  of  P450,  followed  by  an 
oxygen  transfer  in  the  opposite  direction.  From  energetic  consideration,  whether 
or  not  the  electron  transfer  occurs  is  determined  by  the  free  energy  change  associated 
with  Eq.  ( 1 ).  An  estimate  of  this  change  can  be  made  if  the  ionization  potential  of 
the  substrate  R — X — R'  and  the  electron  affinity  of  P450  compound  I  are  known. 
However,  there  has  been  no  report  on  the  magnitude  of  the  affinity  of  compound 
I,  and  the  stability  of  the  cation  radical  relative  to  its  parent  molecule  [i.e.,  the 
ionization  potential  (IP)]  is  used  as  a  measure  of  the  relative  tendency  of  various 
substrates  to  be  oxidized  via  the  one  electron  transfer  reaction  [3].  To  this  end, 
the  ionization  potential  of  each  of  the  six  parent  compounds  was  calculated  and 
the  results  are  shown  in  Table  III(A).  Comparison  between  the  amines  and  the 
corresponding  thioethers  with  similar  structures  shows  that  the  IPs  fcr  the  amines 
are  lower  than  those  of  the  thioethers.  For  example,  from  6-3 IG*  calculation,  ip 
is  174.2  kcal/mol  for  methyl  amine  and  192.7  kcal/mol  for  methanethiol.  The  IP 
for  methyl  ethyl  thioether  is  16.4  kcal/mol  higher  than  that  of  methyl  ethyl  amine. 
Similarly,  the  IP  for  thioanisole  ( 173.4  from  UHF)  is  also  higher  than  that  of  the 
N-methyl  aniline  ( 146.7  from  rohf).  Based  on  these  results,  we  concluded:  (1) 
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I  Aiii  I  III.  (A  I  loni/iition  potentials  orheteroatom-containing  parent  compounds  obtained 
from  the  energy  dillerente  between  neutral  and  cationic  species.  (B)  Fnergies  of  heteroatom 
oxide  products  relative  to  the  parent  compound  and  0(’P).* 


3-2 IG* 

6-3 IG* 

(A)  loni/ation  potential 

riiiNHi 

I6K.2 

174.2 

ni,sii 

194.3 

192.7 

CM, PH: 

180.9 

CH,NIK';n, 

ISO.T 

156.0 

(•||,SC'I1:CH, 

173.7 

172.4 

Phc-NHCHj 

146.7'’ 

Phc-.S-CII, 

175.1 

173.4 

(B)  Itclaiivc  energy  of  hcleroalom  oxide 
prcxlucls 

CM, Nil. 

22.5 

12.7 

riljSII 

12.8 

-2.5 

CM, PM: 

-56.9 

ClliNMCjH,, 

1.3.7 

6.1 

CHvSCH.-CM.,  ' 

l.l 

-12.0 

Phe-NHCM, 

18.7 

12.9 

Phe-.S-CHj 

0.6 

-11.5 

*  All  energies  arc  in  kcal/mol. 
From  an  Koiir  calculation. 


In  order  for  all  six  substrates  to  undergo  one-clcctron  transfer  mechanism,  the 
electron  affinity  of  P450  compound  I  must  be  at  least  150-190  kcal/mol.  (2)  The 
lower  ionization  potentials  of  amine  compounds  compared  to  the  thioether  ana¬ 
logues  imply  a  higher  probability  of  one-electron  oxidation  in  amines  than  in 
ihioethers.  If  the  production  of  the  cation  radical  is  the  rate-limiting  step  in  the 
hetcroatom  oxide  formation,  these  results  indicate  a  higher  preference  of  oxide 
formation  in  amines  than  in  thioethers.  This  inference  is  in  contradiction  with  the 
observed  preference  of  heteroatom  oxide  formation  in  S-containing  compounds 
compared  to  N-containing  compounds.  Therefore,  the  observed  preference  can  be 
explained  in  two  ways:  (i)  the  oxide  formation  docs  not  proceed  via  the  one-electron 
transfer  mechanism  or  ( ii )  the  formation  of  the  cation  radical  is  not  the  rate-limiting 
step  during  the  enzymatic  cycle  of  this  mechanism.  Without  knowledge  of  the 
electron  affinity  of  P450  compound  I,  needed  to  assess  the  possibility  of  electron 
transfer  reactions  or  of  the  detailed  rate  constant  for  each  enzymatic  step,  the  only 
conclusion  that  can  be  reached  here  and  the  one  most  relevant  for  our  goals  is  that 
the  relative  stability  of  the  cation  radical  cannot  be  used  as  a  modulator  of  the 
preference  of  heteroatom  oxide  formation. 

Another  possible  mechanism  for  the  heteroatom  oxide  formation  is  a  one-step 
direct  oxygen  atom  transfer  without  the  formation  of  the  intermediate  cation  radical. 
Based  on  this  mechanism,  in  Table  IIK  B)  a  comparison  of  the  heteroatom  oxide 
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stabilities  of  thv,  amines  with  those  of  the  thioethers  was  made  in  order  to  examine 
if  product  stability  is  a  possible  modulator  of  the  observed  preference  of  heteroatom 
oxide  formation  in  S-containing  compounds  than  in  N-containing  compounds.  For 
each  of  the  three  pairs  of  compounds,  the  S-oxide  is,  in  each  case,  more  stable  than 
the  corresponding  N-oxide.  For  example,  from  6-3 IG*  calculation,  the  difference 
in  stability  between  the  oxides  of  methyl  amine  and  methanethiol  is  15.2  kcal/ 
mol.  Similarly,  the  differences  for  the  methyl  ethyl  amine  and  methyl  ethyl  thioether 
are  18.1  and  24.4  kcal/mol  for  the  two  aryl  analogs.  These  results  demonstrate 
that  the  relative  heteroatom  oxide  stability  is  a  very  good  indicator  of  the  observed 
preference  of  S-containing  compounds  for  heteroatom  oxide  formation  compared 
to  N-containing  compounds. 

To  further  explore  the  reliability  of  heteroatom  oxide  stability,  rather  than  the 
cation  radical  stability,  as  a  good  indicator  of  the  preference  of  heteroatom  oxide 
formation  in  various  compounds,  we  also  calculated  the  energy  of  methyl  phosphine, 
its  cation  radical,  and  methyl  phosphine  oxide  using  the  6-310*  basis  set.  P450- 
mediated  oxidations  of  phosphorous  containing  compounds  lead  to  a  greater  pref¬ 
erence  for  heteroatom  oxide  formation  than  in  either  N-  or  S-containing  compounds. 
As  shown  in  Table  III(A),  the  rank  order  of  IP  for  the  N-,  S-,  and  P-containing 
cation  radicals,  S  >  P  >  N,  does  not  parallel  their  preference  for  heteroatom  oxide 
formation  P  >  S  >  N.  The  result  provides  further  evidence  that  the  relative  cation 
radical  stability  is  not  a  determinant  of  heteroatom  oxide  formation.  Instead,  the 
rank  order  of  the  stability  of  heteroatom  oxide  product  shown  in  Table  111(B), 
P  >  S  >  N,  is  consistent  with  preference  for  heteroatom  oxide  formation  and 
reinforces  our  conclusion  that  relative  oxide  stability  is  a  significant  modulator  of 
P450-mediated  oxide  formation. 

Internal  Competition  in  Formation  of  Three  P450  Mediated  Oxidation 
Products  of  Amine  Substrates 

To  identify  reliable  determinants  of  product  distribution  in  cytochrome  P450- 
mediated  oxidations  in  amines,  plausible  pathways  to  the  three  types  of  products, 
C„-hydroxides,  N-hydroxides,  and  N-oxides,  were  considered.  Specifically,  the  for¬ 
mation  of  C„-hydroxides  and  N-hydroxides  was  assumed  to  proceed  via  an  initial 
hydrogen  atom  abstraction.  The  formation  of  N-oxide  was  assumed  to  proceed  via 
a  one-step  direct  oxygen  transfer  mechanism  due  to  its  success  in  explaining  the 
preference  of  oxide  formation  between  N-  and  S-compounds.  The  stabilities  of  the 
C„-radical,  the  N-radical  intermediates  in  the  proposed  H-abstraction  mechanism 
and  the  N-oxidc  product  were  hence  calculated.  The  results  are  shown  in  Table  IV 
for  the  three  amine  substrates  studied.  For  all  three  compounds,  the  C„-radical  is 
less  stable  than  both  the  N-radical  intermediate  and  the  N — O  product  by  a  few 
kcal/mol,  a  result  that  is  in  contrast  to  observed  preference  of  Ca-hydroxylation- 
dealkylation  in  amines.  Therefore,  the  relative  stability  of  competing  C„-  and  N- 
radical  intermediates  is  not  a  reliable  detei-minant  of  different  product  formation. 

Comparison  of  the  stability  of  the  three  oxidation  products  also  shown  in  Table 
IV  gives  the  order  Ca-hydroxide  >  N-hydroxide  >  N-oxide  in  all  three  amines.  The 
formation  of  the  C„-hydroxides  and  the  N-hydroxides  are  both  exothermic  while 
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Table  IV.  Relative  reaction  energies  (kcal/mol)  for  N-containing  compounds. 


3-2  IG» 

6-3 IG* 

Reactants 

CH3NH2  +  0(^P) 

0.0 

0.0 

C„-H  intermediates 

CH2NH2  +  OH 

27.2 

15.5 

N-OH  intermediates 

CHjNH  +  OH 

24.0 

12.3 

N-0  product 

CHjNlOIH- 

22.5 

12  7 

N-OH  .  oduct 

CHjNfOHIH 

-3.2 

-11.2 

C„-OH  product 

CH2(0H)NH2 

-31.3 

-46.8 

Reactants 

CH3NHC2H5  +  0(^P) 

0.0 

0.0 

C.-OH  intermediates 

CHjNHCHCH,  +  OH 

26.9 

14.4 

C„-OH  intermediates 

CH2NHCH2CHJ  OH 

27.7 

15.9 

N-OH  intermediates 

CH3NC2H5  +  OH 

20.3 

9.5 

N-0  product 

CH3NH(0)C2H5 

13.7 

6.1 

N-OH  product 

CH3N(0H)CH2CH3 

-2.7 

-10.3 

C„-OH  product 

CH3NHCH{0H)CH3 

-33.9 

-47.7 

C„-OH  product 

CH2(OH)NHC2Hj 

-30.0 

-45.8 

Reactants 

Phe-NHCHj  +  0(’P) 

0.0 

0.0 

C«-OH  intermediates 

Phe-NHCH2  +  OH 

16.9* 

N-OH  intermediates 

Phe-NCHj  +  OH 

10.2* 

N-0  product 

Phe-NH(0)CH3 

18.7 

12.9 

N-OH  product 

Phe-N(OH)CH, 

-5.05 

-9.8 

C„-OH  product 

Phe-NHCH2(OH) 

-31.5 

-46.4 

•  From  an  rohf  calculation. 


the  production  of  N-oxide  is  endothermic.  The  large  energy  difference  of  ~50- 
~60  kcal/mol  obtained  from  6-3 IG*  between  the  N-oxide  and  the  C„-hydroxide 
in  the  three  amines  suggests  that  the  formation  of  Ca-hydroxy  amine  is  energetically 
much  more  favorable.  This  result  agrees  with  the  observation  by  Guengerich  et  al. 
[  3  ]  and  others  [10]  that  C„-hydroxylation-dealkylation  is  the  predominant  oxidation 
reaction  in  amines.  Thus  it  appears  that  the  relative  stabilities  of  the  different  prod¬ 
ucts  modulate  the  competition  among  the  various  reactions.  Among  the  two  het¬ 
eroatom  oxygenations,  N — OH  was  found  to  be  ~35  kcal/mol  more  stable  than 
N — O  product  and  would  therefore  be  predicted  to  be  the  next  most  abundant 
product. 

Internal  Competition  in  Formation  of  Two  P450  Mediated  Oxidation 
Products  of  Thioether  Substrates 

To  identify  reliable  modulators  of  cytochrome  P450-mediated  oxidations  of  sul¬ 
fur-containing  compounds,  as  for  the  N-containing  compounds,  it  was  assumed 
that  Ca-hydroxylation  proceeds  via  H-abstraction  and  sulfoxide  formation  via  direct 
oxygen  transfer.  The  stability  of  the  C„-radical  and  OH  radical  intermediates  for 
each  of  the  three  substrates  was  calculated  as  shown  in  Table  V,  and  compared 


824 


LOEW  AND  CHANG 


Table  V  Relative  reaction  energies  (kcal/mol)  for  S-containing  compounds. 


3-2 1G‘ 

6-3 IG* 

Reactants 

CHjSH  +  CK’P) 

0.0 

0.0 

C„-OH  intermediates 

CHjSH  +  OH 

30.7 

18.6 

S-0  product 

CH3S(0)H 

12.8 

-2.5 

C„-OH  product 

CH2(0H)SH 

-21.9 

-40.2 

Reactants 

CHjSCjHj  +  0(^P) 

0.0 

0.0 

C„-OH  intermediate^ 

CHjSCHCH,  +  OH 

28.5 

15.8 

Co-OH  intermediates 

CHjSCHjCHj  +  OH 

30.6 

18.8 

S-O  product 

CHjStOICjHj 

l.l 

-12.0 

C„-OH  product 

CH3SCH(0H)CH, 

-27.2 

-43.3 

C„-OH  product 

CH2(0H)SCH2CH3 

-19.6 

-37.8 

Reactant 

Phe-SCHj  +  0(^P) 

0.0 

0.0 

C„-OH  intermediates 

Phe-SCHj  +  OH 

30.6 

18.4 

S-0  product 

Phe-S(0)CH3 

-0  6 

-11.5 

C,-OH  product 

Phe-SCHjtOH) 

-20.6 

-38.4 

with  the  stability  of  sulfoxide.  In  contrast  to  the  results  for  the  N-containing  com¬ 
pounds,  for  all  three  thioethers  the  sulfoxide  is  more  stable  than  the  intermediates 
Co-radical  and  OH  radical  by  ~20-~30  kcal/mol.  Thus,  if  the  formation  of  the 
C„-radical  and  OH  radical  is  the  rate  limiting  step  in  Co-hydroxylation  for  thioethers, 
more  S-oxide  than  C„-hydroxide  product  is  predicted,  which  is  consistent  with 
observations. 

To  determine  the  uniqueness  of  this  inference,  a  second  comparison  was  made 
of  the  product  stabilities,  as  shown  in  Table  V.  For  all  three  S-containing  parent 
compounds,  the  C„-hydroxylation  product  was  found  to  be  energetically  more  stable 
than  the  sulfoxide  by  ~25 — 38  kcal/mol.  Thus,  if  the  relative  product  stability 
is  used  as  a  criterion  for  predicting  product  distribution,  it  leads  to  the  prediction 
that  the  C„-hydroxide  is  the  predominant  product  in  all  three  thioethers,  a  result 
which  contradicts  known  experimental  works  of  a  few  selected  thioethers  [11]. 
Therefore,  it  can  be  concluded  that  the  relative  stability  between  the  radical  inter¬ 
mediates  and  the  sulfoxide,  rather  than  the  final  oxidation  products,  is  an  important 
factor  of  product  distribution  of  thioethers. 


Conclusion 

Ab  initio  quantum  chemistry  calculations  were  performed  for  N-  and  S-containing 
substrates  in  order  to  identify  reliable  electronic  and  thermodynamic  criteria  for 
P450  mediated  product  formation.  The  relative  stability  of  N-  vs.  S-oxide  products 
rather  than  that  of  the  corresponding  cation  radicals  was  found  to  be  a  significant 
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modulator  explaining  the  observed  preference  of  oxide  formation  in  S-containing 
compounds  compared  to  N-containing  compounds. 

Relative  product  stability  also  appears  to  be  a  significant  modulator  of  internal 
competition  of  the  three  oxidation  products  of  amines.  For  the  three  amines  studied, 
the  relative  product  stabilities  are  C„-hydroxide  >  N-hydroxide  >  N-oxide.  paral¬ 
leling  the  observed  product  distribution.  By  contrast,  for  the  three  thioethers.  the 
energy  difference  between  the  C„-radical  intermediates  in  the  hydroxylation  reaction 
and  the  sulfoxide  products  appears  to  be  an  important  factor  in  determining  the 
preference  of  sulfoxide  formation  over  C„-hydroxylation.  The  difference  in  mod¬ 
ulators  found  for  the  oxidations  of  N-  and  S-containing  compounds  could  be  due 
to  the  difference  in  the  rate  determining  step  for  hydroxylation  because  of  the 
different  thermodynamic  stabilities  of  their  radical  intermediates  and  the  oxidation 
products. 

Although  useful  electronic  and  thermodynamic  properties  have  emerged  for  as¬ 
sessing  qualitatively  different  P450-mediated  oxidation  product  formation  in  N- 
and  S-containing  substrates,  the  current  investigation  is  not  definitive  for  explaining 
all  product  distributions  for  any  N-  or  S-containing  compounds.  Among  the  res¬ 
ervations  are  that  the  triplet  oxygen  atom  model,  though  successful  in  studies  of 
many  other  P450  catalyzed  oxidation  reactions  [8],  may  not  be  a  sufficient  model 
for  comparisons  among  all  types  of  N-  and  S-containing  compounds.  Ideally,  the 
complete  oxo-iron-porphyrin  complex  should  be  included  in  the  calculation  of 
the  reaction  paths  and  products.  However,  such  a  calculation  is  formidable  at  the 
present  time  due  to  the  large  demand  of  the  CPU  and  memory  for  such  a  large 
system.  Another  concern  is  that  the  reaction  mechanisms  can  be  substrate  depen¬ 
dent.  For  example,  although  the  one-electron  transfer  mechanism  for  the  six  com¬ 
pounds  studied  here  needs  not  be  invoked  to  explain  their  difference  in  product 
distnbution,  there  is  experimental  evidence  that  this  mechanism  could  occur  for 
substrates  which  can  stabilize  the  cation  radical  through  extensive  delocalization 
[12],  Finally,  the  possible  modulation  of  product  distribution  from  steric  interactions 
between  substrate  and  P450  could  not  be  investigated  due  to  the  lack  of  the  3D 
structures  of  the  P450  isozymes  responsible  for  the  metabolism  of  the  N-  and  S- 
containing  compounds.  To  this  end.  the  experimental  determination  through  X- 
ray  crystallography  or  nmr  studies  and  the  theoretical  predictions  by  computer 
modeling  of  the  3D  structures  of  the  relevant  P450  isozymes  would  be  very  helpful. 
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